-
Wentworth Hauge posted an update 1 day, 7 hours ago
Accumulating evidences have shown that the deregulation of circRNA has close association with many human cancers. However, these experimental verified circRNA-cancer associations are not collected in any database. Here, we develop a manually curated database (circR2Cancer) that provides experimentally supported associations between circRNAs and cancers. The current version of the circR2Cancer contains 1439 associations between 1135 circRNAs and 82 cancers by extracting data from existing literatures and databases. In addition, circR2Cancer contains the information of cancer exacted from Disease Ontology and basic biological information of circRNAs from circBase. At the same time, circR2Cancer provides a simple and friendly interface for users to conveniently browse, search and download the data. It will be a useful and valuable resource for researchers to understanding the regulation mechanism of circRNA in cancers.
http//www.biobdlab.cn8000.
http//www.biobdlab.cn8000.The ability to compare entities within a knowledge graph is a cornerstone technique for several applications, ranging from the integration of heterogeneous data to machine learning. It is of particular importance in the biomedical domain, where semantic similarity can be applied to the prediction of protein-protein interactions, associations between diseases and genes, cellular localization of proteins, among others. In selleck compound , several knowledge graph-based semantic similarity measures have been developed, but building a gold standard data set to support their evaluation is non-trivial. We present a collection of 21 benchmark data sets that aim at circumventing the difficulties in building benchmarks for large biomedical knowledge graphs by exploiting proxies for biomedical entity similarity. These data sets include data from two successful biomedical ontologies, Gene Ontology and Human Phenotype Ontology, and explore proxy similarities calculated based on protein sequence similarity, protein family similarity, protein-protein interactions and phenotype-based gene similarity. Data sets have varying sizes and cover four different species at different levels of annotation completion. For each data set, we also provide semantic similarity computations with state-of-the-art representative measures. Database URL https//github.com/liseda-lab/kgsim-benchmark.Publicly available genetic databases promote data sharing and fuel scientific discoveries for the prevention, treatment and management of disease. In 2018, we built Color Data, a user-friendly, open access database containing genotypic and self-reported phenotypic information from 50 000 individuals who were sequenced for 30 genes associated with hereditary cancer. In a continued effort to promote access to these types of data, we launched Color Data v2, an updated version of the Color Data database. This new release includes additional clinical genetic testing results from more than 18 000 individuals who were sequenced for 30 genes associated with hereditary cardiovascular conditions as well as polygenic risk scores for breast cancer, coronary artery disease and atrial fibrillation. In addition, we used self-reported phenotypic information to implement the following four clinical risk models Gail Model for 5-year risk of breast cancer, Claus Model for lifetime risk of breast cancer, simple office-based Framingham Coronary Heart Disease Risk Score for 10-year risk of coronary heart disease and CHARGE-AF simple score for 5-year risk of atrial fibrillation. These new features and capabilities are highlighted through two sample queries in the database. We hope that the broad dissemination of these data will help researchers continue to explore genotype-phenotype correlations and identify novel variants for functional analysis, enabling scientific discoveries in the field of population genomics. Database URL https//data.color.com/.Species checklists are a crucial source of information for research and policy. Unfortunately, many traditional species checklists vary wildly in their content, format, availability and maintenance. #link# The fact that these are not open, findable, accessible, interoperable and reusable (FAIR) severely hampers fast and efficient information flow to policy and decision-making that are required to tackle the current biodiversity crisis. Here, we propose a reproducible, semi-automated workflow to transform traditional checklist data into a FAIR and open species registry. We showcase our workflow by applying it to the publication of the Manual of Alien Plants, a species checklist specifically developed for the Tracking Invasive Alien Species (TrIAS) project. Our approach combines source data management, reproducible data transformation to Darwin Core using R, version control, data documentation and publication to the Global Biodiversity Information Facility (GBIF). This checklist publication workflow is openly available for data holders and applicable to species registries varying in thematic, taxonomic or geographical scope and could serve as an important tool to open up research and strengthen environmental decision-making.
Although adolescent dietary patterns tend to be of poor quality, it is unclear whether dietary patterns established in adolescence persist into adulthood.
We examined trajectories across adolescence and early adulthood for 2 major dietary patterns and their associations with childhood and parental factors.
Using data from the Western Australian Pregnancy Cohort (Raine Study), intakes of 38 food groups were estimated at ages 14, 17, 20 and 22 y in 1414 participants using evaluated FFQs. Using factor analysis, 2 major dietary patterns (healthy and Western) were consistently identified across follow-ups. Sex-specific group-based modeling assessed the variation in individual dietary pattern z scores to identify group trajectories for each pattern between ages 14 and 22 y and to assess their associations with childhood and parental factors.
Two major trajectory groups were identified for each pattern. Between ages 14 and 22 y, a majority of the cohort (70% males, 73% females) formed a trajectory group with consistently low z scores for the healthy dietary pattern.