Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Drabkin HJ, Blake JA. Manual Gene Ontology annotation workflow at the Mouse Genome Informatics Database. Database (Oxford) 2012;2012:bas045. [PMID: 23110975 PMCID: PMC3483533 DOI: 10.1093/database/bas045] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

For:	Drabkin HJ, Blake JA. Manual Gene Ontology annotation workflow at the Mouse Genome Informatics Database. Database (Oxford) 2012;2012:bas045. [PMID: 23110975 PMCID: PMC3483533 DOI: 10.1093/database/bas045] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Number

Cited by Other Article(s)

Lee YY, Endale M, Wu G, Ruben MD, Francey LJ, Morris AR, Choo NY, Anafi RC, Smith DF, Liu AC, Hogenesch JB. Integration of genome-scale data identifies candidate sleep regulators. Sleep 2023;46:zsac279. [PMID: 36462188 PMCID: PMC9905783 DOI: 10.1093/sleep/zsac279] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 09/02/2022] [Indexed: 12/05/2022] Open

Affiliation(s)

Yin Yeng Lee Divisions of Human Genetics and Immunobiology, Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, 45229, USA Department of Pharmacology and Systems Physiology, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
Mehari Endale Department of Physiology and Aging, University of Florida College of Medicine, Gainesville, FL 32610, USA
Gang Wu Divisions of Human Genetics and Immunobiology, Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, 45229, USA
Marc D Ruben Divisions of Human Genetics and Immunobiology, Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, 45229, USA
Lauren J Francey Divisions of Human Genetics and Immunobiology, Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, 45229, USA
Andrew R Morris Department of Physiology and Aging, University of Florida College of Medicine, Gainesville, FL 32610, USA
Natalie Y Choo Division of Pediatric Otolaryngology-Head and Neck Surgery, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
Ron C Anafi Department of Medicine, Chronobiology and Sleep Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
David F Smith Division of Pediatric Otolaryngology-Head and Neck Surgery, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA Division of Pulmonary Medicine and the Sleep Center, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA Center for Circadian Medicine, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA Department of Otolaryngology - Head and Neck Surgery, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
Andrew C Liu Department of Physiology and Aging, University of Florida College of Medicine, Gainesville, FL 32610, USA
John B Hogenesch Divisions of Human Genetics and Immunobiology, Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, 45229, USA Center for Circadian Medicine, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA

Collapse

Zhang Q, Bai X, Shi J, Wang X, Zhang B, Dai L, Lin T, Gao Y, Zhang Y, Zhao X. DIA proteomics identified the potential targets associated with angiogenesis in the mammary glands of dairy cows with hemorrhagic mastitis. Front Vet Sci 2022;9:980963. [PMID: 36003411 PMCID: PMC9393364 DOI: 10.3389/fvets.2022.980963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 07/22/2022] [Indexed: 11/13/2022] Open

Affiliation(s)

Quanwei Zhang College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China College of Life Science and Technology, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China *Correspondence: Quanwei Zhang
Xu Bai College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Jun Shi College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Xueying Wang College of Life Science and Technology, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Bohao Zhang College of Life Science and Technology, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Lijun Dai College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Ting Lin College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Yuan Gao College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Yong Zhang College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China College of Life Science and Technology, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China
Xingxu Zhao College of Veterinary Medicine, Gansu Agriculture University, Lanzhou, China College of Life Science and Technology, Gansu Agriculture University, Lanzhou, China Gansu Key Laboratory of Animal Reproductive Physiology and Reproductive Regulation, Lanzhou, China Xingxu Zhao

Collapse

Ramsey J, McIntosh B, Renfro D, Aleksander SA, LaBonte S, Ross C, Zweifel AE, Liles N, Farrar S, Gill JJ, Erill I, Ades S, Berardini TZ, Bennett JA, Brady S, Britton R, Carbon S, Caruso SM, Clements D, Dalia R, Defelice M, Doyle EL, Friedberg I, Gurney SMR, Hughes L, Johnson A, Kowalski JM, Li D, Lovering RC, Mans TL, McCarthy F, Moore SD, Murphy R, Paustian TD, Perdue S, Peterson CN, Prüß BM, Saha MS, Sheehy RR, Tansey JT, Temple L, Thorman AW, Trevino S, Vollmer AC, Walbot V, Willey J, Siegele DA, Hu JC. Crowdsourcing biocuration: The Community Assessment of Community Annotation with Ontologies (CACAO). PLoS Comput Biol 2021;17:e1009463. [PMID: 34710081 PMCID: PMC8553046 DOI: 10.1371/journal.pcbi.1009463] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Affiliation(s)

Jolene Ramsey Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America Center for Phage Technology, Texas A&M University, College Station, Texas, United States of America
Brenley McIntosh Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Daniel Renfro Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Suzanne A. Aleksander Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Sandra LaBonte Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Curtis Ross Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America Center for Phage Technology, Texas A&M University, College Station, Texas, United States of America
Adrienne E. Zweifel Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Nathan Liles Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Shabnam Farrar Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America
Jason J. Gill Center for Phage Technology, Texas A&M University, College Station, Texas, United States of America Department of Animal Science, Texas A&M University, College Station, Texas, United States of America
Ivan Erill Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, Maryland, United States of America Department of Computer Science and Electrical Engineering, University of Maryland Baltimore County, Baltimore, Maryland, United States of America
Sarah Ades Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Tanya Z. Berardini The Arabidopsis Information Resource, Phoenix Bioinformatics, Newark, California, United States of America
Jennifer A. Bennett Department of Biology and Earth Science, Otterbein University, Westerville, Ohio, United States of America
Siobhan Brady Department of Plant Biology and Genome Center, University of California Davis, Davis, California, United States of America
Robert Britton Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, United States of America
Seth Carbon Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
Steven M. Caruso Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, Maryland, United States of America
Dave Clements Department of Biology, John Hopkins University, Baltimore, Maryland, United States of America
Ritu Dalia Department of Biology, Drexel University, Philadelphia, Pennsylvania, United States of America
Meredith Defelice Department of Biochemistry & Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Erin L. Doyle Biology Department, Doane University, Crete, Nebraska, United States of America
Iddo Friedberg Department of Microbiology, Miami University, Oxford, Ohio, United States of America
Susan M. R. Gurney Department of Biology, Drexel University, Philadelphia, Pennsylvania, United States of America
Lee Hughes Department of Biological Sciences, University of North Texas, Denton, Texas, United States of America
Allison Johnson Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, United States of America
Jason M. Kowalski Biological Sciences Department, University of Wisconsin-Parkside, Kenosha, Wisconsin, United States of America
Donghui Li The Arabidopsis Information Resource, Phoenix Bioinformatics, Newark, California, United States of America
Ruth C. Lovering Institute of Cardiovascular Science, University College London, London, United Kingdom
Tamara L. Mans Department of Biochemistry and Biotechnology, Minnesota State University Moorhead, Brooklyn Park, Minnesota, United States of America
Fiona McCarthy Department of Basic Science, College of Veterinary Medicine, Mississippi State University, Starkville, Mississippi, United States of America
Sean D. Moore Burnett School of Biomedical Sciences, University of Central Florida, Orlando, Florida, United States of America
Rebecca Murphy Department of Biology, Centenary College of Louisiana, Shreveport, Louisiana, United States of America
Timothy D. Paustian Department of Bacteriology, University of Wisconsin, Madison, Wisconsin, United States of America
Sarah Perdue Biological Sciences Department, University of Wisconsin-Parkside, Kenosha, Wisconsin, United States of America
Celeste N. Peterson Biology Department, Suffolk University, Boston, Massachusetts, United States of America
Birgit M. Prüß Microbiological Sciences Department, North Dakota State University, Fargo, North Dakota, United States of America
Margaret S. Saha Department of Biology, College of William & Mary, Williamsburg, Virginia, United States of America
Robert R. Sheehy Biology Department, Radford University, Radford, Virginia, United States of America
John T. Tansey Department of Biochemistry and Molecular Biology, Otterbein University, Westerville, Ohio, United States of America
Louise Temple School of Integrated Sciences, James Madison University, Harrisonburg, Virginia, United States of America
Alexander William Thorman Department of Environmental and Public Health Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
Saul Trevino Department of Chemistry, Math, and Physics, Houston Baptist University, Houston, Texas, United States of America
Amy Cheng Vollmer Department of Biology, Swarthmore College, Swarthmore, Pennsylvania, United States of America
Virginia Walbot Department of Biology, Stanford University, Stanford, California, United States of America
Joanne Willey Department of Science Education, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Hempstead, New York, United States of America
Deborah A. Siegele Department of Biology, Texas A&M University, College Station, Texas, United States of America
James C. Hu Department of Biochemistry & Biophysics, Texas A&M University, College Station, Texas, United States of America Center for Phage Technology, Texas A&M University, College Station, Texas, United States of America

Collapse

Review of Preferential Suspicious Genes in Microtia Patients Through Various Approaches. J Craniofac Surg 2020;31:538-541. [PMID: 31977690 DOI: 10.1097/scs.0000000000006244] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Chi-miR-3031 regulates beta-casein via the PI3K/AKT-mTOR signaling pathway in goat mammary epithelial cells (GMECs). BMC Vet Res 2018;14:369. [PMID: 30482199 PMCID: PMC6258393 DOI: 10.1186/s12917-018-1695-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 11/12/2018] [Indexed: 12/19/2022] Open

Howe DG, Blake JA, Bradford YM, Bult CJ, Calvi BR, Engel SR, Kadin JA, Kaufman TC, Kishore R, Laulederkind SJF, Lewis SE, Moxon SAT, Richardson JE, Smith C. Model organism data evolving in support of translational medicine. Lab Anim (NY) 2018;47:277-289. [PMID: 30224793 PMCID: PMC6322546 DOI: 10.1038/s41684-018-0150-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Accepted: 08/13/2018] [Indexed: 02/07/2023]

Abstract

Model organism databases (MODs) have been collecting and integrating biomedical research data for 30 years and were designed to meet specific needs of each model organism research community. The contributions of model organism research to understanding biological systems would be hard to overstate. Modern molecular biology methods and cost reductions in nucleotide sequencing have opened avenues for direct application of model organism research to elucidating mechanisms of human diseases. Thus, the mandate for model organism research and databases has now grown to include facilitating use of these data in translational applications. Challenges in meeting this opportunity include the distribution of research data across many databases and websites, a lack of data format standards for some data types, and sustainability of scale and cost for genomic database resources like MODs. The issues of widely distributed data and application of data standards are some of the challenges addressed by FAIR (Findable, Accessible, Interoperable, and Re-usable) data principles. The Alliance of Genome Resources is now moving to address these challenges by bringing together expertly curated research data from fly, mouse, rat, worm, yeast, zebrafish, and the Gene Ontology consortium. Centralized multi-species data access, integration, and format standardization will lower the data utilization barrier in comparative genomics and translational applications and will provide a framework in which sustainable scale and cost can be addressed. This article presents a brief historical perspective on how the Alliance model organisms are complementary and how they have already contributed to understanding the etiology of human diseases. In addition, we discuss four challenges for using data from MODs in translational applications and how the Alliance is working to address them, in part by applying FAIR data principles. Ultimately, combined data from these animal models are more powerful than the sum of the parts.

Collapse

Christie KR, Blake JA. Sensing the cilium, digital capture of ciliary data for comparative genomics investigations. Cilia 2018;7:3. [PMID: 29713460 PMCID: PMC5907423 DOI: 10.1186/s13630-018-0057-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Accepted: 04/03/2018] [Indexed: 01/03/2023] Open

Abstract

Background

Cilia are specialized, hair-like structures that project from the cell bodies of eukaryotic cells. With increased understanding of the distribution and functions of various types of cilia, interest in these organelles is accelerating. To effectively use this great expansion in knowledge, this information must be made digitally accessible and available for large-scale analytical and computational investigation. Capture and integration of knowledge about cilia into existing knowledge bases, thus providing the ability to improve comparative genomic data analysis, is the objective of this work.

Methods

We focused on the capture of information about cilia as studied in the laboratory mouse, a primary model of human biology. The workflow developed establishes a standard for capture of comparative functional data relevant to human biology. We established the 310 closest mouse orthologs of the 302 human genes defined in the SYSCILIA Gold Standard set of ciliary genes. For the mouse genes, we identified biomedical literature for curation and used Gene Ontology (GO) curation paradigms to provide functional annotations from these publications.

Results

Employing a methodology for comprehensive capture of experimental data about cilia genes in structured, digital form, we established a workflow for curation of experimental literature detailing molecular function and roles of cilia proteins starting with the mouse orthologs of the human SYSCILIA gene set. We worked closely with the GO Consortium ontology development editors and the SYSCILIA Consortium to improve the representation of ciliary biology within the GO. During the time frame of the ontology improvement project, we have fully curated 134 of these 310 mouse genes, resulting in an increase in the number of ciliary and other experimental annotations.

Conclusions

We have improved the GO annotations available for mouse genes orthologous to the human genes in the SYSCILIA Consortium’s Gold Standard set. In addition, ciliary terminology in the GO itself was improved in collaboration with GO ontology developers and the SYSCILIA Consortium. These improvements to the GO terms for the functions and roles of ciliary proteins, along with the increase in annotations of the corresponding genes, enhance the representation of ciliary processes and localizations and improve access to these data during large-scale bioinformatic analyses.

Electronic supplementary material

The online version of this article (10.1186/s13630-018-0057-0) contains supplementary material, which is available to authorized users.

Collapse

Gaudet P, Škunca N, Hu JC, Dessimoz C. Primer on the Gene Ontology. Methods Mol Biol 2017;1446:25-37. [PMID: 27812933 DOI: 10.1007/978-1-4939-3743-1_3] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]

Shaw DR. Searching the Mouse Genome Informatics (MGI) Resources for Information on Mouse Biology from Genotype to Phenotype. ACTA ACUST UNITED AC 2016;56:1.7.1-1.7.16. [PMID: 27930808 DOI: 10.1002/cpbi.18] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Blake JA, Eppig JT, Kadin JA, Richardson JE, Smith CL, Bult CJ. Mouse Genome Database (MGD)-2017: community knowledge resource for the laboratory mouse. Nucleic Acids Res 2016;45:D723-D729. [PMID: 27899570 PMCID: PMC5210536 DOI: 10.1093/nar/gkw1040] [Citation(s) in RCA: 199] [Impact Index Per Article: 24.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Accepted: 10/28/2016] [Indexed: 11/30/2022] Open

Davis MR, Arner E, Duffy CRE, De Sousa PA, Dahlman I, Arner P, Summers KM. Expression of FBN1 during adipogenesis: Relevance to the lipodystrophy phenotype in Marfan syndrome and related conditions. Mol Genet Metab 2016;119:174-85. [PMID: 27386756 PMCID: PMC5044862 DOI: 10.1016/j.ymgme.2016.06.009] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Revised: 06/18/2016] [Accepted: 06/18/2016] [Indexed: 01/27/2023]

Abstract

Fibrillin-1 is a large glycoprotein encoded by the FBN1 gene in humans. It provides strength and elasticity to connective tissues and is involved in regulating the bioavailability of the growth factor TGFβ. Mutations in FBN1 may be associated with depleted or abnormal adipose tissue, seen in some patients with Marfan syndrome and lipodystrophies. As this lack of adipose tissue does not result in high morbidity or mortality, it is generally under-appreciated, but is a cause of psychosocial problems particularly to young patients. We examined the role of fibrillin-1 in adipogenesis. In inbred mouse strains we found significant variation in the level of expression in the Fbn1 gene that correlated with variation in several measures of body fat, suggesting that mouse fibrillin-1 is associated with the level of fat tissue. Furthermore, we found that FBN1 mRNA was up-regulated in the adipose tissue of obese women compared to non-obese, and associated with an increase in adipocyte size. We used human mesenchymal stem cells differentiated in culture to adipocytes to show that fibrillin-1 declines after the initiation of differentiation. Gene expression results from a similar experiment (available through the FANTOM5 project) revealed that the decline in fibrillin-1 protein was paralleled by a decline in FBN1 mRNA. Examination of the FBN1 gene showed that the region commonly affected in FBN1-associated lipodystrophy is highly conserved both across the three human fibrillin genes and across genes encoding fibrillin-1 in vertebrates. These results suggest that fibrillin-1 is involved as the undifferentiated mesenchymal stem cells transition to adipogenesis but then declines as the developing adipocytes take on their final phenotype. Since the C-terminal peptide of fibrillin-1 is a glucogenic hormone, individuals with low fibrillin-1 (for example with FBN1 mutations associated with lipodystrophy) may fail to differentiate adipocytes and/or to accumulate adipocyte lipids, although this still needs to be shown experimentally.

Collapse

Fluck J, Madan S, Ansari S, Kodamullil AT, Karki R, Rastegar-Mojarad M, Catlett NL, Hayes W, Szostak J, Hoeng J, Peitsch M. Training and evaluation corpora for the extraction of causal relationships encoded in biological expression language (BEL). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016;2016:baw113. [PMID: 27554092 PMCID: PMC4995071 DOI: 10.1093/database/baw113] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 07/07/2016] [Indexed: 01/21/2023]

Abstract

Success in extracting biological relationships is mainly dependent on the complexity of the task as well as the availability of high-quality training data. Here, we describe the new corpora in the systems biology modeling language BEL for training and testing biological relationship extraction systems that we prepared for the BioCreative V BEL track. BEL was designed to capture relationships not only between proteins or chemicals, but also complex events such as biological processes or disease states. A BEL nanopub is the smallest unit of information and represents a biological relationship with its provenance. In BEL relationships (called BEL statements), the entities are normalized to defined namespaces mainly derived from public repositories, such as sequence databases, MeSH or publicly available ontologies. In the BEL nanopubs, the BEL statements are associated with citation information and supportive evidence such as a text excerpt. To enable the training of extraction tools, we prepared BEL resources and made them available to the community. We selected a subset of these resources focusing on a reduced set of namespaces, namely, human and mouse genes, ChEBI chemicals, MeSH diseases and GO biological processes, as well as relationship types ‘increases’ and ‘decreases’. The published training corpus contains 11 000 BEL statements from over 6000 supportive text excerpts. For method evaluation, we selected and re-annotated two smaller subcorpora containing 100 text excerpts. For this re-annotation, the inter-annotator agreement was measured by the BEL track evaluation environment and resulted in a maximal F-score of 91.18% for full statement agreement. In addition, for a set of 100 BEL statements, we do not only provide the gold standard expert annotations, but also text excerpts pre-selected by two automated systems. Those text excerpts were evaluated and manually annotated as true or false supportive in the course of the BioCreative V BEL track task.

Database URL:http://wiki.openbel.org/display/BIOC/Datasets

Collapse

Drabkin HJ, Christie KR, Dolan ME, Hill DP, Ni L, Sitnikov D, Blake JA. Application of comparative biology in GO functional annotation: the mouse model. Mamm Genome 2015;26:574-83. [PMID: 26141960 PMCID: PMC4602061 DOI: 10.1007/s00335-015-9580-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 06/23/2015] [Indexed: 01/22/2023]

Huntley RP, Harris MA, Alam-Faruque Y, Blake JA, Carbon S, Dietze H, Dimmer EC, Foulger RE, Hill DP, Khodiyar VK, Lock A, Lomax J, Lovering RC, Mutowo-Meullenet P, Sawford T, Van Auken K, Wood V, Mungall CJ. A method for increasing expressivity of Gene Ontology annotations using a compositional approach. BMC Bioinformatics 2014;15:155. [PMID: 24885854 PMCID: PMC4039540 DOI: 10.1186/1471-2105-15-155] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Accepted: 05/15/2014] [Indexed: 11/22/2022] Open

Balakrishnan R, Harris MA, Huntley R, Van Auken K, Cherry JM. A guide to best practices for Gene Ontology (GO) manual annotation. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2013;2013:bat054. [PMID: 23842463 PMCID: PMC3706743 DOI: 10.1093/database/bat054] [Citation(s) in RCA: 105] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]