Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Söhngen C, Chang A, Schomburg D. Development of a classification scheme for disease-related enzyme information. BMC Bioinformatics 2011;12:329. [PMID: 21827651 PMCID: PMC3166944 DOI: 10.1186/1471-2105-12-329] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2011] [Accepted: 08/09/2011] [Indexed: 11/24/2022] Open

For:	Söhngen C, Chang A, Schomburg D. Development of a classification scheme for disease-related enzyme information. BMC Bioinformatics 2011;12:329. [PMID: 21827651 PMCID: PMC3166944 DOI: 10.1186/1471-2105-12-329] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2011] [Accepted: 08/09/2011] [Indexed: 11/24/2022] Open

Number

Cited by Other Article(s)

Chang A, Jeske L, Ulbrich S, Hofmann J, Koblitz J, Schomburg I, Neumann-Schaal M, Jahn D, Schomburg D. BRENDA, the ELIXIR core data resource in 2021: new developments and updates. Nucleic Acids Res 2021;49:D498-D508. [PMID: 33211880 PMCID: PMC7779020 DOI: 10.1093/nar/gkaa1025] [Citation(s) in RCA: 279] [Impact Index Per Article: 93.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Revised: 10/14/2020] [Accepted: 10/26/2020] [Indexed: 12/31/2022] Open

Cui H, Zhang L, Ford B, Cheng HL, Macklin JA, Reznicek A, Starr J. Measurement Recorder: developing a useful tool for making species descriptions that produces computable phenotypes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020;2020:5995854. [PMID: 33216896 PMCID: PMC7678789 DOI: 10.1093/database/baaa079] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 08/24/2020] [Accepted: 08/27/2020] [Indexed: 12/31/2022]

Abstract

To use published phenotype information in computational analyses, there have been efforts to convert descriptions of phenotype characters from human languages to ontologized statements. This postpublication curation process is not only slow and costly, it is also burdened with significant intercurator variation (including curator-author variation), due to different interpretations of a character by various individuals. This problem is inherent in any human-based intellectual activity. To address this problem, making scientific publications semantically clear (i.e. computable) by the authors at the time of publication is a critical step if we are to avoid postpublication curation. To help authors efficiently produce species phenotypes while producing computable data, we are experimenting with an author-driven ontology development approach and developing and evaluating a series of ontology-aware software modules that would create publishable species descriptions that are readily useable in scientific computations. The first software module prototype called Measurement Recorder has been developed to assist authors in defining continuous measurements and reported in this paper. Two usability studies of the software were conducted with 22 undergraduate students majoring in information science and 32 in biology. Results suggest that participants can use Measurement Recorder without training and they find it easy to use after limited practice. Participants also appreciate the semantic enhancement features. Measurement Recorder's character reuse features facilitate character convergence among participants by 48% and have the potential to further reduce user errors in defining characters. A set of software design issues have also been identified and then corrected. Measurement Recorder enables authors to record measurements in a semantically clear manner and enriches phenotype ontology along the way. Future work includes representing the semantic data as Resource Description Framework (RDF) knowledge graphs and characterizing the division of work between authors as domain knowledge providers and ontology engineers as knowledge formalizers in this new author-driven ontology development approach.

Collapse

Jeske L, Placzek S, Schomburg I, Chang A, Schomburg D. BRENDA in 2019: a European ELIXIR core data resource. Nucleic Acids Res 2019;47:D542-D549. [PMID: 30395242 PMCID: PMC6323942 DOI: 10.1093/nar/gky1048] [Citation(s) in RCA: 225] [Impact Index Per Article: 45.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 10/05/2018] [Accepted: 10/30/2018] [Indexed: 12/22/2022] Open

Dahdul W, Manda P, Cui H, Balhoff JP, Dececchi TA, Ibrahim N, Lapp H, Vision T, Mabee PM. Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems. Database (Oxford) 2018;2018:5255130. [PMID: 30576485 PMCID: PMC6301375 DOI: 10.1093/database/bay110] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Revised: 08/22/2018] [Accepted: 09/24/2018] [Indexed: 11/12/2022]

Abstract

Natural language descriptions of organismal phenotypes, a principal object of study in biology, are abundant in the biological literature. Expressing these phenotypes as logical statements using ontologies would enable large-scale analysis on phenotypic information from diverse systems. However, considerable human effort is required to make these phenotype descriptions amenable to machine reasoning. Natural language processing tools have been developed to facilitate this task, and the training and evaluation of these tools depend on the availability of high quality, manually annotated gold standard data sets. We describe the development of an expert-curated gold standard data set of annotated phenotypes for evolutionary biology. The gold standard was developed for the curation of complex comparative phenotypes for the Phenoscape project. It was created by consensus among three curators and consists of entity-quality expressions of varying complexity. We use the gold standard to evaluate annotations created by human curators and those generated by the Semantic CharaParser tool. Using four annotation accuracy metrics that can account for any level of relationship between terms from two phenotype annotations, we found that machine-human consistency, or similarity, was significantly lower than inter-curator (human-human) consistency. Surprisingly, allowing curatorsaccess to external information did not significantly increase the similarity of their annotations to the gold standard or have a significant effect on inter-curator consistency. We found that the similarity of machine annotations to the gold standard increased after new relevant ontology terms had been added. Evaluation by the original authors of the character descriptions indicated that the gold standard annotations came closer to representing their intended meaning than did either the curator or machine annotations. These findings point toward ways to better design software to augment human curators and the use of the gold standard corpus will allow training and assessment of new tools to improve phenotype annotation accuracy at scale.

Collapse

Schomburg I, Jeske L, Ulbrich M, Placzek S, Chang A, Schomburg D. The BRENDA enzyme information system–From a database to an expert system. J Biotechnol 2017;261:194-206. [DOI: 10.1016/j.jbiotec.2017.04.020] [Citation(s) in RCA: 102] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2017] [Revised: 04/11/2017] [Accepted: 04/18/2017] [Indexed: 02/06/2023]

Liang SH, Walther BA, Shieh BS. Contrasting determinants for the introduction and establishment success of exotic birds in Taiwan using decision trees models. PeerJ 2017;5:e3092. [PMID: 28316893 PMCID: PMC5354111 DOI: 10.7717/peerj.3092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Accepted: 02/14/2017] [Indexed: 11/20/2022] Open

Abstract

Background

Biological invasions have become a major threat to biodiversity, and identifying determinants underlying success at different stages of the invasion process is essential for both prevention management and testing ecological theories. To investigate variables associated with different stages of the invasion process in a local region such as Taiwan, potential problems using traditional parametric analyses include too many variables of different data types (nominal, ordinal, and interval) and a relatively small data set with too many missing values.

Methods

We therefore used five decision tree models instead and compared their performance. Our dataset contains 283 exotic bird species which were transported to Taiwan; of these 283 species, 95 species escaped to the field successfully (introduction success); of these 95 introduced species, 36 species reproduced in the field of Taiwan successfully (establishment success). For each species, we collected 22 variables associated with human selectivity and species traits which may determine success during the introduction stage and establishment stage. For each decision tree model, we performed three variable treatments: (I) including all 22 variables, (II) excluding nominal variables, and (III) excluding nominal variables and replacing ordinal values with binary ones. Five performance measures were used to compare models, namely, area under the receiver operating characteristic curve (AUROC), specificity, precision, recall, and accuracy.

Results

The gradient boosting models performed best overall among the five decision tree models for both introduction and establishment success and across variable treatments. The most important variables for predicting introduction success were the bird family, the number of invaded countries, and variables associated with environmental adaptation, whereas the most important variables for predicting establishment success were the number of invaded countries and variables associated with reproduction.

Discussion

Our final optimal models achieved relatively high performance values, and we discuss differences in performance with regard to sample size and variable treatments. Our results showed that, for both the establishment model and introduction model, the number of invaded countries was the most important or second most important determinant, respectively. Therefore, we suggest that future success for introduction and establishment of exotic birds may be gauged by simply looking at previous success in invading other countries. Finally, we found that species traits related to reproduction were more important in establishment models than in introduction models; importantly, these determinants were not averaged but either minimum or maximum values of species traits. Therefore, we suggest that in addition to averaged values, reproductive potential represented by minimum and maximum values of species traits should be considered in invasion studies.

Collapse

Placzek S, Schomburg I, Chang A, Jeske L, Ulbrich M, Tillack J, Schomburg D. BRENDA in 2017: new perspectives and new tools in BRENDA. Nucleic Acids Res 2016;45:D380-D388. [PMID: 27924025 PMCID: PMC5210646 DOI: 10.1093/nar/gkw952] [Citation(s) in RCA: 175] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 10/17/2016] [Indexed: 01/11/2023] Open

Chen L, Peng S, Yang B. Predicting alien herb invasion with machine learning models: biogeographical and life-history traits both matter. Biol Invasions 2015. [DOI: 10.1007/s10530-015-0870-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Chang A, Schomburg I, Placzek S, Jeske L, Ulbrich M, Xiao M, Sensen CW, Schomburg D. BRENDA in 2015: exciting developments in its 25th year of existence. Nucleic Acids Res 2014;43:D439-46. [PMID: 25378310 PMCID: PMC4383907 DOI: 10.1093/nar/gku1068] [Citation(s) in RCA: 150] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open

Standardization in enzymology—Data integration in the world׳s enzyme information system BRENDA. ACTA ACUST UNITED AC 2014. [DOI: 10.1016/j.pisc.2014.02.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

OralCard: A bioinformatic tool for the study of oral proteome. Arch Oral Biol 2013;58:762-72. [DOI: 10.1016/j.archoralbio.2012.12.012] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2012] [Revised: 11/26/2012] [Accepted: 12/30/2012] [Indexed: 10/27/2022]

De Filippo C, Ramazzotti M, Fontana P, Cavalieri D. Bioinformatic approaches for functional annotation and pathway inference in metagenomics data. Brief Bioinform 2013;13:696-710. [PMID: 23175748 PMCID: PMC3505041 DOI: 10.1093/bib/bbs070] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Cooper L, Walls RL, Elser J, Gandolfo MA, Stevenson DW, Smith B, Preece J, Athreya B, Mungall CJ, Rensing S, Hiss M, Lang D, Reski R, Berardini TZ, Li D, Huala E, Schaeffer M, Menda N, Arnaud E, Shrestha R, Yamazaki Y, Jaiswal P. The plant ontology as a tool for comparative plant anatomy and genomic analyses. PLANT & CELL PHYSIOLOGY 2013;54:e1. [PMID: 23220694 PMCID: PMC3583023 DOI: 10.1093/pcp/pcs163] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]

Affiliation(s)

Laurel Cooper Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331-2902, USA These authors contributed equally to this work These authors contributed equally to the development of the Plant Ontology
Ramona L. Walls New York Botanical Garden, 2900 Southern Blvd., Bronx, NY 10458-5126, USA These authors contributed equally to this work These authors contributed equally to the development of the Plant Ontology
Justin Elser Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331-2902, USA These authors contributed equally to the development of the Plant Ontology
Maria A. Gandolfo L.H. Bailey Hortorium, Department of Plant Biology, Cornell University, 412 Mann Library Building, Ithaca, NY 14853, USA These authors contributed equally to the development of the Plant Ontology
Dennis W. Stevenson New York Botanical Garden, 2900 Southern Blvd., Bronx, NY 10458-5126, USA These authors contributed equally to the development of the Plant Ontology
Barry Smith Department of Philosophy, University at Buffalo, 126 Park Hall, Buffalo, NY 14260, USA These authors contributed equally to the development of the Plant Ontology
Justin Preece Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331-2902, USA
Balaji Athreya Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331-2902, USA
Christopher J. Mungall Berkeley Bioinformatics Open-Source Projects, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mailstop 64-121, Berkeley, CA 94720, USA
Stefan Rensing Faculty of Biology and BIOSS Centre for Biological Signalling Studies, University of Freiburg, Schänzlestr. 1, D-79104 Freiburg, Germany
Manuel Hiss Faculty of Biology and BIOSS Centre for Biological Signalling Studies, University of Freiburg, Schänzlestr. 1, D-79104 Freiburg, Germany
Daniel Lang Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
Ralf Reski Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany FRIAS - Freiburg Institute for Advanced Studies, University of Freiburg, Freiburg, Germany
Tanya Z. Berardini Department of Plant Biology, Carnegie Institution for Science, Stanford, CA 94305, USA
Donghui Li Department of Plant Biology, Carnegie Institution for Science, Stanford, CA 94305, USA
Eva Huala Department of Plant Biology, Carnegie Institution for Science, Stanford, CA 94305, USA
Mary Schaeffer Agriculture Research Services, United States Department of Agriculture, Columbia, MO 65211, USA Division of Plant Sciences, Department of Agronomy, University of Missouri, Columbia, MO 65211, USA
Naama Menda Boyce Thompson Institute for Plant Research, 533 Tower Road, Ithaca, NY 148533, USA
Elizabeth Arnaud Bioversity International, via dei Tre Denari, 174/a, Maccarese, Rome, Italy
Rosemary Shrestha Genetic Resources Program, Centro Internacional de Mejoramiento de Maiz y Trigo (CIMMYT), Apdo. Postal 6-641, 06600 Mexico, D.F., Mexico
Yukiko Yamazaki Center for Genetic Resource Information, National Institute of Genetics, Mishima, Shizuoka, 411-8540 Japan
Pankaj Jaiswal Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331-2902, USA These authors contributed equally to the development of the Plant Ontology *Corresponding author: E-mail,: ; Fax, +1-541-737-3573

Collapse

Schomburg I, Chang A, Placzek S, Söhngen C, Rother M, Lang M, Munaretto C, Ulas S, Stelzer M, Grote A, Scheer M, Schomburg D. BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA. Nucleic Acids Res 2013;41:D764-72. [PMID: 23203881 PMCID: PMC3531171 DOI: 10.1093/nar/gks1049] [Citation(s) in RCA: 271] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Revised: 10/08/2012] [Accepted: 10/10/2012] [Indexed: 11/13/2022] Open

Schad E, Tompa P, Hegyi H. The relationship between proteome size, structural disorder and organism complexity. Genome Biol 2011;12:R120. [PMID: 22182830 PMCID: PMC3334615 DOI: 10.1186/gb-2011-12-12-r120] [Citation(s) in RCA: 136] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Revised: 10/25/2011] [Accepted: 12/19/2011] [Indexed: 11/22/2022] Open