1
|
Caño De Las Heras S, Gargalo CL, Caccavale F, Gernaey KV, Krühne U. NyctiDB: A non-relational bioprocesses modeling database supported by an ontology. FRONTIERS IN CHEMICAL ENGINEERING 2022. [DOI: 10.3389/fceng.2022.1036867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Strategies to exploit and enable the digitalization of industrial processes are on course to become game-changers in optimizing (bio)chemical facilities. To achieve this, these industries face an increasing need for process models and, as importantly, an efficient way to store the models and data/information. Therefore, this work proposes developing an online information storage system that can facilitate the reuse and expansion of process models and make them available to the digitalization cycle. This system is named NyctiDB, and it is a novel non-relational database coupled with a bioprocess ontology. The ontology supports the selection and classification of bioprocess models focused information, while the database is in charge of the online storage of said information. Through a series of online collections, NyctiDB contains essential knowledge for the design, monitoring, control, and optimization of a bioprocess based on its mathematical model. Once NyctiDB has been implemented, its applicability and usefulness are demonstrated through two applications. Application A shows how NyctiDB is integrated inside the software architecture of an online educational bioprocess simulator. This implies that NyctiDB provides the information for the visualization of different bioprocess behaviours and the modifications of the models in the software. Moreover, the information related to the parameters and conditions of each model is used to support the users’ understanding of the process. Additionally, application B illustrates that NyctiDB can be used as AI enabler to further the research in this field through open-source and reliable data. This can, in fact, be used as the information source for the AI frameworks when developing, for example, hybrid models or smart expert systems for bioprocesses. Henceforth, this work aims to provide a blueprint on how to collect bioprocess modeling information and connect it to facilitate and empower the Internet-of-Things paradigm and the digitalization of the biomanufacturing industries.
Collapse
|
2
|
Asikis T, Klinglmayr J, Helbing D, Pournaras E. How value-sensitive design can empower sustainable consumption. ROYAL SOCIETY OPEN SCIENCE 2021; 8:201418. [PMID: 33614080 PMCID: PMC7890503 DOI: 10.1098/rsos.201418] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2020] [Accepted: 12/10/2020] [Indexed: 06/12/2023]
Abstract
In a so-called overpopulated world, sustainable consumption is of existential importance. However, the expanding spectrum of product choices and their production complexity challenge consumers to make informed and value-sensitive decisions. Recent approaches based on (personalized) psychological manipulation are often intransparent, potentially privacy-invasive and inconsistent with (informational) self-determination. By contrast, responsible consumption based on informed choices currently requires reasoning to an extent that tends to overwhelm human cognitive capacity. As a result, a collective shift towards sustainable consumption remains a grand challenge. Here, we demonstrate a novel personal shopping assistant implemented as a smart phone app that supports a value-sensitive design and leverages sustainability awareness, using experts' knowledge and 'wisdom of the crowd' for transparent product information and explainable product ratings. Real-world field experiments in two supermarkets confirm higher sustainability awareness and a bottom-up behavioural shift towards more sustainable consumption. These results encourage novel business models for retailers and producers, ethically aligned with consumer preferences and with higher sustainability.
Collapse
Affiliation(s)
- Thomas Asikis
- Professorship of Computational Social Science, ETH Zurich, Zurich, Switzerland
| | | | - Dirk Helbing
- Professorship of Computational Social Science, ETH Zurich, Zurich, Switzerland
| | | |
Collapse
|
3
|
Tchechmedjiev A, Abdaoui A, Emonet V, Zevio S, Jonquet C. SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes. BMC Bioinformatics 2018; 19:405. [PMID: 30400805 PMCID: PMC6218966 DOI: 10.1186/s12859-018-2429-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 10/10/2018] [Indexed: 12/01/2022] Open
Abstract
Background Despite a wide adoption of English in science, a significant amount of biomedical data are produced in other languages, such as French. Yet a majority of natural language processing or semantic tools as well as domain terminologies or ontologies are only available in English, and cannot be readily applied to other languages, due to fundamental linguistic differences. However, semantic resources are required to design semantic indexes and transform biomedical (text)data into knowledge for better information mining and retrieval. Results We present the SIFR Annotator (http://bioportal.lirmm.fr/annotator), a publicly accessible ontology-based annotation web service to process biomedical text data in French. The service, developed during the Semantic Indexing of French Biomedical Data Resources (2013–2019) project is included in the SIFR BioPortal, an open platform to host French biomedical ontologies and terminologies based on the technology developed by the US National Center for Biomedical Ontology. The portal facilitates use and fostering of ontologies by offering a set of services –search, mappings, metadata, versioning, visualization, recommendation– including for annotation purposes. We introduce the adaptations and improvements made in applying the technology to French as well as a number of language independent additional features –implemented by means of a proxy architecture– in particular annotation scoring and clinical context detection. We evaluate the performance of the SIFR Annotator on different biomedical data, using available French corpora –Quaero (titles from French MEDLINE abstracts and EMEA drug labels) and CépiDC (ICD-10 coding of death certificates)– and discuss our results with respect to the CLEF eHealth information extraction tasks. Conclusions We show the web service performs comparably to other knowledge-based annotation approaches in recognizing entities in biomedical text and reach state-of-the-art levels in clinical context detection (negation, experiencer, temporality). Additionally, the SIFR Annotator is the first openly web accessible tool to annotate and contextualize French biomedical text with ontology concepts leveraging a dictionary currently made of 28 terminologies and ontologies and 333 K concepts. The code is openly available, and we also provide a Docker packaging for easy local deployment to process sensitive (e.g., clinical) data in-house (https://github.com/sifrproject).
Collapse
Affiliation(s)
- Andon Tchechmedjiev
- Laboratory of Informatics, Robotics and Microelectronics of Montpellier (LIRMM), University of Montpellier, CNRS, 161, rue Ada, 34095, Montpellier cedex 5, France. .,LGI2P, IMT Mines Ales, Univ Montpellier, Alès, France.
| | - Amine Abdaoui
- Laboratory of Informatics, Robotics and Microelectronics of Montpellier (LIRMM), University of Montpellier, CNRS, 161, rue Ada, 34095, Montpellier cedex 5, France
| | - Vincent Emonet
- Laboratory of Informatics, Robotics and Microelectronics of Montpellier (LIRMM), University of Montpellier, CNRS, 161, rue Ada, 34095, Montpellier cedex 5, France
| | - Stella Zevio
- Laboratory of Informatics, Robotics and Microelectronics of Montpellier (LIRMM), University of Montpellier, CNRS, 161, rue Ada, 34095, Montpellier cedex 5, France
| | - Clement Jonquet
- Laboratory of Informatics, Robotics and Microelectronics of Montpellier (LIRMM), University of Montpellier, CNRS, 161, rue Ada, 34095, Montpellier cedex 5, France.,Center for Biomedical Informatics Research (BMIR), Stanford University, 1265 Welch Rd, Stanford, CA, 94305, USA
| |
Collapse
|
4
|
Discovering and linking public omics data sets using the Omics Discovery Index. Nat Biotechnol 2018; 35:406-409. [PMID: 28486464 DOI: 10.1038/nbt.3790] [Citation(s) in RCA: 125] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
5
|
Martínez-Romero M, Jonquet C, O'Connor MJ, Graybeal J, Pazos A, Musen MA. NCBO Ontology Recommender 2.0: an enhanced approach for biomedical ontology recommendation. J Biomed Semantics 2017; 8:21. [PMID: 28592275 PMCID: PMC5463318 DOI: 10.1186/s13326-017-0128-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 04/13/2017] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND Ontologies and controlled terminologies have become increasingly important in biomedical research. Researchers use ontologies to annotate their data with ontology terms, enabling better data integration and interoperability across disparate datasets. However, the number, variety and complexity of current biomedical ontologies make it cumbersome for researchers to determine which ones to reuse for their specific needs. To overcome this problem, in 2010 the National Center for Biomedical Ontology (NCBO) released the Ontology Recommender, which is a service that receives a biomedical text corpus or a list of keywords and suggests ontologies appropriate for referencing the indicated terms. METHODS We developed a new version of the NCBO Ontology Recommender. Called Ontology Recommender 2.0, it uses a novel recommendation approach that evaluates the relevance of an ontology to biomedical text data according to four different criteria: (1) the extent to which the ontology covers the input data; (2) the acceptance of the ontology in the biomedical community; (3) the level of detail of the ontology classes that cover the input data; and (4) the specialization of the ontology to the domain of the input data. RESULTS Our evaluation shows that the enhanced recommender provides higher quality suggestions than the original approach, providing better coverage of the input data, more detailed information about their concepts, increased specialization for the domain of the input data, and greater acceptance and use in the community. In addition, it provides users with more explanatory information, along with suggestions of not only individual ontologies but also groups of ontologies to use together. It also can be customized to fit the needs of different ontology recommendation scenarios. CONCLUSIONS Ontology Recommender 2.0 suggests relevant ontologies for annotating biomedical text data. It combines the strengths of its predecessor with a range of adjustments and new features that improve its reliability and usefulness. Ontology Recommender 2.0 recommends over 500 biomedical ontologies from the NCBO BioPortal platform, where it is openly available (both via the user interface at http://bioportal.bioontology.org/recommender , and via a Web service API).
Collapse
Affiliation(s)
- Marcos Martínez-Romero
- Stanford Center for Biomedical Informatics Research, 1265 Welch Road, Stanford University School of Medicine, Stanford, CA, 94305-5479, USA.
| | - Clement Jonquet
- Stanford Center for Biomedical Informatics Research, 1265 Welch Road, Stanford University School of Medicine, Stanford, CA, 94305-5479, USA.,Laboratory of Informatics, Robotics and Microelectronics of Montpellier (LIRMM), University of Montpellier, 161 rue Ada, 34095, Montpellier, Cdx 5, France
| | - Martin J O'Connor
- Stanford Center for Biomedical Informatics Research, 1265 Welch Road, Stanford University School of Medicine, Stanford, CA, 94305-5479, USA
| | - John Graybeal
- Stanford Center for Biomedical Informatics Research, 1265 Welch Road, Stanford University School of Medicine, Stanford, CA, 94305-5479, USA
| | - Alejandro Pazos
- Department of Information and Communication Technologies, Computer Science Building, Elviña Campus, University of A Coruña, 15071, A Coruña, Spain
| | - Mark A Musen
- Stanford Center for Biomedical Informatics Research, 1265 Welch Road, Stanford University School of Medicine, Stanford, CA, 94305-5479, USA
| |
Collapse
|
6
|
Huang J, Eilbeck K, Smith B, Blake JA, Dou D, Huang W, Natale DA, Ruttenberg A, Huan J, Zimmermann MT, Jiang G, Lin Y, Wu B, Strachan HJ, de Silva N, Kasukurthi MV, Jha VK, He Y, Zhang S, Wang X, Liu Z, Borchert GM, Tan M. The development of non-coding RNA ontology. INT J DATA MIN BIOIN 2016; 15:214-232. [PMID: 27990175 DOI: 10.1504/ijdmb.2016.077072] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Identification of non-coding RNAs (ncRNAs) has been significantly improved over the past decade. On the other hand, semantic annotation of ncRNA data is facing critical challenges due to the lack of a comprehensive ontology to serve as common data elements and data exchange standards in the field. We developed the Non-Coding RNA Ontology (NCRO) to handle this situation. By providing a formally defined ncRNA controlled vocabulary, the NCRO aims to fill a specific and highly needed niche in semantic annotation of large amounts of ncRNA biological and clinical data.
Collapse
Affiliation(s)
- Jingshan Huang
- School of Computing, University of South Alabama, Shelby Hall, Room 1123, 150 Jaguar Drive Mobile, AL 36688, USA,
| | - Karen Eilbeck
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah, USA,
| | - Barry Smith
- University at Buffalo - SUNY, Buffalo, New York 14260, USA,
| | | | - Dejing Dou
- Computer and Information Science Department, University of Oregon, Eugene, Oregon 97403, USA,
| | - Weili Huang
- Miracle Query, Inc., Eugene, Oregon 97405, USA,
| | - Darren A Natale
- Georgetown University Medical Center, Washington DC 20007, USA,
| | | | - Jun Huan
- Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, Kansas 66045, USA,
| | - Michael T Zimmermann
- Division of Biomedical Statistics and Informatics, College of Medicine at Mayo Clinic, Rochester, Minnesota 55905, USA,
| | - Guoqian Jiang
- Division of Biomedical Statistics and Informatics, College of Medicine at Mayo Clinic, Rochester, Minnesota 55905, USA,
| | - Yu Lin
- Data Coordination and Integration Center, University of Miami, Miami, Florida 33146, USA,
| | - Bin Wu
- Endocrinology Department, Kunming Medical University, Kunming, Yunnan, 650032 China,
| | - Harrison J Strachan
- School of Computing, University of South Alabama, Mobile, Alabama 36688, USA,
| | - Nisansa de Silva
- Computer and Information Science, University of Oregon, Eugene, Oregon 97403, USA,
| | | | - Vikash Kumar Jha
- School of Computing, University of South Alabama, Mobile, Alabama 36688, USA,
| | - Yongqun He
- Lab Animal Medicine, Microbiology, Immunology and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA,
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, Florida 32816, USA,
| | - Xiaowei Wang
- Cancer Biology, Washington University in St. Louis, St. Louis, Missouri 63130, USA,
| | - Zixing Liu
- Mitchell Cancer Institute, University of South Alabama, Mobile, Alabama 36604, USA,
| | - Glen M Borchert
- Department of Biology, University of South Alabama, Mobile, Alabama 36688, USA,
| | - Ming Tan
- Mitchell Cancer Institute, University of South Alabama, Mobile, Alabama 36604, USA,
| |
Collapse
|
7
|
The Non-Coding RNA Ontology (NCRO): a comprehensive resource for the unification of non-coding RNA biology. J Biomed Semantics 2016; 7:24. [PMID: 27152146 PMCID: PMC4857245 DOI: 10.1186/s13326-016-0066-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 04/19/2016] [Indexed: 11/17/2022] Open
Abstract
In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data has lagged behind their identification. Given the large quantity of information being obtained in this area, there emerges an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO ontology can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. The NCRO ontology is free and open to all users, accessible at: http://purl.obolibrary.org/obo/ncro.owl.
Collapse
|
8
|
Muñoz-Mármol M, Crespo J, Fritts MJ, Maojo V. Towards the taxonomic categorization and recognition of nanoparticle shapes. NANOMEDICINE-NANOTECHNOLOGY BIOLOGY AND MEDICINE 2015; 11:457-65. [DOI: 10.1016/j.nano.2014.07.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2014] [Revised: 07/09/2014] [Accepted: 07/17/2014] [Indexed: 11/30/2022]
|
9
|
Viti F, Scaglione S, Orro A, Milanesi L. Guidelines for managing data and processes in bone and cartilage tissue engineering. BMC Bioinformatics 2014; 15 Suppl 1:S14. [PMID: 24564199 PMCID: PMC4015954 DOI: 10.1186/1471-2105-15-s1-s14] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Background In the last decades, a wide number of researchers/clinicians involved in tissue engineering field published several works about the possibility to induce a tissue regeneration guided by the use of biomaterials. To this aim, different scaffolds have been proposed, and their effectiveness tested through in vitro and/or in vivo experiments. In this context, integration and meta-analysis approaches are gaining importance for analyses and reuse of data as, for example, those concerning the bone and cartilage biomarkers, the biomolecular factors intervening in cell differentiation and growth, the morphology and the biomechanical performance of a neo-formed tissue, and, in general, the scaffolds' ability to promote tissue regeneration. Therefore standards and ontologies are becoming crucial, to provide a unifying knowledge framework for annotating data and supporting the semantic integration and the unambiguous interpretation of novel experimental results. Results In this paper a conceptual framework has been designed for bone/cartilage tissue engineering domain, by now completely lacking standardized methods. A set of guidelines has been provided, defining the minimum information set necessary for describing an experimental study involved in bone and cartilage regenerative medicine field. In addition, a Bone/Cartilage Tissue Engineering Ontology (BCTEO) has been developed to provide a representation of the domain's concepts, specifically oriented to cells, and chemical composition, morphology, physical characterization of biomaterials involved in bone/cartilage tissue engineering research. Conclusions Considering that tissue engineering is a discipline that traverses different semantic fields and employs many data types, the proposed instruments represent a first attempt to standardize the domain knowledge and can provide a suitable means to integrate data across the field.
Collapse
|
10
|
Staying afloat in the sensor data deluge. Trends Ecol Evol 2012; 27:121-9. [DOI: 10.1016/j.tree.2011.11.009] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2011] [Revised: 11/24/2011] [Accepted: 11/24/2011] [Indexed: 11/22/2022]
|
11
|
Richter S, Loesel R, Purschke G, Schmidt-Rhaesa A, Scholtz G, Stach T, Vogt L, Wanninger A, Brenneis G, Döring C, Faller S, Fritsch M, Grobe P, Heuer CM, Kaul S, Møller OS, Müller CHG, Rieger V, Rothe BH, Stegner MEJ, Harzsch S. Invertebrate neurophylogeny: suggested terms and definitions for a neuroanatomical glossary. Front Zool 2010; 7:29. [PMID: 21062451 PMCID: PMC2996375 DOI: 10.1186/1742-9994-7-29] [Citation(s) in RCA: 232] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2010] [Accepted: 11/09/2010] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Invertebrate nervous systems are highly disparate between different taxa. This is reflected in the terminology used to describe them, which is very rich and often confusing. Even very general terms such as 'brain', 'nerve', and 'eye' have been used in various ways in the different animal groups, but no consensus on the exact meaning exists. This impedes our understanding of the architecture of the invertebrate nervous system in general and of evolutionary transformations of nervous system characters between different taxa. RESULTS We provide a glossary of invertebrate neuroanatomical terms with a precise and consistent terminology, taxon-independent and free of homology assumptions. This terminology is intended to form a basis for new morphological descriptions. A total of 47 terms are defined. Each entry consists of a definition, discouraged terms, and a background/comment section. CONCLUSIONS The use of our revised neuroanatomical terminology in any new descriptions of the anatomy of invertebrate nervous systems will improve the comparability of this organ system and its substructures between the various taxa, and finally even lead to better and more robust homology hypotheses.
Collapse
Affiliation(s)
- Stefan Richter
- Universität Rostock, Institut für Biowissenschaften, Abteilung für Allgemeine und Spezielle Zoologie, Universitätsplatz 2, D-18055 Rostock, Germany
| | - Rudi Loesel
- RWTH Aachen, Institute of Biology II, Department of Developmental Biology and Morphology of Animals, Mies-van-der-Rohe-Straße 15, D-52056 Aachen, Germany
| | - Günter Purschke
- Universität Osnabrück, Fachbereich Biologie/Chemie, AG Zoologie, Barbarastraße 11,, D-49069 Osnabrück, Germany
| | - Andreas Schmidt-Rhaesa
- Biozentrum Grindel/Zoological Museum, Martin-Luther-King-Platz 3, D-20146 Hamburg, Germany
| | - Gerhard Scholtz
- Humboldt-Universität zu Berlin, Institut für Biologie - Vergleichende Zoologie, Philippstraße 13, D-10115 Berlin, Germany
| | - Thomas Stach
- Freie Universität Berlin, Zoologie - Systematik und Evolutionsforschung, Königin-Luise-Straße 1-3, D-14195 Berlin, Germany
| | - Lars Vogt
- Universität Bonn, Institut für Evolutionsbiologie und Ökologie, An der Immenburg 1, D-53121 Bonn, Germany
| | - Andreas Wanninger
- University of Copenhagen, Department of Biology, Research Group for Comparative Zoology, Universitetsparken 15, DK-2100 Copenhagen, Denmark
| | - Georg Brenneis
- Universität Rostock, Institut für Biowissenschaften, Abteilung für Allgemeine und Spezielle Zoologie, Universitätsplatz 2, D-18055 Rostock, Germany
- Humboldt-Universität zu Berlin, Institut für Biologie - Vergleichende Zoologie, Philippstraße 13, D-10115 Berlin, Germany
| | - Carmen Döring
- Universität Osnabrück, Fachbereich Biologie/Chemie, AG Zoologie, Barbarastraße 11,, D-49069 Osnabrück, Germany
| | - Simone Faller
- RWTH Aachen, Institute of Biology II, Department of Developmental Biology and Morphology of Animals, Mies-van-der-Rohe-Straße 15, D-52056 Aachen, Germany
| | - Martin Fritsch
- Universität Rostock, Institut für Biowissenschaften, Abteilung für Allgemeine und Spezielle Zoologie, Universitätsplatz 2, D-18055 Rostock, Germany
| | - Peter Grobe
- Universität Bonn, Institut für Evolutionsbiologie und Ökologie, An der Immenburg 1, D-53121 Bonn, Germany
| | - Carsten M Heuer
- RWTH Aachen, Institute of Biology II, Department of Developmental Biology and Morphology of Animals, Mies-van-der-Rohe-Straße 15, D-52056 Aachen, Germany
| | - Sabrina Kaul
- Freie Universität Berlin, Zoologie - Systematik und Evolutionsforschung, Königin-Luise-Straße 1-3, D-14195 Berlin, Germany
| | - Ole S Møller
- Universität Rostock, Institut für Biowissenschaften, Abteilung für Allgemeine und Spezielle Zoologie, Universitätsplatz 2, D-18055 Rostock, Germany
| | - Carsten HG Müller
- Ernst-Moritz-Arndt-Universität Greifswald, Zoologisches Institut, Cytologie und Evolutionsbiologie, Johann-Sebastian-Bach-Straße 11/12, D-17487 Greifswald, Germany
| | - Verena Rieger
- Ernst-Moritz-Arndt-Universität Greifswald, Zoologisches Institut, Cytologie und Evolutionsbiologie, Johann-Sebastian-Bach-Straße 11/12, D-17487 Greifswald, Germany
| | - Birgen H Rothe
- Biozentrum Grindel/Zoological Museum, Martin-Luther-King-Platz 3, D-20146 Hamburg, Germany
| | - Martin EJ Stegner
- Universität Rostock, Institut für Biowissenschaften, Abteilung für Allgemeine und Spezielle Zoologie, Universitätsplatz 2, D-18055 Rostock, Germany
| | - Steffen Harzsch
- Ernst-Moritz-Arndt-Universität Greifswald, Zoologisches Institut, Cytologie und Evolutionsbiologie, Johann-Sebastian-Bach-Straße 11/12, D-17487 Greifswald, Germany
| |
Collapse
|
12
|
Gkoutos GV, Green ECJ, Mallon AM, Blake A, Greenaway S, Hancock JM, Davidson D. Ontologies for the description of mouse phenotypes. Comp Funct Genomics 2010; 5:545-51. [PMID: 18629136 PMCID: PMC2447424 DOI: 10.1002/cfg.430] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2004] [Accepted: 10/18/2004] [Indexed: 11/12/2022] Open
Abstract
Ontologies are becoming increasingly important for the efficient storage, retrieval
and mining of biological data. The description of phenotypes using ontologies is a
particularly complex problem. We outline a schema that can be used to describe
phenotypes by combining orthologous axiomatic ontologies. We also describe tools for
storing, browsing and searching such complex ontologies. Central to this approach is
that assays (protocols for measuring phenotypic characters) describe what has been
measured as well as how this was done, allowing assays to link individual organisms to
ontologies describing phenotypes. We have evaluated this approach by automatically
annotating data on 600 000 mutant mice phenotypes using the SHIRPA protocol. We
believe this approach will enable the flexible, extensible and detailed description of
phenotypes from any organism.
Collapse
Affiliation(s)
- G V Gkoutos
- MRC Mammalian Genetics Unit, Harwell, Oxfordshire OX11 0RD, UK.
| | | | | | | | | | | | | |
Collapse
|
13
|
Vogt L. Spatio-structural granularity of biological material entities. BMC Bioinformatics 2010; 11:289. [PMID: 20509878 PMCID: PMC3098069 DOI: 10.1186/1471-2105-11-289] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2010] [Accepted: 05/28/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND With the continuously increasing demands on knowledge- and data-management that databases have to meet, ontologies and the theories of granularity they use become more and more important. Unfortunately, currently used theories and schemes of granularity unnecessarily limit the performance of ontologies due to two shortcomings: (i) they do not allow the integration of multiple granularity perspectives into one granularity framework; (ii) they are not applicable to cumulative-constitutively organized material entities, which cover most of the biomedical material entities. RESULTS The above mentioned shortcomings are responsible for the major inconsistencies in currently used spatio-structural granularity schemes. By using the Basic Formal Ontology (BFO) as a top-level ontology and Keet's general theory of granularity, a granularity framework is presented that is applicable to cumulative-constitutively organized material entities. It provides a scheme for granulating complex material entities into their constitutive and regional parts by integrating various compositional and spatial granularity perspectives. Within a scale dependent resolution perspective, it even allows distinguishing different types of representations of the same material entity. Within other scale dependent perspectives, which are based on specific types of measurements (e.g. weight, volume, etc.), the possibility of organizing instances of material entities independent of their parthood relations and only according to increasing measures is provided as well. All granularity perspectives are connected to one another through overcrossing granularity levels, together forming an integrated whole that uses the compositional object perspective as an integrating backbone. This granularity framework allows to consistently assign structural granularity values to all different types of material entities. CONCLUSIONS The here presented framework provides a spatio-structural granularity framework for all domain reference ontologies that model cumulative-constitutively organized material entities. With its multi-perspectives approach it allows querying an ontology stored in a database at one's own desired different levels of detail: The contents of a database can be organized according to diverse granularity perspectives, which in their turn provide different views on its content (i.e. data, knowledge), each organized into different levels of detail.
Collapse
Affiliation(s)
- Lars Vogt
- Institut für Evolutionsbiologie und Okologie, Universität Bonn, An der Immenburg 1, Bonn, Germany.
| |
Collapse
|
14
|
NanoParticle Ontology for cancer nanotechnology research. J Biomed Inform 2010; 44:59-74. [PMID: 20211274 DOI: 10.1016/j.jbi.2010.03.001] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2009] [Revised: 01/26/2010] [Accepted: 03/03/2010] [Indexed: 11/21/2022]
Abstract
Data generated from cancer nanotechnology research are so diverse and large in volume that it is difficult to share and efficiently use them without informatics tools. In particular, ontologies that provide a unifying knowledge framework for annotating the data are required to facilitate the semantic integration, knowledge-based searching, unambiguous interpretation, mining and inferencing of the data using informatics methods. In this paper, we discuss the design and development of NanoParticle Ontology (NPO), which is developed within the framework of the Basic Formal Ontology (BFO), and implemented in the Ontology Web Language (OWL) using well-defined ontology design principles. The NPO was developed to represent knowledge underlying the preparation, chemical composition, and characterization of nanomaterials involved in cancer research. Public releases of the NPO are available through BioPortal website, maintained by the National Center for Biomedical Ontology. Mechanisms for editorial and governance processes are being developed for the maintenance, review, and growth of the NPO.
Collapse
|
15
|
Sarkar IN. Biomedical informatics and translational medicine. J Transl Med 2010; 8:22. [PMID: 20187952 PMCID: PMC2837642 DOI: 10.1186/1479-5876-8-22] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2009] [Accepted: 02/26/2010] [Indexed: 11/23/2022] Open
Abstract
Biomedical informatics involves a core set of methodologies that can provide a foundation for crossing the "translational barriers" associated with translational medicine. To this end, the fundamental aspects of biomedical informatics (e.g., bioinformatics, imaging informatics, clinical informatics, and public health informatics) may be essential in helping improve the ability to bring basic research findings to the bedside, evaluate the efficacy of interventions across communities, and enable the assessment of the eventual impact of translational medicine innovations on health policies. Here, a brief description is provided for a selection of key biomedical informatics topics (Decision Support, Natural Language Processing, Standards, Information Retrieval, and Electronic Health Records) and their relevance to translational medicine. Based on contributions and advancements in each of these topic areas, the article proposes that biomedical informatics practitioners ("biomedical informaticians") can be essential members of translational medicine teams.
Collapse
Affiliation(s)
- Indra Neil Sarkar
- Center for Clinical and Translational Science, Department of Microbiology and Molecular Genetics, University of Vermont, College of Medicine, 89 Beaumont Ave, Given Courtyard N309, Burlington, VT 05405, USA.
| |
Collapse
|
16
|
Renear AH, Palmer CL. Strategic Reading, Ontologies, and the Future of Scientific Publishing. Science 2009; 325:828-32. [PMID: 19679805 DOI: 10.1126/science.1157784] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Affiliation(s)
- Allen H. Renear
- Center for Informatics Research in Science and Scholarship, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
| | - Carole L. Palmer
- Center for Informatics Research in Science and Scholarship, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
| |
Collapse
|
17
|
Abstract
A drug discovery startup company or academic lab entering the screening arena faces numerous challenges as it tries to manage the large quantity of data generated by a typical drug discovery screening campaign. Although there are sophisticated off-the-shelf software solutions available, their use requires substantial forethought and attention to detail if the data they capture are to be of sufficient quality to serve the various purposes to which it will be put. For newcomers to the field of screening data management in particular, the problem is compounded by a lack of literature covering the practical aspects of managing screening data. The authors provide some practical advice based on their experience of using a commercially available software suite. They discuss issues ranging from the organizational aspects to examples of how the form and content of metadata can have a big impact on whether results can be easily queried, pivoted, and reported. It is also hoped that their experiences might provide an opportunity for reflection to data management practitioners operating in established environments.
Collapse
|
18
|
The future role of bio-ontologies for developing a general data standard in biology: chance and challenge for zoo-morphology. ZOOMORPHOLOGY 2008. [DOI: 10.1007/s00435-008-0081-5] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
19
|
Deus HF, Stanislaus R, Veiga DF, Behrens C, Wistuba II, Minna JD, Garner HR, Swisher SG, Roth JA, Correa AM, Broom B, Coombes K, Chang A, Vogel LH, Almeida JS. A Semantic Web management model for integrative biomedical informatics. PLoS One 2008; 3:e2946. [PMID: 18698353 PMCID: PMC2491554 DOI: 10.1371/journal.pone.0002946] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2008] [Accepted: 07/12/2008] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Data, data everywhere. The diversity and magnitude of the data generated in the Life Sciences defies automated articulation among complementary efforts. The additional need in this field for managing property and access permissions compounds the difficulty very significantly. This is particularly the case when the integration involves multiple domains and disciplines, even more so when it includes clinical and high throughput molecular data. METHODOLOGY/PRINCIPAL FINDINGS The emergence of Semantic Web technologies brings the promise of meaningful interoperation between data and analysis resources. In this report we identify a core model for biomedical Knowledge Engineering applications and demonstrate how this new technology can be used to weave a management model where multiple intertwined data structures can be hosted and managed by multiple authorities in a distributed management infrastructure. Specifically, the demonstration is performed by linking data sources associated with the Lung Cancer SPORE awarded to The University of Texas MD Anderson Cancer Center at Houston and the Southwestern Medical Center at Dallas. A software prototype, available with open source at www.s3db.org, was developed and its proposed design has been made publicly available as an open source instrument for shared, distributed data management. CONCLUSIONS/SIGNIFICANCE The Semantic Web technologies have the potential to addresses the need for distributed and evolvable representations that are critical for systems Biology and translational biomedical research. As this technology is incorporated into application development we can expect that both general purpose productivity software and domain specific software installed on our personal computers will become increasingly integrated with the relevant remote resources. In this scenario, the acquisition of a new dataset should automatically trigger the delegation of its analysis.
Collapse
Affiliation(s)
- Helena F. Deus
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Lisboa, Portugal
| | - Romesh Stanislaus
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Diogo F. Veiga
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Carmen Behrens
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Ignacio I. Wistuba
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
- Department of Pathology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - John D. Minna
- Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Harold R. Garner
- Hamon Center for Therapeutic Oncology Research, Simmons Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Center for Biomedical Inventions, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Stephen G. Swisher
- Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Jack A. Roth
- Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Arlene M. Correa
- Department of Thoracic and Cardiovascular Surgery, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Bradley Broom
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Kevin Coombes
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Allen Chang
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Lynn H. Vogel
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
- Department of Biomedical Informatics, Columbia University, New York, New York, United States of America
| | - Jonas S. Almeida
- Department of Bioinformatics and Computational Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| |
Collapse
|
20
|
Abstract
In recent years, biological ontologies have emerged as a means of representing and organizing biological concepts, enabling biologists, bioinformaticians, and others to derive meaning from large datasets.This chapter provides an overview of formal principles and practical considerations of ontology construction and application. Ontology development concepts are illustrated using examples drawn from the Gene Ontology (GO) and other OBO ontologies.
Collapse
|
21
|
Xu Q, Shi Y, Lu Q, Zhang G, Luo Q, Li Y. GORouter: an RDF model for providing semantic query and inference services for Gene Ontology and its associations. BMC Bioinformatics 2008; 9 Suppl 1:S6. [PMID: 18315859 PMCID: PMC2259407 DOI: 10.1186/1471-2105-9-s1-s6] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Background The most renowned biological ontology, Gene Ontology (GO) is widely used for annotations of genes and gene products of different organisms. However, there are shortcomings in the Resource Description Framework (RDF) data file provided by the GO consortium: 1) Lack of sufficient semantic relationships between pairs of terms coming from the three independent GO sub-ontologies, that limit the power to provide complex semantic queries and inference services based on it. 2) The term-centric view of GO annotation data and the fact that all information is stored in a single file. This makes attempts to retrieve GO annotations based on big volume datasets unmanageable. 3) No support of GOSlim. Results We propose a RDF model, GORouter, which encodes heterogeneous original data in a uniform RDF format, creates additional ontology mappings between GO terms, and introduces a set of inference rulebases. Furthermore, we use the Oracle Network Data Model (NDM) as the native RDF data repository and the table function RDF_MATCH to seamlessly combine the result of RDF queries with traditional relational data. As a result, the scale of GORouter is minimized; information not directly involved in semantic inference is put into relational tables. Conclusion Our work demonstrates how to use multiple semantic web tools and techniques to provide a mixture of semantic query and inference solutions of GO and its associations. GORouter is licensed under Apache License Version 2.0, and is accessible via the website: .
Collapse
Affiliation(s)
- Qingwei Xu
- The Key Laboratory of Biomedical Photonics of the Ministry of Education, HUST, Wuhan 430074, China.
| | | | | | | | | | | |
Collapse
|
22
|
Abstract
The past twenty years have witnessed an explosion of biological data in diverse database formats governed by heterogeneous infrastructures. Not only are semantics (attribute terms) different in meaning across databases, but their organization varies widely. Ontologies are a concept imported from computing science to describe different conceptual frameworks that guide the collection, organization and publication of biological data. An ontology is similar to a paradigm but has very strict implications for formatting and meaning in a computational context. The use of ontologies is a means of communicating and resolving semantic and organizational differences between biological databases in order to enhance their integration. The purpose of interoperability (or sharing between divergent storage and semantic protocols) is to allow scientists from around the world to share and communicate with each other. This paper describes the rapid accumulation of biological data, its various organizational structures, and the role that ontologies play in interoperability.
Collapse
Affiliation(s)
- Nadine Schuurman
- Department of Geography, Simon Fraser University RCB 7123, 8888 University Drive, Burnaby, British Columbia, Canada.
| | | |
Collapse
|
23
|
Xu Q, Huang Y, Liu Q, Zhang G, Li Y, Lu Q. A Semantic Web model of GO and its annotations. CHINESE SCIENCE BULLETIN-CHINESE 2008. [DOI: 10.1007/s11434-008-0137-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
24
|
Abstract
Biomedical ontologies are emerging as critical tools in genomic and proteomic research, where complex data in disparate resources need to be integrated. A number of ontologies describe properties that can be attributed to proteins. For example, protein functions are described by the Gene Ontology (GO) and human diseases by SNOMED CT or ICD10. There is, however, a gap in the current set of ontologies - one that describes the protein entities themselves and their relationships. We have designed the PRotein Ontology (PRO) to facilitate protein annotation and to guide new experiments. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships to the representation of the multiple protein forms of a gene (products generated by genetic variation, alternative splicing, proteolytic cleavage, and other post-translational modifications). PRO will allow the specification of relationships between PRO, GO and other ontologies in the OBO Foundry. Here we describe the initial development of PRO, illustrated using human and mouse proteins involved in the transforming growth factor-beta and bone morphogenetic protein signaling pathways.
Collapse
|
25
|
Sansone SA, Rocca-Serra P, Tong W, Fostel J, Morrison N, Jones AR. A strategy capitalizing on synergies: the Reporting Structure for Biological Investigation (RSBI) working group. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2007; 10:164-71. [PMID: 16901222 DOI: 10.1089/omi.2006.10.164] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
In this article we present the Reporting Structure for Biological Investigation (RSBI), a working group under the Microarray Gene Expression Data (MGED) Society umbrella. RSBI brings together several communities to tackle the challenges associated with integrating data and representing complex biological investigations, employing multiple OMICS technologies. Currently, RSBI includes environmental genomics, nutrigenomics and toxicogenomics communities, where independent activities are underway to develop databases and establish data communication standards within their respective domains. The RSBI working group has been conceived as a "single point of focus" for these communities, conforming to general accepted view that duplication and incompatibility should be avoided where possible. This endeavour has aimed to synergize insular solutions into one common terminology between biologically driven standardisation efforts and has also resulted in strong collaborations and shared understanding between those in the technological domain. Through extensive liaisons with many standards efforts, several threads have been woven with the hope that ultimately technology-centered standards and their specific extensions into biological domains of interest will not only stand alone, but will also be able to function together, as interchangeable modules.
Collapse
|
26
|
Abstract
UNLABELLED OBO-Edit is an open source, platform-independent ontology editor developed and maintained by the Gene Ontology Consortium. Implemented in Java, OBO-Edit uses a graph-oriented approach to display and edit ontologies. OBO-Edit is particularly valuable for viewing and editing biomedical ontologies. AVAILABILITY https://sourceforge.net/project/showfiles.php?group_id=36855.
Collapse
Affiliation(s)
- John Day-Richter
- Berkeley Bioinformatics and Ontology Project, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | | | | |
Collapse
|
27
|
Ilic K, Kellogg EA, Jaiswal P, Zapata F, Stevens PF, Vincent LP, Avraham S, Reiser L, Pujar A, Sachs MM, Whitman NT, McCouch SR, Schaeffer ML, Ware DH, Stein LD, Rhee SY. The plant structure ontology, a unified vocabulary of anatomy and morphology of a flowering plant. PLANT PHYSIOLOGY 2007; 143:587-99. [PMID: 17142475 PMCID: PMC1803752 DOI: 10.1104/pp.106.092825] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Formal description of plant phenotypes and standardized annotation of gene expression and protein localization data require uniform terminology that accurately describes plant anatomy and morphology. This facilitates cross species comparative studies and quantitative comparison of phenotypes and expression patterns. A major drawback is variable terminology that is used to describe plant anatomy and morphology in publications and genomic databases for different species. The same terms are sometimes applied to different plant structures in different taxonomic groups. Conversely, similar structures are named by their species-specific terms. To address this problem, we created the Plant Structure Ontology (PSO), the first generic ontological representation of anatomy and morphology of a flowering plant. The PSO is intended for a broad plant research community, including bench scientists, curators in genomic databases, and bioinformaticians. The initial releases of the PSO integrated existing ontologies for Arabidopsis (Arabidopsis thaliana), maize (Zea mays), and rice (Oryza sativa); more recent versions of the ontology encompass terms relevant to Fabaceae, Solanaceae, additional cereal crops, and poplar (Populus spp.). Databases such as The Arabidopsis Information Resource, Nottingham Arabidopsis Stock Centre, Gramene, MaizeGDB, and SOL Genomics Network are using the PSO to describe expression patterns of genes and phenotypes of mutants and natural variants and are regularly contributing new annotations to the Plant Ontology database. The PSO is also used in specialized public databases, such as BRENDA, GENEVESTIGATOR, NASCArrays, and others. Over 10,000 gene annotations and phenotype descriptions from participating databases can be queried and retrieved using the Plant Ontology browser. The PSO, as well as contributed gene associations, can be obtained at www.plantontology.org.
Collapse
Affiliation(s)
- Katica Ilic
- Department of Plant Biology, Carnegie Institution, Stanford, California 94305, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Ahrens CH, Wagner U, Rehrauer HK, Türker C, Schlapbach R. Current challenges and approaches for the synergistic use of systems biology data in the scientific community. EXS 2007; 97:277-307. [PMID: 17432272 DOI: 10.1007/978-3-7643-7439-6_12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Today's rapid development and broad application of high-throughput analytical technologies are transforming biological research and provide an amount of data and analytical opportunities to understand the fundamentals of biological processes undreamt of in past years. To fully exploit the potential of the large amount of data, scientists must be able to understand and interpret the information in an integrative manner. While the sheer data volume and heterogeneity of technical platforms within each discipline already poses a significant challenge, the heterogeneity of platforms and data formats across disciplines makes the integrative management, analysis, and interpretation of data a significantly more difficult task. This challenge thus lies at the heart of systems biology, which aims at a quantitative understanding of biological systems to the extent that systemic features can be predicted. In this chapter, we discuss several key issues that need to be addressed in order to put an integrated systems biology data analysis and mining within reach.
Collapse
Affiliation(s)
- Christian H Ahrens
- Functional Genomics Center Zurich, Winterthurerstrasse 190, Y32H66, CH-8057 Zurich, Switzerland.
| | | | | | | | | |
Collapse
|
29
|
Osborne JD, Zhu LJ, Lin SM, Kibbe WA. Interpreting microarray results with gene ontology and MeSH. Methods Mol Biol 2007; 377:223-42. [PMID: 17634620 DOI: 10.1007/978-1-59745-390-5_14] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Methods are described to take a list of genes generated from a microarray experiment and interpret these results using various tools and ontologies. A workflow is described that details how to convert gene identifiers with SOURCE and MatchMiner and then use these converted gene lists to search the gene ontology (GO) and the medical subject headings (MeSH) ontology. Examples of searching GO with DAVID, EASE, and GOMiner are provided along with an interpretation of results. The mining of MeSH using high-density array pattern interpreter with a set of gene identifiers is also described.
Collapse
Affiliation(s)
- John D Osborne
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL, USA
| | | | | | | |
Collapse
|
30
|
Friedman C, Borlawsky T, Shagina L, Xing HR, Lussier YA. Bio-Ontology and text: bridging the modeling gap. Bioinformatics 2006; 22:2421-9. [PMID: 16870928 PMCID: PMC2879055 DOI: 10.1093/bioinformatics/btl405] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Natural language processing (NLP) techniques are increasingly being used in biology to automate the capture of new biological discoveries in text, which are being reported at a rapid rate. Yet, information represented in NLP data structures is classically very different from information organized with ontologies as found in model organisms or genetic databases. To facilitate the computational reuse and integration of information buried in unstructured text with that of genetic databases, we propose and evaluate a translational schema that represents a comprehensive set of phenotypic and genetic entities, as well as their closely related biomedical entities and relations as expressed in natural language. In addition, the schema connects different scales of biological information, and provides mappings from the textual information to existing ontologies, which are essential in biology for integration, organization, dissemination and knowledge management of heterogeneous phenotypic information. A common comprehensive representation for otherwise heterogeneous phenotypic and genetic datasets, such as the one proposed, is critical for advancing systems biology because it enables acquisition and reuse of unprecedented volumes of diverse types of knowledge and information from text. RESULTS A novel representational schema, PGschema, was developed that enables translation of phenotypic, genetic and their closely related information found in textual narratives to a well-defined data structure comprising phenotypic and genetic concepts from established ontologies along with modifiers and relationships. Evaluation for coverage of a selected set of entities showed that 90% of the information could be represented (95% confidence interval: 86-93%; n = 268). Moreover, PGschema can be expressed automatically in an XML format using natural language techniques to process the text. To our knowledge, we are providing the first evaluation of a translational schema for NLP that contains declarative knowledge about genes and their associated biomedical data (e.g. phenotypes). AVAILABILITY http://zellig.cpmc.columbia.edu/PGschema
Collapse
Affiliation(s)
- Carol Friedman
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA.
| | | | | | | | | |
Collapse
|
31
|
Bucciarelli B, Hanan J, Palmquist D, Vance CP. A standardized method for analysis of Medicago truncatula phenotypic development. PLANT PHYSIOLOGY 2006; 142:207-19. [PMID: 16877701 PMCID: PMC1557601 DOI: 10.1104/pp.106.082594] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2006] [Accepted: 07/12/2006] [Indexed: 05/11/2023]
Abstract
Medicago truncatula has become a model system to study legume biology. It is imperative that detailed growth characteristics of the most commonly used cultivar, line A17 cv Jemalong, be documented. Such analysis creates a basis to analyze phenotypic alterations due to genetic lesions or environmental stress and is essential to characterize gene function and its relationship to morphological development. We have documented morphological development of M. truncatula to characterize its temporal developmental growth pattern; developed a numerical nomenclature coding system that identifies stages in morphological development; tested the coding system to identify phenotypic differences under phosphorus (P) and nitrogen (N) deprivation; and created visual models using the L-system formalism. The numerical nomenclature coding system, based on a series of defined growth units, represents incremental steps in morphological development. Included is a decimal component dividing growth units into nine substages. A measurement component helps distinguish alterations that may be missed by the coding system. Growth under N and P deprivation produced morphological alterations that were distinguishable using the coding system and its measurement component. N and P deprivation resulted in delayed leaf development and expansion, delayed axillary shoot emergence and elongation, decreased leaf and shoot size, and altered root growth. Timing and frequency of flower emergence in P-deprived plants was affected. This numerical coding system may be used as a standardized method to analyze phenotypic variation in M. truncatula due to nutrient stress, genetic lesions, or other factors and should allow valid growth comparisons across geographically distant laboratories.
Collapse
Affiliation(s)
- Bruna Bucciarelli
- United States Department of Agriculture, Agricultural Research Service, Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, 55108, USA
| | | | | | | |
Collapse
|
32
|
Castro AG, Rocca-Serra P, Stevens R, Taylor C, Nashar K, Ragan MA, Sansone SA. The use of concept maps during knowledge elicitation in ontology development processes--the nutrigenomics use case. BMC Bioinformatics 2006; 7:267. [PMID: 16725019 PMCID: PMC1524992 DOI: 10.1186/1471-2105-7-267] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2005] [Accepted: 05/25/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Incorporation of ontologies into annotations has enabled 'semantic integration' of complex data, making explicit the knowledge within a certain field. One of the major bottlenecks in developing bio-ontologies is the lack of a unified methodology. Different methodologies have been proposed for different scenarios, but there is no agreed-upon standard methodology for building ontologies. The involvement of geographically distributed domain experts, the need for domain experts to lead the design process, the application of the ontologies and the life cycles of bio-ontologies are amongst the features not considered by previously proposed methodologies. RESULTS Here, we present a methodology for developing ontologies within the biological domain. We describe our scenario, competency questions, results and milestones for each methodological stage. We introduce the use of concept maps during knowledge acquisition phases as a feasible transition between domain expert and knowledge engineer. CONCLUSION The contributions of this paper are the thorough description of the steps we suggest when building an ontology, example use of concept maps, consideration of applicability to the development of lower-level ontologies and application to decentralised environments. We have found that within our scenario conceptual maps played an important role in the development process.
Collapse
Affiliation(s)
- Alexander Garcia Castro
- Microarray Informatics Team, The European Bioinformatics Institute – European Molecular Biology Laboratory Outstation, Wellcome Trust Genome Campus CB10 1SD, Cambridge Hinxton, UK
- Australian Research Council Centre in Bioinformatics, Institute for Molecular Bioscience, The University of Queensland 4072, St Lucia, Australia
- Institute for Molecular Bioscience, The University of Queensland 4072, Brisbane, Australia
- Australian Centre for Plant Functional Genomics, The University of Queensland 4072, Brisbane, Australia
| | - Philippe Rocca-Serra
- Microarray Informatics Team, The European Bioinformatics Institute – European Molecular Biology Laboratory Outstation, Wellcome Trust Genome Campus CB10 1SD, Cambridge Hinxton, UK
| | - Robert Stevens
- School of Computer Science, University of Manchester, Kilburn Building, Oxford Road Manchester M13 9PL, Manchester, UK
| | - Chris Taylor
- Microarray Informatics Team, The European Bioinformatics Institute – European Molecular Biology Laboratory Outstation, Wellcome Trust Genome Campus CB10 1SD, Cambridge Hinxton, UK
| | - Karim Nashar
- School of Computer Science, University of Manchester, Kilburn Building, Oxford Road Manchester M13 9PL, Manchester, UK
| | - Mark A Ragan
- Australian Research Council Centre in Bioinformatics, Institute for Molecular Bioscience, The University of Queensland 4072, St Lucia, Australia
- Institute for Molecular Bioscience, The University of Queensland 4072, Brisbane, Australia
| | - Susanna-Assunta Sansone
- Microarray Informatics Team, The European Bioinformatics Institute – European Molecular Biology Laboratory Outstation, Wellcome Trust Genome Campus CB10 1SD, Cambridge Hinxton, UK
| |
Collapse
|
33
|
Abstract
Bioinformatics plays an essential role in today's plant science. As the amount of data grows exponentially, there is a parallel growth in the demand for tools and methods in data management, visualization, integration, analysis, modeling, and prediction. At the same time, many researchers in biology are unfamiliar with available bioinformatics methods, tools, and databases, which could lead to missed opportunities or misinterpretation of the information. In this review, we describe some of the key concepts, methods, software packages, and databases used in bioinformatics, with an emphasis on those relevant to plant science. We also cover some fundamental issues related to biological sequence analyses, transcriptome analyses, computational proteomics, computational metabolomics, bio-ontologies, and biological databases. Finally, we explore a few emerging research topics in bioinformatics.
Collapse
Affiliation(s)
- Seung Yon Rhee
- Department of Plant Biology, Carnegie Institution, Stanford, California 94305, USA.
| | | | | |
Collapse
|
34
|
|
35
|
Antezana E, Tsiporkova E, Mironov V, Kuiper M. A Cell-Cycle Knowledge Integration Framework. LECTURE NOTES IN COMPUTER SCIENCE 2006. [DOI: 10.1007/11799511_4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
36
|
|
37
|
Van Regenmortel MHV. Reductionism and complexity in molecular biology. Scientists now have the tools to unravel biological and overcome the limitations of reductionism. EMBO Rep 2005; 5:1016-20. [PMID: 15520799 PMCID: PMC1299179 DOI: 10.1038/sj.embor.7400284] [Citation(s) in RCA: 228] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Affiliation(s)
- Marc H V Van Regenmortel
- Ecole Supérieure de Biotechnologie de Strasbourg at the Centre National de la Recherche Scientifique (CNRS) in Strasbourg, France.
| |
Collapse
|
38
|
Abstract
The effective integration of data and knowledge from many disparate sources will be crucial to future drug discovery. Data integration is a key element of conducting scientific investigations with modern platform technologies, managing increasingly complex discovery portfolios and processes, and fully realizing economies of scale in large enterprises. However, viewing data integration as simply an 'IT problem' underestimates the novel and serious scientific and management challenges it embodies - challenges that could require significant methodological and even cultural changes in our approach to data.
Collapse
Affiliation(s)
- David B Searls
- Bioinformatics Division, Genetics Research, GlaxoSmithKline Pharmaceuticals, 709 Swedeland Road, P.O. Box 1539, King of Prussia, Pennsylvania 19406, USA.
| |
Collapse
|