1
|
Deans AR, Nastasi LF, Davis C. GallOnt: An ontology for plant gall phenotypes. Biodivers Data J 2024; 12:e128585. [PMID: 39229384 PMCID: PMC11369494 DOI: 10.3897/bdj.12.e128585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 08/18/2024] [Indexed: 09/05/2024] Open
Abstract
Galls are novel plant structures that develop in response to select biotic stressors. These structures, extended phenotypes of the inducer, usually serve to protect and feed the inducer or its progeny. This life history strategy has evolved dozens of times, and tens of thousands of species - including many bacteria, fungi, nematodes, mites and insects - are capable of manipulating plants in this way. The variation in gall phenotypes is extraordinary across species but usually predictable for each species of inducer. We introduce here a new ontology, GallOnt, that facilitates consistent descriptions and the semantic representation of and reasoning over plant gall phenotype data. GallOnt was largely developed from ontologies in the Open Biological and Biomedical Ontology (OBO) Foundry and stands to connect plant gall phenotypes to knowledge derived from model plant systems, including genotype-phenotype and agricultural research. We also introduce the idea of a new gall data standard - Minimum Information for the Description of Galls (MIDG version 0.1) - as a starting point for discussions regarding cecidology best practices.
Collapse
Affiliation(s)
- Andrew R Deans
- Frost Entomological Museum, The Pennsylvania State University, University Park, United States of AmericaFrost Entomological Museum, The Pennsylvania State UniversityUniversity ParkUnited States of America
| | - Louis Frank Nastasi
- Frost Entomological Museum, The Pennsylvania State University, University Park, United States of AmericaFrost Entomological Museum, The Pennsylvania State UniversityUniversity ParkUnited States of America
| | - Charles Davis
- Frost Entomological Museum, The Pennsylvania State University, University Park, United States of AmericaFrost Entomological Museum, The Pennsylvania State UniversityUniversity ParkUnited States of America
| |
Collapse
|
2
|
Schenk PM, Wright AJ, West R, Hastings J, Lorencatto F, Moore C, Hayes E, Schneider V, Howes E, Michie S. An ontology of mechanisms of action in behaviour change interventions. Wellcome Open Res 2024; 8:337. [PMID: 38481854 PMCID: PMC10933577 DOI: 10.12688/wellcomeopenres.19489.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/24/2024] [Indexed: 06/04/2024] Open
Abstract
Background Behaviour change interventions influence behaviour through causal processes called "mechanisms of action" (MoAs). Reports of such interventions and their evaluations often use inconsistent or ambiguous terminology, creating problems for searching, evidence synthesis and theory development. This inconsistency includes the reporting of MoAs. An ontology can help address these challenges by serving as a classification system that labels and defines MoAs and their relationships. The aim of this study was to develop an ontology of MoAs of behaviour change interventions. Methods To develop the MoA Ontology, we (1) defined the ontology's scope; (2) identified, labelled and defined the ontology's entities; (3) refined the ontology by annotating (i.e., coding) MoAs in intervention reports; (4) refined the ontology via stakeholder review of the ontology's comprehensiveness and clarity; (5) tested whether researchers could reliably apply the ontology to annotate MoAs in intervention evaluation reports; (6) refined the relationships between entities; (7) reviewed the alignment of the MoA Ontology with other relevant ontologies, (8) reviewed the ontology's alignment with the Theories and Techniques Tool; and (9) published a machine-readable version of the ontology. Results An MoA was defined as "a process that is causally active in the relationship between a behaviour change intervention scenario and its outcome behaviour". We created an initial MoA Ontology with 261 entities through Steps 2-5. Inter-rater reliability for annotating study reports using these entities was α=0.68 ("acceptable") for researchers familiar with the ontology and α=0.47 for researchers unfamiliar with it. As a result of additional revisions (Steps 6-8), 23 further entities were added to the ontology resulting in 284 entities organised in seven hierarchical levels. Conclusions The MoA Ontology extensively captures MoAs of behaviour change interventions. The ontology can serve as a controlled vocabulary for MoAs to consistently describe and synthesise evidence about MoAs across diverse sources.
Collapse
Affiliation(s)
- Paulina M. Schenk
- Centre for Behaviour Change, University College London, London, England, UK
| | - Alison J. Wright
- Centre for Behaviour Change, University College London, London, England, UK
- Institute of Pharmaceutical Science, King's College London, London, England, UK
| | - Robert West
- Department of Behavioural Science and Health, University College London, London, England, UK
| | - Janna Hastings
- Institute for Implementation Science in Health Care, Universitat Zurich, Zürich, Zurich, Switzerland
- School of Medicine, University of St Gallen, St. Gallen, St. Gallen, Switzerland
| | - Fabiana Lorencatto
- Centre for Behaviour Change, University College London, London, England, UK
| | - Candice Moore
- Centre for Behaviour Change, University College London, London, England, UK
| | - Emily Hayes
- Centre for Behaviour Change, University College London, London, England, UK
| | - Verena Schneider
- Research Department of Epidemiology and Public Health, University College London, London, England, UK
| | - Ella Howes
- Leeds Unit for Complex Intervention Development, University of Leeds, Leeds, England, UK
| | - Susan Michie
- Centre for Behaviour Change, University College London, London, England, UK
| |
Collapse
|
3
|
Marques MM, Wright AJ, Corker E, Johnston M, West R, Hastings J, Zhang L, Michie S. The Behaviour Change Technique Ontology: Transforming the Behaviour Change Technique Taxonomy v1. Wellcome Open Res 2024; 8:308. [PMID: 37593567 PMCID: PMC10427801 DOI: 10.12688/wellcomeopenres.19363.1] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/25/2024] [Indexed: 08/19/2023] Open
Abstract
Background The Behaviour Change Technique Taxonomy v1 (BCTTv1) specifies the potentially active content of behaviour change interventions. Evaluation of BCTTv1 showed the need to extend it into a formal ontology, improve its labels and definitions, add BCTs and subdivide existing BCTs. We aimed to develop a Behaviour Change Technique Ontology (BCTO) that would meet these needs. Methods The BCTO was developed by: (1) collating and synthesising feedback from multiple sources; (2) extracting information from published studies and classification systems; (3) multiple iterations of reviewing and refining entities, and their labels, definitions and relationships; (4) refining the ontology via expert stakeholder review of its comprehensiveness and clarity; (5) testing whether researchers could reliably apply the ontology to identify BCTs in intervention reports; and (6) making it available online and creating a computer-readable version. Results Initially there were 282 proposed changes to BCTTv1. Following first-round review, 19 BCTs were split into two or more BCTs, 27 new BCTs were added and 26 BCTs were moved into a different group, giving 161 BCTs hierarchically organised into 12 logically defined higher-level groups in up to five hierarchical levels. Following expert stakeholder review, the refined ontology had 247 BCTs hierarchically organised into 20 higher-level groups. Independent annotations of intervention evaluation reports by researchers familiar and unfamiliar with the ontology resulted in good levels of inter-rater reliability (0.82 and 0.79, respectively). Following revision informed by this exercise, 34 BCTs were added, resulting in the first published version of the BCTO containing 281 BCTs organised into 20 higher-level groups over five hierarchical levels. Discussion The BCTO provides a standard terminology and comprehensive classification system for the content of behaviour change interventions that can be reliably used to describe interventions. The development and maintenance of an ontology is an iterative and ongoing process; no ontology is ever 'finished'. The BCTO will continue to evolve and grow (e.g. new BCTs or improved definitions) as a result of user feedback and new available evidence.
Collapse
Affiliation(s)
- Marta M. Marques
- Centre for Behaviour Change, University College London, London, England, UK
- Comprehensive Health Research Centre (CHRC), NOVA National School of Public Health, NOVA University of Lisbon, Lisbon, Lisbon, Portugal
| | - Alison J. Wright
- Centre for Behaviour Change, University College London, London, England, UK
- Institute of Pharmaceutical Science, King's College London, London, England, UK
| | - Elizabeth Corker
- Clinical and Applied Psychology Unit, Department of Psychology, The University of Sheffield, Sheffield, England, UK
| | - Marie Johnston
- Aberdeen Health Psychology Group, University of Aberdeen, Aberdeen, Scotland, UK
| | - Robert West
- Centre for Behaviour Change, University College London, London, England, UK
| | - Janna Hastings
- Institute for Implementation Science in Health Care, Faculty of Medicine, University of Zurich, Zurich, Switzerland
- School of Medicine, University of St Gallen, St. Gallen, St. Gallen, Switzerland
| | - Lisa Zhang
- Centre for Behaviour Change, University College London, London, England, UK
| | - Susan Michie
- Centre for Behaviour Change, University College London, London, England, UK
| |
Collapse
|
4
|
Azzi R, Bordea G, Griffier R, Nikiema JN, Mougin F. Enriching the FIDEO ontology with food-drug interactions from online knowledge sources. J Biomed Semantics 2024; 15:1. [PMID: 38438913 PMCID: PMC10913206 DOI: 10.1186/s13326-024-00302-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 02/05/2024] [Indexed: 03/06/2024] Open
Abstract
The increasing number of articles on adverse interactions that may occur when specific foods are consumed with certain drugs makes it difficult to keep up with the latest findings. Conflicting information is available in the scientific literature and specialized knowledge bases because interactions are described in an unstructured or semi-structured format. The FIDEO ontology aims to integrate and represent information about food-drug interactions in a structured way. This article reports on the new version of this ontology in which more than 1700 interactions are integrated from two online resources: DrugBank and Hedrine. These food-drug interactions have been represented in FIDEO in the form of precompiled concepts, each of which specifies both the food and the drug involved. Additionally, competency questions that can be answered are reviewed, and avenues for further enrichment are discussed.
Collapse
Affiliation(s)
- Rabia Azzi
- Univ. Bordeaux, Inserm, BPH, U1219, F-33000, Bordeaux, France
- CHU de Bordeaux, Service d'information médicale, F-33000, Bordeaux, France
| | - Georgeta Bordea
- Univ. Bordeaux, Inserm, BPH, U1219, F-33000, Bordeaux, France
- Univ. La Rochelle, L3i, F-17000, La Rochelle, France
| | - Romain Griffier
- Univ. Bordeaux, Inserm, BPH, U1219, F-33000, Bordeaux, France
- CHU de Bordeaux, Service d'information médicale, F-33000, Bordeaux, France
| | - Jean Noël Nikiema
- Department of Management, Evaluation and Health Policy, School of Public Health, Université de Montréal, Québec, Canada
| | - Fleur Mougin
- Univ. Bordeaux, Inserm, BPH, U1219, F-33000, Bordeaux, France.
| |
Collapse
|
5
|
Wang Y, Ye M, Zhang F, Freeman ZT, Yu H, Ye X, He Y. Ontology-based taxonomical analysis of experimentally verified natural and laboratory human coronavirus hosts and its implication for COVID-19 virus origination and transmission. PLoS One 2024; 19:e0295541. [PMID: 38252647 PMCID: PMC10802970 DOI: 10.1371/journal.pone.0295541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 11/26/2023] [Indexed: 01/24/2024] Open
Abstract
To fully understand COVID-19, it is critical to study all possible hosts of SARS-CoV-2 (the pathogen of COVID-19). In this work, we collected, annotated, and performed ontology-based taxonomical analysis of all the reported and verified hosts for all human coronaviruses including SARS-CoV, MERS-CoV, SARS-CoV-2, HCoV-229E, HCoV-NL63, HCoV-OC43, and HCoV-HKU1. A total of 37 natural hosts and 19 laboratory animal hosts of human coronaviruses were identified based on experimental evidence. Our analysis found that all the verified susceptible natural and laboratory animals belong to therian mammals. Specifically, these 37 natural therian hosts include one wildlife marsupial mammal (i.e., Virginia opossum) and 36 Eutheria mammals (a.k.a. placental mammals). The 19 laboratory animal hosts are also classified as therian mammals. The mouse models with genetically modified human ACE2 or DPP4 were more susceptible to virulent human coronaviruses with clear symptoms, suggesting the critical role of ACE2 and DPP4 to coronavirus virulence. Coronaviruses became more virulent and adaptive in the mouse hosts after a series of viral passages in the mice, providing clue to the possible coronavirus origination. The Huanan Seafood Wholesale Market animals identified early in the COVID-19 outbreak were also systematically analyzed as possible COVID-19 hosts. To support knowledge standardization and query, the annotated host knowledge was modeled and represented in the Coronavirus Infectious Disease Ontology (CIDO). Based on our and others' findings, we further propose a MOVIE model (i.e., Multiple-Organism viral Variations and Immune Evasion) to address how viral variations in therian animal hosts and the host immune evasion might have led to dynamic COVID-19 pandemic outcomes.
Collapse
Affiliation(s)
- Yang Wang
- Guizhou University School of Medicine, Guiyang, Guizhou, China
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and NHC Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou University, Guiyang, Guizhou, China
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI, United States of America
| | - Muhui Ye
- Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong, China
| | - Fengwei Zhang
- Guizhou University School of Medicine, Guiyang, Guizhou, China
| | - Zachary Thomas Freeman
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI, United States of America
| | - Hong Yu
- Guizhou University School of Medicine, Guiyang, Guizhou, China
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and NHC Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou University, Guiyang, Guizhou, China
| | - Xianwei Ye
- Guizhou University School of Medicine, Guiyang, Guizhou, China
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and NHC Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou University, Guiyang, Guizhou, China
| | - Yongqun He
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI, United States of America
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, MI, United States of America
- Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, United States of America
| |
Collapse
|
6
|
Bernabé CH, Queralt-Rosinach N, Silva Souza VE, Bonino da Silva Santos LO, Mons B, Jacobsen A, Roos M. The use of foundational ontologies in biomedical research. J Biomed Semantics 2023; 14:21. [PMID: 38082345 PMCID: PMC10712036 DOI: 10.1186/s13326-023-00300-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 11/29/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND The FAIR principles recommend the use of controlled vocabularies, such as ontologies, to define data and metadata concepts. Ontologies are currently modelled following different approaches, sometimes describing conflicting definitions of the same concepts, which can affect interoperability. To cope with that, prior literature suggests organising ontologies in levels, where domain specific (low-level) ontologies are grounded in domain independent high-level ontologies (i.e., foundational ontologies). In this level-based organisation, foundational ontologies work as translators of intended meaning, thus improving interoperability. Despite their considerable acceptance in biomedical research, there are very few studies testing foundational ontologies. This paper describes a systematic literature mapping that was conducted to understand how foundational ontologies are used in biomedical research and to find empirical evidence supporting their claimed (dis)advantages. RESULTS From a set of 79 selected papers, we identified that foundational ontologies are used for several purposes: ontology construction, repair, mapping, and ontology-based data analysis. Foundational ontologies are claimed to improve interoperability, enhance reasoning, speed up ontology development and facilitate maintainability. The complexity of using foundational ontologies is the most commonly cited downside. Despite being used for several purposes, there were hardly any experiments (1 paper) testing the claims for or against the use of foundational ontologies. In the subset of 49 papers that describe the development of an ontology, it was observed a low adherence to ontology construction (16 papers) and ontology evaluation formal methods (4 papers). CONCLUSION Our findings have two main implications. First, the lack of empirical evidence about the use of foundational ontologies indicates a need for evaluating the use of such artefacts in biomedical research. Second, the low adherence to formal methods illustrates how the field could benefit from a more systematic approach when dealing with the development and evaluation of ontologies. The understanding of how foundational ontologies are used in the biomedical field can drive future research towards the improvement of ontologies and, consequently, data FAIRness. The adoption of formal methods can impact the quality and sustainability of ontologies, and reusing these methods from other fields is encouraged.
Collapse
Affiliation(s)
- César H Bernabé
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands.
| | | | | | - Luiz Olavo Bonino da Silva Santos
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
- University of Twente, Enschede, The Netherlands
| | - Barend Mons
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Annika Jacobsen
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Marco Roos
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands.
| |
Collapse
|
7
|
Wright AJ, Zhang L, Howes E, Veall C, Corker E, Johnston M, Hastings J, West R, Michie S. Specifying how intervention content is communicated: Development of a Style of Delivery Ontology. Wellcome Open Res 2023; 8:456. [PMID: 39193088 PMCID: PMC11347912 DOI: 10.12688/wellcomeopenres.19899.1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/13/2023] [Indexed: 08/29/2024] Open
Abstract
Background: Investigating and enhancing the effectiveness of behaviour change interventions requires detailed and consistent specification of all aspects of interventions. We need to understand not only their content, that is the specific techniques, but also the source, mode, schedule, and style in which this content is delivered. Delivery style refers to the manner by which content is communicated to intervention participants. This paper reports the development of an ontology for specifying the style of delivery of interventions that depend on communication. This forms part of the Behaviour Change Intervention Ontology, which aims to cover all aspects of behaviour change intervention scenarios. Methods: The Style of Delivery Ontology was developed following methods for ontology development used in the Human Behaviour-Change Project, with seven key steps: 1) defining the scope of the ontology, 2) identifying key entities and developing their preliminary definitions by reviewing 100 behaviour change intervention evaluation reports and existing classification systems, 3) refining the ontology by piloting the ontology through annotations of 100 reports, 4) stakeholder review by eight behavioural science and public health experts, 5) inter-rater reliability testing through annotating 100 reports using the ontology, 6) specifying ontological relationships between entities, and 7) disseminating and maintaining the ontology. Results: The resulting ontology is a five-level hierarchical structure comprising 145 unique entities relevant to style of delivery. Key areas include communication processes, communication styles, and attributes of objects used in communication processes. Inter-rater reliability for annotating intervention evaluation reports was α=0.77 (good) for those familiar with the ontology and α=0.62 (acceptable) for those unfamiliar with it. Conclusions: The Style of Delivery Ontology can be used for both annotating and describing behaviour change interventions in a consistent and coherent manner, thereby improving evidence comparison, synthesis, replication, and implementation of effective interventions.
Collapse
Affiliation(s)
- Alison J. Wright
- Centre for Behaviour Change, University College London, London, England, UK
- Institute of Pharmaceutical Science, King's College London, London, England, UK
| | - Lisa Zhang
- Centre for Behaviour Change, University College London, London, England, UK
| | - Ella Howes
- Centre for Behaviour Change, University College London, London, England, UK
| | - Clement Veall
- Centre for Behaviour Change, University College London, London, England, UK
| | - Elizabeth Corker
- Grounded Research, Rotherham Doncaster and South Humber NHS Foundation Trust, Doncaster, England, UK
- Clinical and Applied Psychology Unit, The University of Sheffield, Sheffield, England, UK
| | - Marie Johnston
- Aberdeen Health Psychology Group, University of Aberdeen, Aberdeen, Scotland, UK
| | - Janna Hastings
- Institute for Implementation Science in Health Care, Faculty of Medicine, University of Zurich, Zürich, Switzerland
- School of Medicine, University of St Gallen, St. Gallen, Switzerland
| | - Robert West
- Institute of Epidemiology and Health Care, University College London, London, England, UK
| | - Susan Michie
- Centre for Behaviour Change, University College London, London, England, UK
| |
Collapse
|
8
|
Huffman A, Ong E, Brunson T, Sanati N, Zheng J, Masci AM, Wu G, He Y. Ontology-based representation and analysis of conditional vaccine immune responses using Omics data. CEUR WORKSHOP PROCEEDINGS 2023; 3603:1-12. [PMID: 39583065 PMCID: PMC11584146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/26/2024]
Abstract
ImmPort, the world's largest repository of immunology data, includes many vaccine immune response datasets. ImmPort maps the metadata of these studies to ontology and database schema. As of February 28, 2023, our ImmPort data analysis identified 6.258 immune exposures using 47 vaccines in 4,607 human subjects, and 324 cohort studies from the ImmPort. We hypothesized that an integrative ontological representation of the data from these studies would enhance our understanding and analysis of these ImmPort vaccine studies, and with ontological classification and tools such as VIGET, we could further study the effects of different conditions such as vaccine types and host biological sex on the vaccine response gene expression profiles. Our Vaccine Ontology (VO) analysis classified these 37 vaccines into bacterial, viral, and protozoan vaccine types with different vaccine properties. The ImmPort metadata types were modeled with the Vaccine Investigation Ontology (VIO). Our new ontology-based pipeline extracted vaccine response data from the ImmPort database, annotated them based on ontology, obtained corresponding gene expression data from the GEO, and performed consistent omics data analysis. Our use case found gene profiles shared and differed from live and killed inactivated influenza vaccines. Furthermore, our Omics data analysis using the VIGET tool found that female and male human subjects have differential host responses for influenza vaccines. For example, our study showed much stronger early female responses to influenza vaccination than males, and males was able to show active immune responses at a later stage. Interestingly, the female (but not male) human subject group also showed significantly enriched neutrophil degranulation at Day 3 after influenza vaccination; however, males (but not females) displayed significantly enriched neutrophil degranulation at Day 14 after influenza vaccination. These mechanisms have been used to find differences between the gene lists and pathways of host responses to different vaccines conditional to different factors including vaccine types and host biological sex. Moreover, this framework can be expanded to other vaccines and vaccine categories easily.
Collapse
Affiliation(s)
- Anthony Huffman
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| | - Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| | - Tim Brunson
- Oregon Health & Science University, Portland, Oregon, OR, United States
| | - Nasim Sanati
- Oregon Health & Science University, Portland, Oregon, OR, United States
| | - Jie Zheng
- Unit of Laboratory Animal Medicine, University of Michigan, Ann Arbor, MI, United States
| | - Anna Maria Masci
- Office of Data Science National Institute of Environmental Health Science, Durham, NC, United States
| | - Guanming Wu
- Oregon Health & Science University, Portland, Oregon, OR, United States
| | - Yongqun He
- Unit of Laboratory Animal Medicine, University of Michigan, Ann Arbor, MI, United States
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, United States
- Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
9
|
Jackson DB, Racz R, Kim S, Brock S, Burkhart K. Rewiring Drug Research and Development through Human Data-Driven Discovery (HD 3). Pharmaceutics 2023; 15:1673. [PMID: 37376121 PMCID: PMC10303279 DOI: 10.3390/pharmaceutics15061673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 05/29/2023] [Accepted: 06/02/2023] [Indexed: 06/29/2023] Open
Abstract
In an era of unparalleled technical advancement, the pharmaceutical industry is struggling to transform data into increased research and development efficiency, and, as a corollary, new drugs for patients. Here, we briefly review some of the commonly discussed issues around this counterintuitive innovation crisis. Looking at both industry- and science-related factors, we posit that traditional preclinical research is front-loading the development pipeline with data and drug candidates that are unlikely to succeed in patients. Applying a first principles analysis, we highlight the critical culprits and provide suggestions as to how these issues can be rectified through the pursuit of a Human Data-driven Discovery (HD3) paradigm. Consistent with other examples of disruptive innovation, we propose that new levels of success are not dependent on new inventions, but rather on the strategic integration of existing data and technology assets. In support of these suggestions, we highlight the power of HD3, through recently published proof-of-concept applications in the areas of drug safety analysis and prediction, drug repositioning, the rational design of combination therapies and the global response to the COVID-19 pandemic. We conclude that innovators must play a key role in expediting the path to a largely human-focused, systems-based approach to drug discovery and research.
Collapse
Affiliation(s)
| | - Rebecca Racz
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20993, USA; (R.R.); (K.B.)
| | - Sarah Kim
- Department of Pharmaceutics, Center for Pharmacometrics and Systems Pharmacology, College of Pharmacy, University of Florida, Orlando, FL 32827, USA;
| | | | - Keith Burkhart
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20993, USA; (R.R.); (K.B.)
| |
Collapse
|
10
|
Zhang S, Benis N, Cornet R. Automated approach for quality assessment of RDF resources. BMC Med Inform Decis Mak 2023; 23:90. [PMID: 37165363 PMCID: PMC10170671 DOI: 10.1186/s12911-023-02182-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 04/20/2023] [Indexed: 05/12/2023] Open
Abstract
INTRODUCTION The Semantic Web community provides a common Resource Description Framework (RDF) that allows representation of resources such that they can be linked. To maximize the potential of linked data - machine-actionable interlinked resources on the Web - a certain level of quality of RDF resources should be established, particularly in the biomedical domain in which concepts are complex and high-quality biomedical ontologies are in high demand. However, it is unclear which quality metrics for RDF resources exist that can be automated, which is required given the multitude of RDF resources. Therefore, we aim to determine these metrics and demonstrate an automated approach to assess such metrics of RDF resources. METHODS An initial set of metrics are identified through literature, standards, and existing tooling. Of these, metrics are selected that fulfil these criteria: (1) objective; (2) automatable; and (3) foundational. Selected metrics are represented in RDF and semantically aligned to existing standards. These metrics are then implemented in an open-source tool. To demonstrate the tool, eight commonly used RDF resources were assessed, including data models in the healthcare domain (HL7 RIM, HL7 FHIR, CDISC CDASH), ontologies (DCT, SIO, FOAF, ORDO), and a metadata profile (GRDDL). RESULTS Six objective metrics are identified in 3 categories: Resolvability (1), Parsability (1), and Consistency (4), and represented in RDF. The tool demonstrates that these metrics can be automated, and application in the healthcare domain shows non-resolvable URIs (ranging from 0.3% to 97%) among all eight resources and undefined URIs in HL7 RIM, and FHIR. In the tested resources no errors were found for parsability and the other three consistency metrics for correct usage of classes and properties. CONCLUSION We extracted six objective and automatable metrics from literature, as the foundational quality requirements of RDF resources to maximize the potential of linked data. Automated tooling to assess resources has shown to be effective to identify quality issues that must be avoided. This approach can be expanded to incorporate more automatable metrics so as to reflect additional quality dimensions with the assessment tool implementing more metrics.
Collapse
Affiliation(s)
- Shuxin Zhang
- Department of Medical Informatics, Amsterdam UMC location University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Public Health, Methodology & Digital Health, Amsterdam, The Netherlands
| | - Nirupama Benis
- Department of Medical Informatics, Amsterdam UMC location University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Public Health, Methodology & Digital Health, Amsterdam, The Netherlands
| | - Ronald Cornet
- Department of Medical Informatics, Amsterdam UMC location University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
- Amsterdam Public Health, Methodology & Digital Health, Amsterdam, The Netherlands
| |
Collapse
|
11
|
Yu H, Li L, Huffman A, Beverley J, Hur J, Merrell E, Huang HH, Wang Y, Liu Y, Ong E, Cheng L, Zeng T, Zhang J, Li P, Liu Z, Wang Z, Zhang X, Ye X, Handelman SK, Sexton J, Eaton K, Higgins G, Omenn GS, Athey B, Smith B, Chen L, He Y. A new framework for host-pathogen interaction research. Front Immunol 2022; 13:1066733. [PMID: 36591248 PMCID: PMC9797517 DOI: 10.3389/fimmu.2022.1066733] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 11/14/2022] [Indexed: 12/23/2022] Open
Abstract
COVID-19 often manifests with different outcomes in different patients, highlighting the complexity of the host-pathogen interactions involved in manifestations of the disease at the molecular and cellular levels. In this paper, we propose a set of postulates and a framework for systematically understanding complex molecular host-pathogen interaction networks. Specifically, we first propose four host-pathogen interaction (HPI) postulates as the basis for understanding molecular and cellular host-pathogen interactions and their relations to disease outcomes. These four postulates cover the evolutionary dispositions involved in HPIs, the dynamic nature of HPI outcomes, roles that HPI components may occupy leading to such outcomes, and HPI checkpoints that are critical for specific disease outcomes. Based on these postulates, an HPI Postulate and Ontology (HPIPO) framework is proposed to apply interoperable ontologies to systematically model and represent various granular details and knowledge within the scope of the HPI postulates, in a way that will support AI-ready data standardization, sharing, integration, and analysis. As a demonstration, the HPI postulates and the HPIPO framework were applied to study COVID-19 with the Coronavirus Infectious Disease Ontology (CIDO), leading to a novel approach to rational design of drug/vaccine cocktails aimed at interrupting processes occurring at critical host-coronavirus interaction checkpoints. Furthermore, the host-coronavirus protein-protein interactions (PPIs) relevant to COVID-19 were predicted and evaluated based on prior knowledge of curated PPIs and domain-domain interactions, and how such studies can be further explored with the HPI postulates and the HPIPO framework is discussed.
Collapse
Affiliation(s)
- Hong Yu
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
| | - Li Li
- Department of Genetics, Harvard Medical School, Boston, MA, United States
| | - Anthony Huffman
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - John Beverley
- Department of Philosophy, University at Buffalo, Buffalo, NY, United States
- Asymmetric Operations Sector, Johns Hopkins University Applied Physics Laboratory, Laurel, MD, United States
| | - Junguk Hur
- Department of Biomedical Sciences, University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND, United States
| | - Eric Merrell
- Department of Philosophy, University at Buffalo, Buffalo, NY, United States
| | - Hsin-hui Huang
- University of Michigan Medical School, Ann Arbor, MI, United States
- Department of Biotechnology and Laboratory Science in Medicine, National Yang-Ming University, Taipei, Taiwan
| | - Yang Wang
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Yingtong Liu
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Edison Ong
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Liang Cheng
- Department of Bioinformatics, Harbin Medical University, Harbin, Helongjian, China
| | - Tao Zeng
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Jingsong Zhang
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Pengpai Li
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
| | - Zhiping Liu
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
| | - Zhigang Wang
- Department of Biomedical Engineering, Institute of Basic Medical Sciences and School of Basic Medicine, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, China
| | - Xiangyan Zhang
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
| | - Xianwei Ye
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
| | | | - Jonathan Sexton
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Kathryn Eaton
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Gerry Higgins
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Gilbert S. Omenn
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Brian Athey
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Barry Smith
- Department of Philosophy, University at Buffalo, Buffalo, NY, United States
| | - Luonan Chen
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Yongqun He
- University of Michigan Medical School, Ann Arbor, MI, United States
| |
Collapse
|
12
|
He Y, Yu H, Huffman A, Lin AY, Natale DA, Beverley J, Zheng L, Perl Y, Wang Z, Liu Y, Ong E, Wang Y, Huang P, Tran L, Du J, Shah Z, Shah E, Desai R, Huang HH, Tian Y, Merrell E, Duncan WD, Arabandi S, Schriml LM, Zheng J, Masci AM, Wang L, Liu H, Smaili FZ, Hoehndorf R, Pendlington ZM, Roncaglia P, Ye X, Xie J, Tang YW, Yang X, Peng S, Zhang L, Chen L, Hur J, Omenn GS, Athey B, Smith B. A comprehensive update on CIDO: the community-based coronavirus infectious disease ontology. J Biomed Semantics 2022; 13:25. [PMID: 36271389 PMCID: PMC9585694 DOI: 10.1186/s13326-022-00279-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 09/13/2022] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND The current COVID-19 pandemic and the previous SARS/MERS outbreaks of 2003 and 2012 have resulted in a series of major global public health crises. We argue that in the interest of developing effective and safe vaccines and drugs and to better understand coronaviruses and associated disease mechenisms it is necessary to integrate the large and exponentially growing body of heterogeneous coronavirus data. Ontologies play an important role in standard-based knowledge and data representation, integration, sharing, and analysis. Accordingly, we initiated the development of the community-based Coronavirus Infectious Disease Ontology (CIDO) in early 2020. RESULTS As an Open Biomedical Ontology (OBO) library ontology, CIDO is open source and interoperable with other existing OBO ontologies. CIDO is aligned with the Basic Formal Ontology and Viral Infectious Disease Ontology. CIDO has imported terms from over 30 OBO ontologies. For example, CIDO imports all SARS-CoV-2 protein terms from the Protein Ontology, COVID-19-related phenotype terms from the Human Phenotype Ontology, and over 100 COVID-19 terms for vaccines (both authorized and in clinical trial) from the Vaccine Ontology. CIDO systematically represents variants of SARS-CoV-2 viruses and over 300 amino acid substitutions therein, along with over 300 diagnostic kits and methods. CIDO also describes hundreds of host-coronavirus protein-protein interactions (PPIs) and the drugs that target proteins in these PPIs. CIDO has been used to model COVID-19 related phenomena in areas such as epidemiology. The scope of CIDO was evaluated by visual analysis supported by a summarization network method. CIDO has been used in various applications such as term standardization, inference, natural language processing (NLP) and clinical data integration. We have applied the amino acid variant knowledge present in CIDO to analyze differences between SARS-CoV-2 Delta and Omicron variants. CIDO's integrative host-coronavirus PPIs and drug-target knowledge has also been used to support drug repurposing for COVID-19 treatment. CONCLUSION CIDO represents entities and relations in the domain of coronavirus diseases with a special focus on COVID-19. It supports shared knowledge representation, data and metadata standardization and integration, and has been used in a range of applications.
Collapse
Affiliation(s)
- Yongqun He
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Hong Yu
- People’s Hospital of Guizhou Province, Guiyang, Guizhou China
| | | | - Asiyah Yu Lin
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD USA
- National Center for Ontological Research, Buffalo, NY USA
| | | | - John Beverley
- National Center for Ontological Research, Buffalo, NY USA
- The Johns Hopkins University Applied Physics Laboratory, Laurel, MD USA
| | - Ling Zheng
- Computer Science and Software Engineering Department, Monmouth University, West Long Branch, NJ USA
| | - Yehoshua Perl
- Department of Computer Science, New Jersey Institute of Technology, Newark, NJ USA
| | - Zhigang Wang
- Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & School of Basic Medicine, Peking Union Medical College, Beijing, China
| | - Yingtong Liu
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Edison Ong
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Yang Wang
- University of Michigan Medical School, Ann Arbor, MI USA
- People’s Hospital of Guizhou Province, Guiyang, Guizhou China
| | - Philip Huang
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Long Tran
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Jinyang Du
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Zalan Shah
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Easheta Shah
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Roshan Desai
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Hsin-hui Huang
- University of Michigan Medical School, Ann Arbor, MI USA
- National Yang-Ming University, Taipei, Taiwan
| | - Yujia Tian
- Rutgers University, New Brunswick, NJ USA
| | | | | | | | - Lynn M. Schriml
- University of Maryland School of Medicine, Baltimore, MD USA
| | - Jie Zheng
- Department of Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA USA
| | - Anna Maria Masci
- Office of Data Science, National Institute of Environmental Health Sciences, Research Triangle Park, NC USA
| | | | | | | | - Robert Hoehndorf
- King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Zoë May Pendlington
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Paola Roncaglia
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Xianwei Ye
- People’s Hospital of Guizhou Province, Guiyang, Guizhou China
| | - Jiangan Xie
- School of Bioinformatics, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Yi-Wei Tang
- Cepheid, Danaher Diagnostic Platform, Shanghai, China
| | - Xiaolin Yang
- Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & School of Basic Medicine, Peking Union Medical College, Beijing, China
| | - Suyuan Peng
- National Institute of Health Data Science, Peking University, Beijing, China
| | - Luxia Zhang
- National Institute of Health Data Science, Peking University, Beijing, China
| | - Luonan Chen
- Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Junguk Hur
- University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND USA
| | | | - Brian Athey
- University of Michigan Medical School, Ann Arbor, MI USA
| | - Barry Smith
- National Center for Ontological Research, Buffalo, NY USA
- University at Buffalo, Buffalo, NY 14260 USA
| |
Collapse
|
13
|
Matentzoglu N, Goutte-Gattat D, Tan SZK, Balhoff JP, Carbon S, Caron AR, Duncan WD, Flack JE, Haendel M, Harris NL, Hogan WR, Hoyt CT, Jackson RC, Kim H, Kir H, Larralde M, McMurry JA, Overton JA, Peters B, Pilgrim C, Stefancsik R, Robb SMC, Toro S, Vasilevsky NA, Walls R, Mungall CJ, Osumi-Sutherland D. Ontology Development Kit: a toolkit for building, maintaining and standardizing biomedical ontologies. Database (Oxford) 2022; 2022:6754192. [PMID: 36208225 PMCID: PMC9547537 DOI: 10.1093/database/baac087] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 08/19/2022] [Accepted: 09/23/2022] [Indexed: 11/21/2022]
Abstract
Similar to managing software packages, managing the ontology life cycle involves multiple complex workflows such as preparing releases, continuous quality control checking and dependency management. To manage these processes, a diverse set of tools is required, from command-line utilities to powerful ontology-engineering environmentsr. Particularly in the biomedical domain, which has developed a set of highly diverse yet inter-dependent ontologies, standardizing release practices and metadata and establishing shared quality standards are crucial to enable interoperability. The Ontology Development Kit (ODK) provides a set of standardized, customizable and automatically executable workflows, and packages all required tooling in a single Docker image. In this paper, we provide an overview of how the ODK works, show how it is used in practice and describe how we envision it driving standardization efforts in our community. Database URL: https://github.com/INCATools/ontology-development-kit.
Collapse
Affiliation(s)
| | - Damien Goutte-Gattat
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Shawn Zheng Kai Tan
- Samples Phenotypes and Ontologies Team (SPOT), European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - James P Balhoff
- RENCI, University of North Carolina, Chapel Hill, NC, North Carolina 27517, USA
| | - Seth Carbon
- Berkeley Bioinformatics Open-source Projects (BBOP), Lawrence Berkeley National Laboratory (LBNL), 1 Cyclotron Road, Mailstop 977-0257, Berkeley, CA 94720, USA
| | - Anita R Caron
- Samples Phenotypes and Ontologies Team (SPOT), European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - William D Duncan
- Berkeley Bioinformatics Open-source Projects (BBOP), Lawrence Berkeley National Laboratory (LBNL), 1 Cyclotron Road, Mailstop 977-0257, Berkeley, CA 94720, USA,College of Dentistry; Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, William D. Duncan: 1395 Center Dr, Gainesville, William R. Hogan: 1600 SW Archer Rd, Gainesville, FL 32610, USA
| | - Joe E Flack
- School of Medicine, Johns Hopkins University, 733 N Broadway, Baltimore, Baltimore, MD 21205, USA
| | - Melissa Haendel
- University of Colorado Anschutz Medical Campus, 13001 E 17th Pl, Aurora, CO 80045, USA
| | - Nomi L Harris
- Berkeley Bioinformatics Open-source Projects (BBOP), Lawrence Berkeley National Laboratory (LBNL), 1 Cyclotron Road, Mailstop 977-0257, Berkeley, CA 94720, USA
| | - William R Hogan
- College of Dentistry; Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, William D. Duncan: 1395 Center Dr, Gainesville, William R. Hogan: 1600 SW Archer Rd, Gainesville, FL 32610, USA
| | - Charles Tapley Hoyt
- Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Avenue Armenise Building Room 109, Boston, MA 02115, USA
| | - Rebecca C Jackson
- Bend Informatics LLC, 5305 RIVER RD NORTH, STE B, KEIZER, OR 97303, USA
| | | | - Huseyin Kir
- Samples Phenotypes and Ontologies Team (SPOT), European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Martin Larralde
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstraße 1, Heidelberg 69117, Germany
| | - Julie A McMurry
- University of Colorado Anschutz Medical Campus, 13001 E 17th Pl, Aurora, CO 80045, USA
| | | | - Bjoern Peters
- Institute for Allergy & Immunology, La Jolla Institute for Immunology, 9420 Athena Circle, La Jolla, CA 92037, USA
| | - Clare Pilgrim
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge, CB2 3DY, UK
| | - Ray Stefancsik
- Samples Phenotypes and Ontologies Team (SPOT), European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Sofia MC Robb
- Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA
| | - Sabrina Toro
- University of Colorado Anschutz Medical Campus, 13001 E 17th Pl, Aurora, CO 80045, USA
| | - Nicole A Vasilevsky
- University of Colorado Anschutz Medical Campus, 13001 E 17th Pl, Aurora, CO 80045, USA
| | - Ramona Walls
- Critical Path Institute, 1730 E River Road, Tucson, AZ 85718, USA
| | - Christopher J Mungall
- Berkeley Bioinformatics Open-source Projects (BBOP), Lawrence Berkeley National Laboratory (LBNL), 1 Cyclotron Road, Mailstop 977-0257, Berkeley, CA 94720, USA
| | | |
Collapse
|
14
|
Abu-Salih B. MetaOntology: Toward developing an ontology for the metaverse. Front Big Data 2022; 5:998648. [PMID: 36156936 PMCID: PMC9493250 DOI: 10.3389/fdata.2022.998648] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 08/22/2022] [Indexed: 11/13/2022] Open
Abstract
Metaverse is now perceived as a celebrated future version of the internet. In this new anticipated virtual universe, interconnected digital platforms leveraged by augmented, extended, and virtual realities will elevate users' immersive experiences through multidimensional interactions. In particular, users will be offered a broad spectrum of digital activities within a newly immersive setting mediated by technology. This study aims to design a domain ontology (MetaOntology) for the metaverse to provide an explicit specification of relevant state-of-the-art technologies and infrastructure. A four-step methodological approach is followed to construct the designated ontology. Due to the immaturity of the metaverse, MetaOntology is not intended to furnish a complete outlook on the domain, rather it aims to establish a cornerstone so as to facilitate future efforts in building extant versions of this ontology considering the evolvement of relevant technologies.
Collapse
Affiliation(s)
- Bilal Abu-Salih
- King Abdullah II School of Information Technology, The University of Jordan, Al Jubeiha, Jordan
- School of Management and Marketing, Curtin University, Perth, WA, Australia
- *Correspondence: Bilal Abu-Salih
| |
Collapse
|
15
|
He Y. Development and Applications of Interoperable Biomedical Ontologies for Integrative Data and Knowledge Representation and Multiscale Modeling in Systems Medicine. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2486:233-244. [PMID: 35437726 DOI: 10.1007/978-1-0716-2265-0_12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The data FAIR Guiding Principles state that all data should be Findable, Accessible, Interoperable, and Reusable. Ontology is critical to data integration, sharing, and analysis. Given thousands of ontologies have been developed in the era of artificial intelligence, it is critical to have interoperable ontologies to support standardized data and knowledge presentation and reasoning. For interoperable ontology development, the eXtensible ontology development (XOD) strategy offers four principles including ontology term reuse, semantic alignment, ontology design pattern usage, and community extensibility. Many software programs are available to help implement these principles. As a demonstration, the XOD strategy is applied to developing the interoperable Coronavirus Infectious Disease Ontology (CIDO). Various applications of interoperable ontologies, such as COVID-19 and kidney precision medicine research, are also introduced in this chapter.
Collapse
Affiliation(s)
- Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, Center of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA.
| |
Collapse
|
16
|
Raboudi A, Allanic M, Balvay D, Hervé PY, Viel T, Yoganathan T, Certain A, Hilbey J, Charlet J, Durupt A, Boutinaud P, Eynard B, Tavitian B. The BMS-LM ontology for biomedical data reporting throughout the lifecycle of a research study: From data model to ontology. J Biomed Inform 2022; 127:104007. [DOI: 10.1016/j.jbi.2022.104007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 12/24/2021] [Accepted: 01/28/2022] [Indexed: 11/16/2022]
|
17
|
Liu M, Liu J, Liu G, Wang H, Wang X, Deng Z, He Y, Ou HY. ICEO, a biological ontology for representing and analyzing bacterial integrative and conjugative elements. Sci Data 2022; 9:11. [PMID: 35058462 PMCID: PMC8776819 DOI: 10.1038/s41597-021-01112-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 12/13/2021] [Indexed: 01/18/2023] Open
Abstract
Bacterial integrative and conjugative elements (ICEs) are highly modular mobile genetic elements critical to the horizontal transfer of antibiotic resistance and virulence factor genes. To better understand and analyze the ongoing increase of ICEs, we developed an Integrative and Conjugative Element Ontology (ICEO) to represent the gene components, functional modules, and other information of experimentally verified ICEs. ICEO is aligned with the upper-level Basic Formal Ontology and reuses existing reliable ontologies. There are 31,081 terms, including 26,814 classes from 14 ontologies and 4128 ICEO-specific classes, representing the information of 271 known experimentally verified ICEs from 235 bacterial strains in ICEO currently and 311 predicted ICEs of 272 completely sequenced Klebsiella pneumoniae strains. Three ICEO use cases were illustrated to investigate complex joins of ICEs and their harboring antibiotic resistance or virulence factor genes by using SPARQL or DL query. ICEO has been approved as an Open Biomedical Ontology library ontology. It may be dedicated to facilitating systematical ICE knowledge representation, integration, and computer-assisted queries.
Collapse
Affiliation(s)
- Meng Liu
- State Key Laboratory of Microbial Metabolism, Joint International Laboratory on Metabolic & Developmental Sciences, School of Life Sciences & Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Jialin Liu
- Department of Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Guitian Liu
- State Key Laboratory of Microbial Metabolism, Joint International Laboratory on Metabolic & Developmental Sciences, School of Life Sciences & Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Hui Wang
- State Key Laboratory of Pathogens and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, China
| | - Xiaoli Wang
- Department of Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Zixin Deng
- State Key Laboratory of Microbial Metabolism, Joint International Laboratory on Metabolic & Developmental Sciences, School of Life Sciences & Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Yongqun He
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
| | - Hong-Yu Ou
- State Key Laboratory of Microbial Metabolism, Joint International Laboratory on Metabolic & Developmental Sciences, School of Life Sciences & Biotechnology, Shanghai Jiao Tong University, Shanghai, 200030, China.
| |
Collapse
|
18
|
Wan L, Song J, He V, Roman J, Whah G, Peng S, Zhang L, He Y. Development of the International Classification of Diseases Ontology (ICDO) and its application for COVID-19 diagnostic data analysis. BMC Bioinformatics 2021; 22:508. [PMID: 34663204 PMCID: PMC8522253 DOI: 10.1186/s12859-021-04402-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Accepted: 09/24/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The 10th and 9th revisions of the International Statistical Classification of Diseases and Related Health Problems (ICD10 and ICD9) have been adopted worldwide as a well-recognized norm to share codes for diseases, signs and symptoms, abnormal findings, etc. The international Consortium for Clinical Characterization of COVID-19 by EHR (4CE) website stores diagnosis COVID-19 disease data using ICD10 and ICD9 codes. However, the ICD systems are difficult to decode due to their many shortcomings, which can be addressed using ontology. METHODS An ICD ontology (ICDO) was developed to logically and scientifically represent ICD terms and their relations among different ICD terms. ICDO is also aligned with the Basic Formal Ontology (BFO) and reuses terms from existing ontologies. As a use case, the ICD10 and ICD9 diagnosis data from the 4CE website were extracted, mapped to ICDO, and analyzed using ICDO. RESULTS We have developed the ICDO to ontologize the ICD terms and relations. Different from existing disease ontologies, all ICD diseases in ICDO are defined as disease processes to describe their occurrence with other properties. The ICDO decomposes each disease term into different components, including anatomic entities, process profiles, etiological causes, output phenotype, etc. Over 900 ICD terms have been represented in ICDO. Many ICDO terms are presented in both English and Chinese. The ICD10/ICD9-based diagnosis data of over 27,000 COVID-19 patients from 5 countries were extracted from the 4CE. A total of 917 COVID-19-related disease codes, each of which were associated with 1 or more cases in the 4CE dataset, were mapped to ICDO and further analyzed using the ICDO logical annotations. Our study showed that COVID-19 targeted multiple systems and organs such as the lung, heart, and kidney. Different acute and chronic kidney phenotypes were identified. Some kidney diseases appeared to result from other diseases, such as diabetes. Some of the findings could only be easily found using ICDO instead of ICD9/10. CONCLUSIONS ICDO was developed to ontologize ICD10/10 codes and applied to study COVID-19 patient diagnosis data. Our findings showed that ICDO provides a semantic platform for more accurate detection of disease profiles.
Collapse
Affiliation(s)
- Ling Wan
- University of Michigan Medical School, Ann Arbor, MI 48109 USA
- OntoWise, Nanjing, Jiangsu China
| | - Justin Song
- Cranbrook Kingswood Upper School, Bloomfield Hills, MI 48304 USA
| | | | - Jennifer Roman
- College of Literacy, Science, and Arts, University of Michigan, Ann Arbor, MI 48109 USA
| | - Grace Whah
- College of Engineering, University of Michigan, Ann Arbor, MI 48109 USA
| | - Suyuan Peng
- School of Public Health, Peking University, Beijing, China
- National Institute of Health Data Science, Peking University, Beijing, China
| | - Luxia Zhang
- National Institute of Health Data Science, Peking University, Beijing, China
- Advanced Institute of Information Technology, Peking University, Hangzhou, China
- Renal Division, Department of Medicine, Peking University First Hospital, Peking University Institute of Nephrology, Beijing, China
| | - Yongqun He
- University of Michigan Medical School, Ann Arbor, MI 48109 USA
| |
Collapse
|
19
|
Zhu Y, Liu L, Gao B, Liu J, Qiao X, Lian C, He Y. TCDO: A Community-Based Ontology for Integrative Representation and Analysis of Traditional Chinese Drugs and Their Properties. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE : ECAM 2021; 2021:6637810. [PMID: 34603473 PMCID: PMC8483929 DOI: 10.1155/2021/6637810] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 08/04/2021] [Accepted: 08/31/2021] [Indexed: 11/17/2022]
Abstract
Traditional Chinese drugs (TCDs) have been widely used in clinical practice in China and many other regions for thousands of years. Nowadays TCD's bioactive ingredients and mechanisms of action are being identified. However, the lack of standardized terminologies or ontologies for the description of TCDs has hindered the interoperability and deep analysis of TCD knowledge and data. By aligning with the Basic Formal Ontology (BFO), an ISO-approved top-level ontology, we constructed a community-driven TCD ontology (TCDO) with the aim of supporting standardized TCD representation and integrated analysis. TCDO provides logical and textual definitions of TCDs, TCD categories, and the properties of TCDs (i.e., nature, flavor, toxicity, and channel tropism). More than 400 popular TCD decoction pieces (TCD-DPs) and Chinese medicinal materials (CMMs) are systematically represented. The logical TCD representation in TCDO supports computer-assisted reasoning and queries using tools such as Description Logic (DL) and SPARQL queries. Our statistical analysis of the knowledge represented in TCDO revealed scientific insights about TCDs. A total of 36 TCDs with medium or high toxicity are most densely distributed, primarily in Aconitum genus, Lamiids clade, and Fabids clade. TCD toxicity is mostly associated with the hot nature and pungent or bitter flavors and has liver, kidney, and spleen channel tropism. The three pairs of TCD flavor-nature associations (i.e., bitter-cold, pungent-warm, and sweet-neutral) were identified. The significance of these findings is discussed. TCDO has also been used to support the development of a web-based traditional Chinese medicine semantic annotation system that provides comprehensive annotation for individual TCDs. As a novel formal TCD ontology, TCDO lays out a strong foundation for more advanced TCD studies in the future.
Collapse
Affiliation(s)
- Yan Zhu
- Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Lihong Liu
- Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Bo Gao
- Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Jing Liu
- Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Xingchao Qiao
- Institute of Information on Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Chaojie Lian
- National Institutes for Food and Drug Control, Beijing 102627, China
| | - Yongqun He
- University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
20
|
Huffman A, Masci AM, Zheng J, Sanati N, Brunson T, Wu G, He Y. CIDO ontology updates and secondary analysis of host responses to COVID-19 infection based on ImmPort reports and literature. J Biomed Semantics 2021; 12:18. [PMID: 34454610 PMCID: PMC8400831 DOI: 10.1186/s13326-021-00250-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 08/05/2021] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND With COVID-19 still in its pandemic stage, extensive research has generated increasing amounts of data and knowledge. As many studies are published within a short span of time, we often lose an integrative and comprehensive picture of host-coronavirus interaction (HCI) mechanisms. As of early April 2021, the ImmPort database has stored 7 studies (with 6 having details) that cover topics including molecular immune signatures, epitopes, and sex differences in terms of mortality in COVID-19 patients. The Coronavirus Infectious Disease Ontology (CIDO) represents basic HCI information. We hypothesize that the CIDO can be used as the platform to represent newly recorded information from ImmPort leading the reinforcement of CIDO. METHODS The CIDO was used as the semantic platform for logically modeling and representing newly identified knowledge reported in the 6 ImmPort studies. A recursive eXtensible Ontology Development (XOD) strategy was established to support the CIDO representation and enhancement. Secondary data analysis was also performed to analyze different aspects of the HCI from these ImmPort studies and other related literature reports. RESULTS The topics covered by the 6 ImmPort papers were identified to overlap with existing CIDO representation. SARS-CoV-2 viral S protein related HCI knowledge was emphasized for CIDO modeling, including its binding with ACE2, mutations causing different variants, and epitope homology by comparison with other coronavirus S proteins. Different types of cytokine signatures were also identified and added to CIDO. Our secondary analysis of two cohort COVID-19 studies with cytokine panel detection found that a total of 11 cytokines were up-regulated in female patients after infection and 8 cytokines in male patients. These sex-specific gene responses were newly modeled and represented in CIDO. A new DL query was generated to demonstrate the benefits of such integrative ontology representation. Furthermore, IL-10 signaling pathway was found to be statistically significant for both male patients and female patients. CONCLUSION Using the recursive XOD strategy, six new ImmPort COVID-19 studies were systematically reviewed, the results were modeled and represented in CIDO, leading to the enhancement of CIDO. The enhanced ontology and further seconary analysis supported more comprehensive understanding of the molecular mechanism of host responses to COVID-19 infection.
Collapse
Affiliation(s)
- Anthony Huffman
- Department of Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109 USA
| | - Anna Maria Masci
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC 27710 USA
- Office of Data Science, National Institute of Environmental Health Sciences, 530 Davis Drive, Research Triangle Park, NC 27560 USA
| | - Jie Zheng
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104 USA
| | - Nasim Sanati
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR 97239 USA
| | - Timothy Brunson
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR 97239 USA
| | - Guanming Wu
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR 97239 USA
| | - Yongqun He
- Department of Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109 USA
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109 USA
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, MI 48109 USA
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109 USA
| |
Collapse
|
21
|
Wang Z, He Y. Precision omics data integration and analysis with interoperable ontologies and their application for COVID-19 research. Brief Funct Genomics 2021; 20:235-248. [PMID: 34159360 PMCID: PMC8287950 DOI: 10.1093/bfgp/elab029] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 05/10/2021] [Accepted: 05/24/2021] [Indexed: 12/12/2022] Open
Abstract
Omics technologies are widely used in biomedical research. Precision medicine focuses on individual-level disease treatment and prevention. Here, we propose the usage of the term 'precision omics' to represent the combinatorial strategy that applies omics to translate large-scale molecular omics data for precision disease understanding and accurate disease diagnosis, treatment and prevention. Given the complexity of both omics and precision medicine, precision omics requires standardized representation and integration of heterogeneous data types. Ontology has emerged as an important artificial intelligence component to become critical for standard data and metadata representation, standardization and integration. To support precision omics, we propose a precision omics ontology hypothesis, which hypothesizes that the effectiveness of precision omics is positively correlated with the interoperability of ontologies used for data and knowledge integration. Therefore, to make effective precision omics studies, interoperable ontologies are required to standardize and incorporate heterogeneous data and knowledge in a human- and computer-interpretable manner. Methods for efficient development and application of interoperable ontologies are proposed and illustrated. With the interoperable omics data and knowledge, omics tools such as OmicsViz can also be evolved to process, integrate, visualize and analyze various omics data, leading to the identification of new knowledge and hypotheses of molecular mechanisms underlying the outcomes of diseases such as COVID-19. Given extensive COVID-19 omics research, we propose the strategy of precision omics supported by interoperable ontologies, accompanied with ontology-based semantic reasoning and machine learning, leading to systematic disease mechanism understanding and rational design of precision treatment and prevention. SHORT ABSTRACT Precision medicine focuses on individual-level disease treatment and prevention. Precision omics is a new strategy that applies omics for precision medicine research, which requires standardized representation and integration of individual genetics and phenotypes, experimental conditions, and data analysis settings. Ontology has emerged as an important artificial intelligence component to become critical for standard data and metadata representation, standardization and integration. To support precision omics, interoperable ontologies are required in order to standardize and incorporate heterogeneous data and knowledge in a human- and computer-interpretable manner. With the interoperable omics data and knowledge, omics tools such as OmicsViz can also be evolved to process, integrate, visualize and analyze various omics data, leading to the identification of new knowledge and hypotheses of molecular mechanisms underlying disease outcomes. The precision COVID-19 omics study is provided as the primary use case to illustrate the rationale and implementation of the precision omics strategy.
Collapse
Affiliation(s)
| | - Yongqun He
- University of Michigan Medical School, Ann Arbor, MI, USA
| |
Collapse
|
22
|
Norris E, Wright AJ, Hastings J, West R, Boyt N, Michie S. Specifying who delivers behaviour change interventions: development of an Intervention Source Ontology. Wellcome Open Res 2021; 6:77. [PMID: 34497878 PMCID: PMC8406443 DOI: 10.12688/wellcomeopenres.16682.1] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2021] [Indexed: 12/16/2022] Open
Abstract
Background: Identifying how behaviour change interventions are delivered, including by whom, is key to understanding intervention effectiveness. However, information about who delivers interventions is reported inconsistently in intervention evaluations, limiting communication and knowledge accumulation. This paper reports a method for consistent reporting: The Intervention Source Ontology. This forms one part of the Behaviour Change Intervention Ontology, which aims to cover all aspects of behaviour change interventions . Methods: The Intervention Source Ontology was developed following methods for ontology development and maintenance used in the Human Behaviour-Change Project, with seven key steps: 1) define the scope of the ontology, 2) identify key entities and develop their preliminary definitions by reviewing existing classification systems (top-down) and reviewing 100 behaviour change intervention reports (bottom-up), 3) refine the ontology by piloting the preliminary ontology on 100 reports, 4) stakeholder review by 34 behavioural science and public health experts, 5) inter-rater reliability testing of annotating intervention reports using the ontology, 6) specify ontological relationships between entities and 7) disseminate and maintain the Intervention Source Ontology. Results: The Intervention Source Ontology consists of 140 entities. Key areas of the ontology include Occupational Role of Source, Relatedness between Person Source and the Target Population, Sociodemographic attributes and Expertise. Inter-rater reliability was found to be 0.60 for those familiar with the ontology and 0.59 for those unfamiliar with it, levels of agreement considered 'acceptable'. Conclusions: Information about who delivers behaviour change interventions can be reliably specified using the Intervention Source Ontology. For human-delivered interventions, the ontology can be used to classify source characteristics in existing behaviour change reports and enable clearer specification of intervention sources in reporting.
Collapse
Affiliation(s)
- Emma Norris
- Health Behaviour Change Research Group, Brunel University London, Uxbridge, UB8 3PH, UK
- Centre for Behaviour Change, University College London, London, WC1E 7HB, UK
| | - Alison J. Wright
- Centre for Behaviour Change, University College London, London, WC1E 7HB, UK
| | - Janna Hastings
- Centre for Behaviour Change, University College London, London, WC1E 7HB, UK
| | - Robert West
- Research Department of Epidemiology & Public Health, University College London, London, WC1E 7HB, UK
| | - Neil Boyt
- Centre for Behaviour Change, University College London, London, WC1E 7HB, UK
| | - Susan Michie
- Centre for Behaviour Change, University College London, London, WC1E 7HB, UK
| |
Collapse
|
23
|
Cimino JJ. Putting the "why" in "EHR": capturing and coding clinical cognition. J Am Med Inform Assoc 2021; 26:1379-1384. [PMID: 31407781 PMCID: PMC6798564 DOI: 10.1093/jamia/ocz125] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 06/21/2019] [Accepted: 06/25/2019] [Indexed: 12/02/2022] Open
Abstract
Complaints about electronic health records, including information overload, note bloat, and alert fatigue, are frequent topics of discussion. Despite substantial effort by researchers and industry, complaints continue noting serious adverse effects on patient safety and clinician quality of life. I believe solutions are possible if we can add information to the record that explains the “why” of a patient’s care, such as relationships between symptoms, physical findings, diagnostic results, differential diagnoses, therapeutic plans, and goals. While this information may be present in clinical notes, I propose that we modify electronic health records to support explicit representation of this information using formal structure and controlled vocabularies. Such information could foster development of more situation-aware tools for data retrieval and synthesis. Informatics research is needed to understand what should be represented, how to capture it, and how to benefit those providing the information so that their workload is reduced.
Collapse
Affiliation(s)
- James J Cimino
- Informatics Institute, University of Alabama at Birmingham, Birmingham, Alabama, USA
| |
Collapse
|
24
|
El-Achkar TM, Eadon MT, Menon R, Lake BB, Sigdel TK, Alexandrov T, Parikh S, Zhang G, Dobi D, Dunn KW, Otto EA, Anderton CR, Carson JM, Luo J, Park C, Hamidi H, Zhou J, Hoover P, Schroeder A, Joanes M, Azeloglu EU, Sealfon R, Winfree S, Steck B, He Y, D’Agati V, Iyengar R, Troyanskaya OG, Barisoni L, Gaut J, Zhang K, Laszik Z, Rovin BH, Dagher PC, Sharma K, Sarwal MM, Hodgin JB, Alpers CE, Kretzler M, Jain S. A multimodal and integrated approach to interrogate human kidney biopsies with rigor and reproducibility: guidelines from the Kidney Precision Medicine Project. Physiol Genomics 2021; 53:1-11. [PMID: 33197228 PMCID: PMC7847045 DOI: 10.1152/physiolgenomics.00104.2020] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Comprehensive and spatially mapped molecular atlases of organs at a cellular level are a critical resource to gain insights into pathogenic mechanisms and personalized therapies for diseases. The Kidney Precision Medicine Project (KPMP) is an endeavor to generate three-dimensional (3-D) molecular atlases of healthy and diseased kidney biopsies by using multiple state-of-the-art omics and imaging technologies across several institutions. Obtaining rigorous and reproducible results from disparate methods and at different sites to interrogate biomolecules at a single-cell level or in 3-D space is a significant challenge that can be a futile exercise if not well controlled. We describe a "follow the tissue" pipeline for generating a reliable and authentic single-cell/region 3-D molecular atlas of human adult kidney. Our approach emphasizes quality assurance, quality control, validation, and harmonization across different omics and imaging technologies from sample procurement, processing, storage, shipping to data generation, analysis, and sharing. We established benchmarks for quality control, rigor, reproducibility, and feasibility across multiple technologies through a pilot experiment using common source tissue that was processed and analyzed at different institutions and different technologies. A peer review system was established to critically review quality control measures and the reproducibility of data generated by each technology before their being approved to interrogate clinical biopsy specimens. The process established economizes the use of valuable biopsy tissue for multiomics and imaging analysis with stringent quality control to ensure rigor and reproducibility of results and serves as a model for precision medicine projects across laboratories, institutions and consortia.
Collapse
Affiliation(s)
| | | | - Rajasree Menon
- 2University of Michigan School of Medicine, Ann Arbor, Michigan
| | - Blue B. Lake
- 3Jacobs School of Engineering, University of California, San Diego, California
| | - Tara K. Sigdel
- 4University of California San Francisco School of Medicine, San Francisco, California
| | | | - Samir Parikh
- 6Ohio State University College of Medicine, Columbus, Ohio
| | - Guanshi Zhang
- 7UT-Health San Antonio School of Medicine, San Antonio, Texas
| | - Dejan Dobi
- 4University of California San Francisco School of Medicine, San Francisco, California
| | - Kenneth W. Dunn
- 1Indiana University School of Medicine, Indianapolis, Indiana
| | - Edgar A. Otto
- 2University of Michigan School of Medicine, Ann Arbor, Michigan
| | - Christopher R. Anderton
- 7UT-Health San Antonio School of Medicine, San Antonio, Texas,8Pacific Northwest National Laboratory, Richland, Washington
| | - Jonas M. Carson
- 9Schools of Medicine and Public Health, University of Washington, Seattle, Washington
| | - Jinghui Luo
- 2University of Michigan School of Medicine, Ann Arbor, Michigan
| | - Chris Park
- 9Schools of Medicine and Public Health, University of Washington, Seattle, Washington
| | - Habib Hamidi
- 2University of Michigan School of Medicine, Ann Arbor, Michigan
| | - Jian Zhou
- 12Princeton University, Princeton, New Jersey,14Columbia University School of Medicine, New York City, New York
| | - Paul Hoover
- 10Harvard University School of Medicine, Boston Massachusetts
| | - Andrew Schroeder
- 4University of California San Francisco School of Medicine, San Francisco, California
| | - Marianinha Joanes
- 4University of California San Francisco School of Medicine, San Francisco, California
| | | | - Rachel Sealfon
- 12Princeton University, Princeton, New Jersey,14Columbia University School of Medicine, New York City, New York
| | - Seth Winfree
- 1Indiana University School of Medicine, Indianapolis, Indiana
| | - Becky Steck
- 2University of Michigan School of Medicine, Ann Arbor, Michigan
| | - Yongqun He
- 2University of Michigan School of Medicine, Ann Arbor, Michigan
| | - Vivette D’Agati
- 14Columbia University School of Medicine, New York City, New York
| | - Ravi Iyengar
- 11Icahn School of Medicine at Mount Sinai, New York City, New York
| | - Olga G. Troyanskaya
- 12Princeton University, Princeton, New Jersey,14Columbia University School of Medicine, New York City, New York
| | - Laura Barisoni
- 15Duke University School of Medicine, Durham, North Carolina
| | - Joseph Gaut
- 16Washington University in Saint Louis School of Medicine, St. Louis, Missouri
| | - Kun Zhang
- 3Jacobs School of Engineering, University of California, San Diego, California
| | - Zoltan Laszik
- 4University of California San Francisco School of Medicine, San Francisco, California
| | - Brad H. Rovin
- 6Ohio State University College of Medicine, Columbus, Ohio
| | | | - Kumar Sharma
- 7UT-Health San Antonio School of Medicine, San Antonio, Texas
| | - Minnie M. Sarwal
- 4University of California San Francisco School of Medicine, San Francisco, California
| | | | - Charles E. Alpers
- 9Schools of Medicine and Public Health, University of Washington, Seattle, Washington
| | | | - Sanjay Jain
- 16Washington University in Saint Louis School of Medicine, St. Louis, Missouri
| |
Collapse
|
25
|
He Y, Yu H, Ong E, Wang Y, Liu Y, Huffman A, Huang HH, Beverley J, Hur J, Yang X, Chen L, Omenn GS, Athey B, Smith B. CIDO, a community-based ontology for coronavirus disease knowledge and data integration, sharing, and analysis. Sci Data 2020; 7:181. [PMID: 32533075 PMCID: PMC7293349 DOI: 10.1038/s41597-020-0523-6] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 05/19/2020] [Indexed: 11/15/2022] Open
Abstract
The Coronavirus Infectious Disease Ontology (CIDO) is a community-based ontology that supports coronavirus disease knowledge and data standardization, integration, sharing, and analysis.
Collapse
Affiliation(s)
- Yongqun He
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
| | - Hong Yu
- People's Hospital of Guizhou Province, Guiyang, Guizhou, 550002, China
- Guizhou University Medical College, Guiyang, Guizhou, 550025, China
| | - Edison Ong
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Yang Wang
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- People's Hospital of Guizhou Province, Guiyang, Guizhou, 550002, China
- Guizhou University Medical College, Guiyang, Guizhou, 550025, China
| | - Yingtong Liu
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Anthony Huffman
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Hsin-Hui Huang
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- National Yang-Ming University, Taipei, 112-21, Taiwan
| | | | - Junguk Hur
- University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND, 58203, USA
| | - Xiaolin Yang
- Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences (CAMS) & School of Basic Medicine, Peking Union Medical College (PUMC), Beijing, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, 200031, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China
| | - Gilbert S Omenn
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Brian Athey
- University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Barry Smith
- University at Buffalo, Buffalo, NY, 14260, USA
| |
Collapse
|
26
|
Norris E, Marques MM, Finnerty AN, Wright AJ, West R, Hastings J, Williams P, Carey RN, Kelly MP, Johnston M, Michie S. Development of an Intervention Setting Ontology for behaviour change: Specifying where interventions take place. Wellcome Open Res 2020; 5:124. [PMID: 32964137 PMCID: PMC7489274 DOI: 10.12688/wellcomeopenres.15904.1] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/20/2020] [Indexed: 12/16/2022] Open
Abstract
Background: Contextual factors such as an intervention's setting are key to understanding how interventions to change behaviour have their effects and patterns of generalisation across contexts. The intervention's setting is not consistently reported in published reports of evaluations. Using ontologies to specify and classify intervention setting characteristics enables clear and reproducible reporting, thus aiding replication, implementation and evidence synthesis. This paper reports the development of a Setting Ontology for behaviour change interventions as part of a Behaviour Change Intervention Ontology, currently being developed in the Wellcome Trust funded Human Behaviour-Change Project. Methods: The Intervention Setting Ontology was developed following methods for ontology development used in the Human Behaviour-Change Project: 1) Defining the ontology's scope, 2) Identifying key entities by reviewing existing classification systems (top-down) and 100 published behaviour change intervention reports (bottom-up), 3) Refining the preliminary ontology by literature annotation of 100 reports, 4) Stakeholder reviewing by 23 behavioural science and public health experts to refine the ontology, 5) Assessing inter-rater reliability of using the ontology by two annotators familiar with the ontology and two annotators unfamiliar with it, 6) Specifying ontological relationships between setting entities and 7) Making the Intervention Setting Ontology machine-readable using Web Ontology Language (OWL) and publishing online. Re sults: The Intervention Setting Ontology consists of 72 entities structured hierarchically with two upper-level classes: Physical setting including Geographic location, Attribute of location (including Area social and economic condition, Population and resource density sub-levels) and Intervention site (including Facility, Transportation and Outdoor environment sub-levels), as well as Social setting. Inter-rater reliability was found to be 0.73 (good) for those familiar with the ontology and 0.61 (acceptable) for those unfamiliar with it. Conclusion: The Intervention Setting Ontology can be used to code information from diverse sources, annotate the setting characteristics of existing intervention evaluation reports and guide future reporting.
Collapse
Affiliation(s)
- Emma Norris
- Centre for Behaviour Change, University College London, London, UK
- Department of Clinical Sciences, Brunel University, Uxbridge, UK
| | - Marta M. Marques
- Centre for Behaviour Change, University College London, London, UK
- ADAPT SFI Research Centre, Trinity College Dublin, Dublin, Ireland
| | | | - Alison J. Wright
- Centre for Behaviour Change, University College London, London, UK
| | - Robert West
- Research Department of Epidemiology & Public Health, University College London, London, UK
| | - Janna Hastings
- Centre for Behaviour Change, University College London, London, UK
| | - Poppy Williams
- Centre for Behaviour Change, University College London, London, UK
| | - Rachel N. Carey
- Centre for Behaviour Change, University College London, London, UK
| | - Michael P. Kelly
- Primary Care Unit, Institute of Public Health, University of Cambridge, Cambridge, UK
| | - Marie Johnston
- Aberdeen Health Psychology Group, University of Aberdeen, Aberdeen, UK
| | - Susan Michie
- Centre for Behaviour Change, University College London, London, UK
| |
Collapse
|
27
|
He Y, Wang H, Zheng J, Beiting DP, Masci AM, Yu H, Liu K, Wu J, Curtis JL, Smith B, Alekseyenko AV, Obeid JS. OHMI: the ontology of host-microbiome interactions. J Biomed Semantics 2019; 10:25. [PMID: 31888755 PMCID: PMC6937947 DOI: 10.1186/s13326-019-0217-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Accepted: 12/04/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Host-microbiome interactions (HMIs) are critical for the modulation of biological processes and are associated with several diseases. Extensive HMI studies have generated large amounts of data. We propose that the logical representation of the knowledge derived from these data and the standardized representation of experimental variables and processes can foster integration of data and reproducibility of experiments and thereby further HMI knowledge discovery. METHODS Through a multi-institutional collaboration, a community-based Ontology of Host-Microbiome Interactions (OHMI) was developed following the Open Biological/Biomedical Ontologies (OBO) Foundry principles. As an OBO library ontology, OHMI leverages established ontologies to create logically structured representations of (1) microbiomes, microbial taxonomy, host species, host anatomical entities, and HMIs under different conditions and (2) associated study protocols and types of data analysis and experimental results. RESULTS Aligned with the Basic Formal Ontology, OHMI comprises over 1000 terms, including terms imported from more than 10 existing ontologies together with some 500 OHMI-specific terms. A specific OHMI design pattern was generated to represent typical host-microbiome interaction studies. As one major OHMI use case, drawing on data from over 50 peer-reviewed publications, we identified over 100 bacteria and fungi from the gut, oral cavity, skin, and airway that are associated with six rheumatic diseases including rheumatoid arthritis. Our ontological study identified new high-level microbiota taxonomical structures. Two microbiome-related competency questions were also designed and addressed. We were also able to use OHMI to represent statistically significant results identified from a large existing microbiome database data analysis. CONCLUSION OHMI represents entities and relations in the domain of HMIs. It supports shared knowledge representation, data and metadata standardization and integration, and can be used in formulation of advanced queries for purposes of data analysis.
Collapse
Affiliation(s)
- Yongqun He
- University of Michigan Medical School, Ann Arbor, MI 48109 USA
| | - Haihe Wang
- University of Michigan Medical School, Ann Arbor, MI 48109 USA
- Daqing Branch of Harbin Medical University, Daqing, 163319 Heilongjiang China
| | - Jie Zheng
- University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104 USA
| | - Daniel P. Beiting
- University of Pennsylvania School of Veterinary Medicine, Philadelphia, PA 19104 USA
| | - Anna Maria Masci
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC 27710 USA
| | - Hong Yu
- University of Michigan Medical School, Ann Arbor, MI 48109 USA
- People’s Hospital of Guizhou Province, Guiyang, 550025 Guizhou China
| | - Kaiyong Liu
- School of Public Health, Anhui Medical University, No 81 Meishan Road, Hefei, 230032 Anhui China
| | - Jianmin Wu
- Center for Cancer Bioinformatics, Peking University Cancer Hospital & Institute, Beijing, 100142 China
| | - Jeffrey L. Curtis
- University of Michigan Medical School, Ann Arbor, MI 48109 USA
- Pulmonary & Critical Care Medicine Section, Medical Service, VA Ann Arbor Healthcare System, Ann Arbor, MI 48105 USA
| | - Barry Smith
- University at Buffalo, Buffalo, NY 14260 USA
| | - Alexander V. Alekseyenko
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC 29425 USA
| | - Jihad S. Obeid
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC 29425 USA
| |
Collapse
|
28
|
Ong E, Sun P, Berke K, Zheng J, Wu G, He Y. VIO: ontology classification and study of vaccine responses given various experimental and analytical conditions. BMC Bioinformatics 2019; 20:704. [PMID: 31865910 PMCID: PMC6927110 DOI: 10.1186/s12859-019-3194-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Background Different human responses to the same vaccine were frequently observed. For example, independent studies identified overlapping but different transcriptomic gene expression profiles in Yellow Fever vaccine 17D (YF-17D) immunized human subjects. Different experimental and analysis conditions were likely contributed to the observed differences. To investigate this issue, we developed a Vaccine Investigation Ontology (VIO), and applied VIO to classify the different variables and relations among these variables systematically. We then evaluated whether the ontological VIO modeling and VIO-based statistical analysis would contribute to the enhanced vaccine investigation studies and a better understanding of vaccine response mechanisms. Results Our VIO modeling identified many variables related to data processing and analysis such as normalization method, cut-off criteria, software settings including software version. The datasets from two previous studies on human responses to YF-17D vaccine, reported by Gaucher et al. (2008) and Querec et al. (2009), were re-analyzed. We first applied the same LIMMA statistical method to re-analyze the Gaucher data set and identified a big difference in terms of significantly differentiated gene lists compared to the original study. The different results were likely due to the LIMMA version and software package differences. Our second study re-analyzed both Gaucher and Querec data sets but with the same data processing and analysis pipeline. Significant differences in differential gene lists were also identified. In both studies, we found that Gene Ontology (GO) enrichment results had more overlapping than the gene lists and enriched pathway lists. The visualization of the identified GO hierarchical structures among the enriched GO terms and their associated ancestor terms using GOfox allowed us to find more associations among enriched but often different GO terms, demonstrating the usage of GO hierarchical relations enhance data analysis. Conclusions The ontology-based analysis framework supports standardized representation, integration, and analysis of heterogeneous data of host responses to vaccines. Our study also showed that differences in specific variables might explain different results drawn from similar studies.
Collapse
Affiliation(s)
- Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Peter Sun
- College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI, USA
| | - Kimberly Berke
- College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI, USA.,Central Michigan University College of Medicine, Mount Pleasant, MI, USA
| | - Jie Zheng
- University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Guanming Wu
- Oregon Health & Science University, Portland, OR, USA
| | - Yongqun He
- Unit of Laboratory Animal Medicine, University of Michigan, Ann Arbor, MI, USA. .,Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA. .,Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
29
|
Abstract
This Editorial first introduces the background of the vaccine and drug relations and how biomedical terminologies and ontologies have been used to support their studies. The history of the seven workshops, initially named VDOSME, and then named VDOS, is also summarized and introduced. Then the 7th International Workshop on Vaccine and Drug Ontology Studies (VDOS 2018), held on August 10th, 2018, Corvallis, Oregon, USA, is introduced in detail. These VDOS workshops have greatly supported the development, applications, and discussion of vaccine- and drug-related terminology and drug studies.
Collapse
Affiliation(s)
- Junguk Hur
- Department of Biomedical Sciences, University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND USA
| | - Cui Tao
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX USA
| | - Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI USA
| |
Collapse
|
30
|
Semantic Data Management for a Virtual Factory Collaborative Environment. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9224936] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Recent developments in the area of cyber-physical systems (CPSs) and Internet of Things (IoT) are among the drivers for the emergence of the Industry 4.0 concept, setting new requirements for the architecture, technology, and design approaches of modern industrial systems. Industry 4.0 assumes a higher level of intelligence, and thus autonomy of the systems and subsystems, and a larger focus on the analysis of gathered data for further utilization. The Virtual Factory Open Operating System (vf-OS) project is intended to respond to some of these key challenges, in particular for the smart factory application domain. Complementarily, data and knowledge storage and processing are also in the scope of vf-OS. This article introduces the semantic management component of vf-OS, which aims to analyze the interrelations among stored entities, as well as to define the closeness among them to generate meaningful suggestions, which can be later used by other subsystems or operators in a user-friendly way. The semantic managing system makes use of non relational approaches, namely a graph database, which enables data to be represented as graphs for further semantic querying. The developed prototype and an illustrative application case are also presented.
Collapse
|
31
|
Bousquet C, Souvignet J, Sadou É, Jaulent MC, Declerck G. Ontological and Non-Ontological Resources for Associating Medical Dictionary for Regulatory Activities Terms to SNOMED Clinical Terms With Semantic Properties. Front Pharmacol 2019; 10:975. [PMID: 31551780 PMCID: PMC6747929 DOI: 10.3389/fphar.2019.00975] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Accepted: 07/31/2019] [Indexed: 11/20/2022] Open
Abstract
Background: Formal definitions allow selecting terms (e.g., identifying all terms related to “Infectious disease” using the query “has causative agent organism”) and terminological reasoning (e.g., “hepatitis B” is a “hepatitis” and is an “infectious disease”). However, the standard international terminology Medical Dictionary for Regulatory Activities (MedDRA) used for coding adverse drug reactions in pharmacovigilance databases does not beneficiate from such formal definitions. Our objective was to evaluate the potential of reuse of ontological and non-ontological resources for generating such definitions for MedDRA. Methods: We developed several methods that collectively allow a semiautomatic semantic enrichment of MedDRA: 1) using MedDRA-to-SNOMED Clinical Terms (SNOMED CT) mappings (available in the Unified Medical Language System metathesaurus or other mapping resources, e.g., the MedDRA preferred term “hepatitis B” is associated to the SNOMED CT concept “type B viral hepatitis”) to extract term definitions (e.g., “hepatitis B” is associated with the following properties: has finding site liver structure, has associated morphology inflammation morphology, and has causative agent hepatitis B virus); 2) using MedDRA labels and lexical/syntactic methods for automatic decomposition of complex MedDRA terms (e.g., the MedDRA systems organ class “blood and lymphatic system disorders” is decomposed in blood system disorders and lymphatic system disorders) or automatic suggestions of properties (e.g., the string “cyclic” in preferred term “cyclic neutropenia” leads to the property has clinical course cyclic). Results: The Unified Medical Language System metathesaurus was the main ontological resource reusable for generating formal definitions for MedDRA terms. The non-ontological resources (another mapping resource provided by Nadkarni and Darer in 2010 and MedDRA labels) allowed defining few additional preferred terms. While the Ci4SeR tool helped the curator to define 1,935 terms by suggesting potential supplemental relations based on the parents’ and siblings’ semantic definition, defining manually all MedDRA terms remains expensive in time. Discussion: Several ontological and non-ontological resources are available for associating MedDRA terms to SNOMED CT concepts with semantic properties, but providing manual definitions is still necessary. The ontology of adverse events is a possible alternative but does not cover all MedDRA terms either. Perspectives are to implement more efficient techniques to find more logical relations between SNOMED CT and MedDRA in an automated way.
Collapse
Affiliation(s)
- Cédric Bousquet
- Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances en e-Santé, LIMICS, Sorbonne Université, Inserm, Université Paris 13, Paris, France.,Unit of Public Health and Medical Informatics, University of Saint Etienne, Saint Etienne, France
| | - Julien Souvignet
- Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances en e-Santé, LIMICS, Sorbonne Université, Inserm, Université Paris 13, Paris, France.,Unit of Public Health and Medical Informatics, University of Saint Etienne, Saint Etienne, France
| | - Éric Sadou
- Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances en e-Santé, LIMICS, Sorbonne Université, Inserm, Université Paris 13, Paris, France
| | - Marie-Christine Jaulent
- Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances en e-Santé, LIMICS, Sorbonne Université, Inserm, Université Paris 13, Paris, France
| | - Gunnar Declerck
- EA 2223 Costech (Connaissance, Organisation et Systèmes Techniques), Centre de Recherche, Sorbonne Universités, Université de technologie de Compiègne, Compiègne, France
| |
Collapse
|
32
|
Mechanistic integration of exposure and effects: advances to apply systems toxicology in support of regulatory decision-making. CURRENT OPINION IN TOXICOLOGY 2019. [DOI: 10.1016/j.cotox.2019.09.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
33
|
Jackson RC, Balhoff JP, Douglass E, Harris NL, Mungall CJ, Overton JA. ROBOT: A Tool for Automating Ontology Workflows. BMC Bioinformatics 2019; 20:407. [PMID: 31357927 PMCID: PMC6664714 DOI: 10.1186/s12859-019-3002-3] [Citation(s) in RCA: 76] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Accepted: 07/19/2019] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Ontologies are invaluable in the life sciences, but building and maintaining ontologies often requires a challenging number of distinct tasks such as running automated reasoners and quality control checks, extracting dependencies and application-specific subsets, generating standard reports, and generating release files in multiple formats. Similar to more general software development, automation is the key to executing and managing these tasks effectively and to releasing more robust products in standard forms. For ontologies using the Web Ontology Language (OWL), the OWL API Java library is the foundation for a range of software tools, including the Protégé ontology editor. In the Open Biological and Biomedical Ontologies (OBO) community, we recognized the need to package a wide range of low-level OWL API functionality into a library of common higher-level operations and to make those operations available as a command-line tool. RESULTS ROBOT (a recursive acronym for "ROBOT is an OBO Tool") is an open source library and command-line tool for automating ontology development tasks. The library can be called from any programming language that runs on the Java Virtual Machine (JVM). Most usage is through the command-line tool, which runs on macOS, Linux, and Windows. ROBOT provides ontology processing commands for a variety of tasks, including commands for converting formats, running a reasoner, creating import modules, running reports, and various other tasks. These commands can be combined into larger workflows using a separate task execution system such as GNU Make, and workflows can be automatically executed within continuous integration systems. CONCLUSIONS ROBOT supports automation of a wide range of ontology development tasks, focusing on OBO conventions. It packages common high-level ontology development functionality into a convenient library, and makes it easy to configure, combine, and execute individual tasks in comprehensive, automated workflows. This helps ontology developers to efficiently create, maintain, and release high-quality ontologies, so that they can spend more time focusing on development tasks. It also helps guarantee that released ontologies are free of certain types of logical errors and conform to standard quality control checks, increasing the overall robustness and efficiency of the ontology development lifecycle.
Collapse
Affiliation(s)
| | - James P Balhoff
- Renaissance Computing Institute, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Eric Douglass
- Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Nomi L Harris
- Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | | | | |
Collapse
|
34
|
Natsiavas P, Malousi A, Bousquet C, Jaulent MC, Koutkias V. Computational Advances in Drug Safety: Systematic and Mapping Review of Knowledge Engineering Based Approaches. Front Pharmacol 2019; 10:415. [PMID: 31156424 PMCID: PMC6533857 DOI: 10.3389/fphar.2019.00415] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 04/02/2019] [Indexed: 12/12/2022] Open
Abstract
Drug Safety (DS) is a domain with significant public health and social impact. Knowledge Engineering (KE) is the Computer Science discipline elaborating on methods and tools for developing “knowledge-intensive” systems, depending on a conceptual “knowledge” schema and some kind of “reasoning” process. The present systematic and mapping review aims to investigate KE-based approaches employed for DS and highlight the introduced added value as well as trends and possible gaps in the domain. Journal articles published between 2006 and 2017 were retrieved from PubMed/MEDLINE and Web of Science® (873 in total) and filtered based on a comprehensive set of inclusion/exclusion criteria. The 80 finally selected articles were reviewed on full-text, while the mapping process relied on a set of concrete criteria (concerning specific KE and DS core activities, special DS topics, employed data sources, reference ontologies/terminologies, and computational methods, etc.). The analysis results are publicly available as online interactive analytics graphs. The review clearly depicted increased use of KE approaches for DS. The collected data illustrate the use of KE for various DS aspects, such as Adverse Drug Event (ADE) information collection, detection, and assessment. Moreover, the quantified analysis of using KE for the respective DS core activities highlighted room for intensifying research on KE for ADE monitoring, prevention and reporting. Finally, the assessed use of the various data sources for DS special topics demonstrated extensive use of dominant data sources for DS surveillance, i.e., Spontaneous Reporting Systems, but also increasing interest in the use of emerging data sources, e.g., observational healthcare databases, biochemical/genetic databases, and social media. Various exemplar applications were identified with promising results, e.g., improvement in Adverse Drug Reaction (ADR) prediction, detection of drug interactions, and novel ADE profiles related with specific mechanisms of action, etc. Nevertheless, since the reviewed studies mostly concerned proof-of-concept implementations, more intense research is required to increase the maturity level that is necessary for KE approaches to reach routine DS practice. In conclusion, we argue that efficiently addressing DS data analytics and management challenges requires the introduction of high-throughput KE-based methods for effective knowledge discovery and management, resulting ultimately, in the establishment of a continuous learning DS system.
Collapse
Affiliation(s)
- Pantelis Natsiavas
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.,Sorbonne Université, INSERM, Univ Paris 13, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France
| | - Andigoni Malousi
- Laboratory of Biological Chemistry, Department of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Cédric Bousquet
- Sorbonne Université, INSERM, Univ Paris 13, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France.,Public Health and Medical Information Unit, University Hospital of Saint-Etienne, Saint-Étienne, France
| | - Marie-Christine Jaulent
- Sorbonne Université, INSERM, Univ Paris 13, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France
| | - Vassilis Koutkias
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
| |
Collapse
|
35
|
Yu H, Nysak S, Garg N, Ong E, Ye X, Zhang X, He Y. ODAE: Ontology-based systematic representation and analysis of drug adverse events and its usage in study of adverse events given different patient age and disease conditions. BMC Bioinformatics 2019; 20:199. [PMID: 31074377 PMCID: PMC6509876 DOI: 10.1186/s12859-019-2729-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Background Drug adverse events (AEs), or called adverse drug events (ADEs), are ranked one of the leading causes of mortality. The Ontology of Adverse Events (OAE) has been widely used for adverse event AE representation, standardization, and analysis. OAE-based ADE-specific ontologies, including ODNAE for drug-associated neuropathy-inducing AEs and OCVDAE for cardiovascular drug AEs, have also been developed and used. However, these ADE-specific ontologies do not consider the effects of other factors (e.g., age and drug-treated disease) on the outcomes of ADEs. With more ontological studies of ADEs, it is also critical to develop a general purpose ontology for representing ADEs for various types of drugs. Results Our survey of FDA drug package insert documents and other resources for 224 neuropathy-inducing drugs discovered that many drugs (e.g., sirolimus and linezolid) cause different AEs given patients’ age or the diseases treated by the drugs. To logically represent the complex relations among drug, drug ingredient and mechanism of action, AE, age, disease, and other related factors, an ontology design pattern was developed and applied to generate a community-driven open-source Ontology of Drug Adverse Events (ODAE). The ODAE development follows the OBO Foundry ontology development principles (e.g., openness and collaboration). Built on a generalizable ODAE design pattern and extending the OAE and NDF-RT ontology, ODAE has represented various AEs associated with the over 200 neuropathy-inducing drugs given different age and disease conditions. ODAE is now deposited in the Ontobee for browsing and queries. As a demonstration of usage, a SPARQL query of the ODAE knowledge base was developed to identify all the drugs having the mechanisms of ion channel interactions, the diseases treated with the drugs, and AEs after the treatment in adult patients. AE-specific drug class effects were also explored using ODAE and SPARQL. Conclusion ODAE provides a general representation of ADEs given different conditions and can be used for querying scientific questions. ODAE is also a robust knowledge base and platform for semantic and logic representation and study of ADEs of more drugs in the future. Electronic supplementary material The online version of this article (10.1186/s12859-019-2729-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hong Yu
- Department of Pulmonary and Critical Care Medicine, Guizhou Provincial People's Hospital, Guiyang, 550002, Guizhou, China. .,Guizhou University Medical College, Guiyang, 550025, Guizhou, China.
| | - Solomiya Nysak
- College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Noemi Garg
- College of Pharmacy, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Xianwei Ye
- Department of Pulmonary and Critical Care Medicine, Guizhou Provincial People's Hospital, Guiyang, 550002, Guizhou, China.,Guizhou University Medical College, Guiyang, 550025, Guizhou, China
| | - Xiangyan Zhang
- Department of Pulmonary and Critical Care Medicine, Guizhou Provincial People's Hospital, Guiyang, 550002, Guizhou, China.,Guizhou University Medical College, Guiyang, 550025, Guizhou, China
| | - Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, Center for Computational Medicine and Bioinformatics, and Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
36
|
Sayers S, Li L, Ong E, Deng S, Fu G, Lin Y, Yang B, Zhang S, Fa Z, Zhao B, Xiang Z, Li Y, Zhao XM, Olszewski MA, Chen L, He Y. Victors: a web-based knowledge base of virulence factors in human and animal pathogens. Nucleic Acids Res 2019; 47:D693-D700. [PMID: 30365026 PMCID: PMC6324020 DOI: 10.1093/nar/gky999] [Citation(s) in RCA: 108] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/07/2018] [Accepted: 10/09/2018] [Indexed: 12/21/2022] Open
Abstract
Virulence factors (VFs) are molecules that allow microbial pathogens to overcome host defense mechanisms and cause disease in a host. It is critical to study VFs for better understanding microbial pathogenesis and host defense mechanisms. Victors (http://www.phidias.us/victors) is a novel, manually curated, web-based integrative knowledge base and analysis resource for VFs of pathogens that cause infectious diseases in human and animals. Currently, Victors contains 5296 VFs obtained via manual annotation from peer-reviewed publications, with 4648, 179, 105 and 364 VFs originating from 51 bacterial, 54 viral, 13 parasitic and 8 fungal species, respectively. Our data analysis identified many VF-specific patterns. Within the global VF pool, cytoplasmic proteins were more common, while adhesins were less common compared to findings on protective vaccine antigens. Many VFs showed homology with host proteins and the human proteins interacting with VFs represented the hubs of human-pathogen interactions. All Victors data are queriable with a user-friendly web interface. The VFs can also be searched by a customized BLAST sequence similarity searching program. These VFs and their interactions with the host are represented in a machine-readable Ontology of Host-Pathogen Interactions. Victors supports the 'One Health' research as a vital source of VFs in human and animal pathogens.
Collapse
Affiliation(s)
- Samantha Sayers
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Li Li
- Key Laboratory of Systems Biology, CAS Center for Excellence in Molecular Cell Science, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Edison Ong
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Shunzhou Deng
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Veterinary Medicine, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Guanghua Fu
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Institute of Animal Husbandry and Veterinary Medicine, Fujian Academy of Agricultural Sciences, Fuzhou, Fujian 350013, China
| | - Yu Lin
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Brian Yang
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Shelley Zhang
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Zhenzong Fa
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Health System and Research Service, VA Ann Arbor Health Systems, Ann Arbor 48109, MI, USA
| | - Bin Zhao
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Zuoshuang Xiang
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Yongqing Li
- Institute of Animal Husbandry and Veterinary Medicine, Beijing Municipal Academy of Agriculture and Forestry Sciences, Beijing 100097, China
| | - Xing-Ming Zhao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Michal A Olszewski
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Health System and Research Service, VA Ann Arbor Health Systems, Ann Arbor 48109, MI, USA
| | - Luonan Chen
- Key Laboratory of Systems Biology, CAS Center for Excellence in Molecular Cell Science, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
- School of Life Science and Technology, Shanghai Tech University, Shanghai 201210, China
| | - Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, and Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
37
|
Jorquera R, González C, Clausen P, Petersen B, Holmes DS. Improved ontology for eukaryotic single-exon coding sequences in biological databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:1-6. [PMID: 30239665 PMCID: PMC6146118 DOI: 10.1093/database/bay089] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 07/30/2018] [Indexed: 12/21/2022]
Abstract
Efficient extraction of knowledge from biological data requires the development of structured vocabularies to unambiguously define biological terms. This paper proposes descriptions and definitions to disambiguate the term 'single-exon gene'. Eukaryotic Single-Exon Genes (SEGs) have been defined as genes that do not have introns in their protein coding sequences. They have been studied not only to determine their origin and evolution but also because their expression has been linked to several types of human cancer and neurological/developmental disorders and many exhibit tissue-specific transcription. Unfortunately, the term 'SEGs' is rife with ambiguity, leading to biological misinterpretations. In the classic definition, no distinction is made between SEGs that harbor introns in their untranslated regions (UTRs) versus those without. This distinction is important to make because the presence of introns in UTRs affects transcriptional regulation and post-transcriptional processing of the mRNA. In addition, recent whole-transcriptome shotgun sequencing has led to the discovery of many examples of single-exon mRNAs that arise from alternative splicing of multi-exon genes, these single-exon isoforms are being confused with SEGs despite their clearly different origin. The increasing expansion of RNA-seq datasets makes it imperative to distinguish the different SEG types before annotation errors become indelibly propagated in biological databases. This paper develops a structured vocabulary for their disambiguation, allowing a major reassessment of their evolutionary trajectories, regulation, RNA processing and transport, and provides the opportunity to improve the detection of gene associations with disorders including cancers, neurological and developmental diseases.
Collapse
Affiliation(s)
- Roddy Jorquera
- Center for Bioinformatics and Genome Biology, Fundacion Ciencia & Vida, Avenida Zañartu 1482, Ñuñoa, Santiago, Chile.,Facultad de Ciencias Biologicas, Universidad Andres Bello, Santiago, Chile
| | - Carolina González
- Center for Bioinformatics and Genome Biology, Fundacion Ciencia & Vida, Avenida Zañartu 1482, Ñuñoa, Santiago, Chile
| | - Philip Clausen
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Bent Petersen
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs. Lyngby, Denmark.,Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
| | - David S Holmes
- Center for Bioinformatics and Genome Biology, Fundacion Ciencia & Vida, Avenida Zañartu 1482, Ñuñoa, Santiago, Chile.,Centro de Genómica y Bioinformática Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| |
Collapse
|