Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dumontier M, Baker CJ, Baran J, Callahan A, Chepelev L, Cruz-Toledo J, Del Rio NR, Duck G, Furlong LI, Keath N, Klassen D, McCusker JP, Queralt-Rosinach N, Samwald M, Villanueva-Rosales N, Wilkinson MD, Hoehndorf R. The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery. J Biomed Semantics 2014;5:14. [PMID: 24602174 PMCID: PMC4015691 DOI: 10.1186/2041-1480-5-14] [Citation(s) in RCA: 77] [Impact Index Per Article: 7.7] [Received: 07/02/2013] [Accepted: 02/02/2014] [Indexed: 11/10/2022] Open



Cited by Other Article(s)

Mulero-Hernández J, Mironov V, Miñarro-Giménez JA, Kuiper M, Fernández-Breis J. Integration of chromosome locations and functional aspects of enhancers and topologically associating domains in knowledge graphs enables versatile queries about gene regulation. Nucleic Acids Res 2024;52:e69. [PMID: 38967009 PMCID: PMC11347148 DOI: 10.1093/nar/gkae566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/12/2023] [Revised: 06/12/2024] [Accepted: 06/19/2024] [Indexed: 07/06/2024] Open

Wenk EH, Sauquet H, Gallagher RV, Brownlee R, Boettiger C, Coleman D, Yang S, Auld T, Barrett R, Brodribb T, Choat B, Dun L, Ellsworth D, Gosper C, Guja L, Jordan GJ, Le Breton T, Leigh A, Lu-Irving P, Medlyn B, Nolan R, Ooi M, Sommerville KD, Vesk P, White M, Wright IJ, Falster DS. The AusTraits plant dictionary. Sci Data 2024;11:537. [PMID: 38796535 PMCID: PMC11127939 DOI: 10.1038/s41597-024-03368-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/09/2023] [Accepted: 05/10/2024] [Indexed: 05/28/2024] Open

Affiliation(s)

Elizabeth H Wenk Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia.
Hervé Sauquet Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia National Herbarium of NSW, Botanic Gardens of Sydney, Mount Annan, NSW, Australia
Rachael V Gallagher Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia
Rowan Brownlee Australian Research Data Commons, Caulfield East, Australia
Carl Boettiger Department of Environmental Science, Policy, & Management, University of California, Berkeley, USA
David Coleman Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia School of Natural Sciences, Macquarie University, Macquarie Park, Australia
Sophie Yang Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia
Tony Auld NSW Department of Planning and Environment, Parramatta, Australia University of Wollongong, Wollongong, Australia Centre for Ecosystem Science, University of New South Wales, Sydney, Australia
Russell Barrett Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia National Herbarium of NSW, Botanic Gardens of Sydney, Mount Annan, NSW, Australia
Timothy Brodribb School of Biological Sciences, University of Tasmania, Hobart, Australia
Brendan Choat Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia
Lily Dun Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia National Herbarium of NSW, Botanic Gardens of Sydney, Mount Annan, NSW, Australia Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia
David Ellsworth Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia
Carl Gosper Biodiversity and Conservation Science, Department of Biodiversity, Conservation and Attractions, Kensington, WA, Australia
Lydia Guja Centre for Australian National Biodiversity Research, Canberra, Australia National Seed Bank, Australian National Botanic Gardens, Department of Climate Change, Energy, the Environment and Water, Canberra, Australia
Gregory J Jordan School of Biological Sciences, University of Tasmania, Hobart, Australia
Tom Le Breton Centre for Ecosystem Science, University of New South Wales, Sydney, Australia
Andrea Leigh School of Life Sciences, University of Technology Sydney, Broadway, Australia
Patricia Lu-Irving National Herbarium of NSW, Botanic Gardens of Sydney, Mount Annan, NSW, Australia
Belinda Medlyn Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia
Rachael Nolan Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia
Mark Ooi Centre for Ecosystem Science, University of New South Wales, Sydney, Australia
Karen D Sommerville Australian PlantBank, Botanic Gardens of Sydney, Mount Annan, Australia
Peter Vesk School of Agriculture, Food and Ecosystem Sciences, University of Melbourne, Parkville, Australia
Matthew White Arthur Rylah Institute for Environmental Research, Victorian Department of Energy, Environment and Climate Action, East Melbourne, Australia
Ian J Wright Hawkesbury Institute for the Environment, Western Sydney University, Sydney, Australia School of Natural Sciences, Macquarie University, Macquarie Park, Australia
Daniel S Falster Evolution & Ecology Research Centre, University of New South Wales, Sydney, Australia


Alper P, Dĕd V, Herzinger S, Grouès V, Peter S, Lebioda J, Ebermann L, Popleteeva M, Barry ND, Welter D, Ghosh S, Becker R, Schneider R, Gu W, Trefois C, Satagopam V. DS-PACK: Tool assembly for the end-to-end support of controlled access human data sharing. Sci Data 2024;11:501. [PMID: 38750048 PMCID: PMC11096168 DOI: 10.1038/s41597-024-03326-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2023] [Accepted: 04/29/2024] [Indexed: 05/18/2024] Open

Affiliation(s)

Pinar Alper Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg. ELIXIR Luxembourg, Belvaux, Luxembourg.
Vilém Dĕd ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Sascha Herzinger ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Valentin Grouès ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Sarah Peter ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Jacek Lebioda Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Linda Ebermann Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Marina Popleteeva ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Nene Djenaba Barry Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Danielle Welter Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Soumyabrata Ghosh ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Regina Becker Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Reinhard Schneider ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Wei Gu Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Christophe Trefois Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Venkata Satagopam ELIXIR Luxembourg, Belvaux, Luxembourg. Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg.


van Rijn JPM, Martens M, Ammar A, Cimpan MR, Fessard V, Hoet P, Jeliazkova N, Murugadoss S, Vinković Vrček I, Willighagen EL. From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials. J Cheminform 2024;16:49. [PMID: 38693555 PMCID: PMC11064368 DOI: 10.1186/s13321-024-00833-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2023] [Accepted: 03/23/2024] [Indexed: 05/03/2024] Open

Galgonek J, Vondrášek J. The IDSM mass spectrometry extension: searching mass spectra using SPARQL. Bioinformatics 2024;40:btae174. [PMID: 38561173 PMCID: PMC11034985 DOI: 10.1093/bioinformatics/btae174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/07/2023] [Revised: 02/24/2024] [Accepted: 03/28/2024] [Indexed: 04/04/2024] Open

Niehues A, de Visser C, Hagenbeek FA, Kulkarni P, Pool R, Karu N, Kindt ASD, Singh G, Vermeiren RRJM, Boomsma DI, van Dongen J, ’t Hoen PAC, van Gool AJ. A multi-omics data analysis workflow packaged as a FAIR Digital Object. Gigascience 2024;13:giad115. [PMID: 38217405 PMCID: PMC10787363 DOI: 10.1093/gigascience/giad115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/13/2023] [Revised: 11/14/2023] [Accepted: 12/10/2023] [Indexed: 01/15/2024] Open

Affiliation(s)

Anna Niehues Department of Medical BioSciences, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, 6525 GA Nijmegen, the Netherlands
Casper de Visser Department of Medical BioSciences, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
Fiona A Hagenbeek Department of Biological Psychology, Vrije Universiteit Amsterdam, 1081 BT Amsterdam, The Netherlands Amsterdam Public Health Research Institute, 1081 BT Amsterdam, The Netherlands
Purva Kulkarni Department of Medical BioSciences, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, 6525 GA Nijmegen, the Netherlands Department of Human Genetics, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
René Pool Department of Biological Psychology, Vrije Universiteit Amsterdam, 1081 BT Amsterdam, The Netherlands Amsterdam Public Health Research Institute, 1081 BT Amsterdam, The Netherlands
Naama Karu Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden University, 2333 AL Leiden, The Netherlands
Alida S D Kindt Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden University, 2333 AL Leiden, The Netherlands
Gurnoor Singh Department of Medical BioSciences, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
Robert R J M Vermeiren Department of Child and Adolescent Psychiatry, LUMC-Curium, Leiden University Medical Center, 2342 AK Oegstgeest, The Netherlands
Dorret I Boomsma Department of Biological Psychology, Vrije Universiteit Amsterdam, 1081 BT Amsterdam, The Netherlands Amsterdam Public Health Research Institute, 1081 BT Amsterdam, The Netherlands Amsterdam Reproduction & Development (AR&D) Research Institute, 1081 BT Amsterdam, The Netherlands
Jenny van Dongen Department of Biological Psychology, Vrije Universiteit Amsterdam, 1081 BT Amsterdam, The Netherlands Amsterdam Public Health Research Institute, 1081 BT Amsterdam, The Netherlands Amsterdam Reproduction & Development (AR&D) Research Institute, 1081 BT Amsterdam, The Netherlands
Peter A C ’t Hoen Department of Medical BioSciences, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands
Alain J van Gool Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, 6525 GA Nijmegen, the Netherlands


Abad-Navarro F, Martínez-Costa C. A knowledge graph-based data harmonization framework for secondary data reuse. Comput Methods Programs Biomed 2024;243:107918. [PMID: 37981455 DOI: 10.1016/j.cmpb.2023.107918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/22/2023] [Revised: 10/02/2023] [Accepted: 11/05/2023] [Indexed: 11/21/2023]

Bernabé CH, Queralt-Rosinach N, Silva Souza VE, Bonino da Silva Santos LO, Mons B, Jacobsen A, Roos M. The use of foundational ontologies in biomedical research. J Biomed Semantics 2023;14:21. [PMID: 38082345 PMCID: PMC10712036 DOI: 10.1186/s13326-023-00300-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/04/2022] [Accepted: 11/29/2023] [Indexed: 12/18/2023] Open

Abstract

BACKGROUND

The FAIR principles recommend the use of controlled vocabularies, such as ontologies, to define data and metadata concepts. Ontologies are currently modelled following different approaches, sometimes describing conflicting definitions of the same concepts, which can affect interoperability. To cope with that, prior literature suggests organising ontologies in levels, where domain specific (low-level) ontologies are grounded in domain independent high-level ontologies (i.e., foundational ontologies). In this level-based organisation, foundational ontologies work as translators of intended meaning, thus improving interoperability. Despite their considerable acceptance in biomedical research, there are very few studies testing foundational ontologies. This paper describes a systematic literature mapping that was conducted to understand how foundational ontologies are used in biomedical research and to find empirical evidence supporting their claimed (dis)advantages.

RESULTS

From a set of 79 selected papers, we identified that foundational ontologies are used for several purposes: ontology construction, repair, mapping, and ontology-based data analysis. Foundational ontologies are claimed to improve interoperability, enhance reasoning, speed up ontology development and facilitate maintainability. The complexity of using foundational ontologies is the most commonly cited downside. Despite being used for several purposes, there were hardly any experiments (1 paper) testing the claims for or against the use of foundational ontologies. In the subset of 49 papers that describe the development of an ontology, we observed low adherence to formal methods for ontology construction (16 papers) and ontology evaluation (4 papers).

CONCLUSION

Our findings have two main implications. First, the lack of empirical evidence about the use of foundational ontologies indicates a need for evaluating the use of such artefacts in biomedical research. Second, the low adherence to formal methods illustrates how the field could benefit from a more systematic approach when dealing with the development and evaluation of ontologies. The understanding of how foundational ontologies are used in the biomedical field can drive future research towards the improvement of ontologies and, consequently, data FAIRness. The adoption of formal methods can impact the quality and sustainability of ontologies, and reusing these methods from other fields is encouraged.


Seneviratne O, Das AK, Chari S, Agu NN, Rashid SM, McCusker J, Franklin JS, Qi M, Bennett KP, Chen CH, Hendler JA, McGuinness DL. Semantically enabling clinical decision support recommendations. J Biomed Semantics 2023;14:8. [PMID: 37464259 DOI: 10.1186/s13326-023-00285-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/30/2021] [Accepted: 03/28/2023] [Indexed: 07/20/2023] Open

Abstract

BACKGROUND

Clinical decision support systems have been widely deployed to guide healthcare decisions on patient diagnosis, treatment choices, and patient management through evidence-based recommendations. These recommendations are typically derived from clinical practice guidelines created by clinical specialties or healthcare organizations. Although there have been many different technical approaches to encoding guideline recommendations into decision support systems, much of the previous work has not focused on enabling system generated recommendations through the formalization of changes in a guideline, the provenance of a recommendation, and applicability of the evidence. Prior work indicates that healthcare providers may not find that guideline-derived recommendations always meet their needs for reasons such as lack of relevance, transparency, time pressure, and applicability to their clinical practice.

RESULTS

We introduce several semantic techniques that model diseases based on clinical practice guidelines, provenance of the guidelines, and the study cohorts they are based on to enhance the capabilities of clinical decision support systems. We have explored ways to enable clinical decision support systems with semantic technologies that can represent and link to details in related items from the scientific literature and quickly adapt to changing information from the guidelines, identifying gaps, and supporting personalized explanations. Previous semantics-driven clinical decision systems have limited support in all these aspects, and we present the ontologies and semantic web based software tools in three distinct areas that are unified using a standard set of ontologies and a custom-built knowledge graph framework: (i) guideline modeling to characterize diseases, (ii) guideline provenance to attach evidence to treatment decisions from authoritative sources, and (iii) study cohort modeling to identify relevant research publications for complicated patients.

CONCLUSIONS

We have enhanced existing, evidence-based knowledge by developing ontologies and software that enables clinicians to conveniently access updates to and provenance of guidelines, as well as gather additional information from research studies applicable to their patients' unique circumstances. Our software solutions leverage many well-used existing biomedical ontologies and build upon decades of knowledge representation and reasoning work, leading to explainable results.


Queder N, Tien VB, Abraham SA, Urchs SGW, Helmer KG, Chaplin D, van Erp TGM, Kennedy DN, Poline JB, Grethe JS, Ghosh SS, Keator DB. NIDM-Terms: community-based terminology management for improved neuroimaging dataset descriptions and query. Front Neuroinform 2023;17:1174156. [PMID: 37533796 PMCID: PMC10392125 DOI: 10.3389/fninf.2023.1174156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2023] [Accepted: 06/27/2023] [Indexed: 08/04/2023] Open

Affiliation(s)

Nazek Queder Department of Psychiatry and Human Behavior, School of Medicine, University of California, Irvine, Irvine, CA, United States Department of Neurobiology and Behavior and Center for the Neurobiology of Learning and Memory, University of California, Irvine, Irvine, CA, United States
Vivian B. Tien Fairmont Preparatory Academy, Anaheim, CA, United States
Sanu Ann Abraham McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, United States
Sebastian Georg Wenzel Urchs NeuroDataScience–ORIGAMI Laboratory, McConnell Brain Imaging Centre, The Neuro (Montreal Neurological Institute-Hospital), Faculty of Medicine, McGill University, Montreal, QC, Canada
Karl G. Helmer Massachusetts General Hospital, Boston, MA, United States Harvard Medical School, Boston, MA, United States
Derek Chaplin Massachusetts General Hospital, Boston, MA, United States
Theo G. M. van Erp Clinical Translational Neuroscience Laboratory, Department of Psychiatry and Human Behavior, School of Medicine, University of California, Irvine, Irvine, CA, United States Center for the Neurobiology of Learning and Memory, University of California, Irvine, Irvine, CA, United States
David N. Kennedy Departments of Psychiatry and Radiology, University of Massachusetts Chan Medical School, Worcester, MA, United States
Jean-Baptiste Poline NeuroDataScience–ORIGAMI Laboratory, McConnell Brain Imaging Centre, The Neuro (Montreal Neurological Institute-Hospital), Faculty of Medicine, McGill University, Montreal, QC, Canada
Jeffrey S. Grethe Department of Neurosciences, School of Medicine, University of California, San Diego, San Diego, CA, United States
Satrajit S. Ghosh McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, United States
David B. Keator Department of Psychiatry and Human Behavior, School of Medicine, University of California, Irvine, Irvine, CA, United States


Zhang S, Benis N, Cornet R. Automated approach for quality assessment of RDF resources. BMC Med Inform Decis Mak 2023;23:90. [PMID: 37165363 PMCID: PMC10170671 DOI: 10.1186/s12911-023-02182-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/18/2022] [Accepted: 04/20/2023] [Indexed: 05/12/2023] Open

Abstract

INTRODUCTION

The Semantic Web community provides a common Resource Description Framework (RDF) that allows representation of resources such that they can be linked. To maximize the potential of linked data - machine-actionable interlinked resources on the Web - a certain level of quality of RDF resources should be established, particularly in the biomedical domain in which concepts are complex and high-quality biomedical ontologies are in high demand. However, it is unclear which quality metrics for RDF resources exist that can be automated, which is required given the multitude of RDF resources. Therefore, we aim to determine these metrics and demonstrate an automated approach to assess such metrics of RDF resources.

METHODS

An initial set of metrics is identified through literature, standards, and existing tooling. Of these, metrics are selected that fulfil these criteria: (1) objective; (2) automatable; and (3) foundational. Selected metrics are represented in RDF and semantically aligned to existing standards. These metrics are then implemented in an open-source tool. To demonstrate the tool, eight commonly used RDF resources were assessed, including data models in the healthcare domain (HL7 RIM, HL7 FHIR, CDISC CDASH), ontologies (DCT, SIO, FOAF, ORDO), and a metadata profile (GRDDL).

RESULTS

Six objective metrics are identified in 3 categories: Resolvability (1), Parsability (1), and Consistency (4), and represented in RDF. The tool demonstrates that these metrics can be automated, and application in the healthcare domain shows non-resolvable URIs (ranging from 0.3% to 97%) among all eight resources and undefined URIs in HL7 RIM, and FHIR. In the tested resources no errors were found for parsability and the other three consistency metrics for correct usage of classes and properties.

CONCLUSION

We extracted six objective and automatable metrics from literature, as the foundational quality requirements of RDF resources to maximize the potential of linked data. Automated tooling to assess resources has been shown to be effective in identifying quality issues that must be avoided. This approach can be expanded to incorporate more automatable metrics so as to reflect additional quality dimensions with the assessment tool implementing more metrics.


Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B, Zaslavsky L, Zhang J, Bolton EE. PubChem 2023 update. Nucleic Acids Res 2022;51:D1373-D1380. [PMID: 36305812 PMCID: PMC9825602 DOI: 10.1093/nar/gkac956] [Citation(s) in RCA: 655] [Impact Index Per Article: 327.5] [Received: 09/15/2022] [Revised: 10/06/2022] [Accepted: 10/13/2022] [Indexed: 01/30/2023] Open

Affiliation(s)

Sunghwan Kim National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Jie Chen National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Tiejun Cheng National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Asta Gindulyte National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Jia He National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Siqian He National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Qingliang Li National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Benjamin A Shoemaker National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Paul A Thiessen National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Bo Yu National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Leonid Zaslavsky National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Jian Zhang National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, 20894, USA
Evan E Bolton To whom correspondence should be addressed. Tel: +1 301 451 1811; Fax: +1 301 480 4559;


Wood EC, Glen AK, Kvarfordt LG, Womack F, Acevedo L, Yoon TS, Ma C, Flores V, Sinha M, Chodpathumwan Y, Termehchy A, Roach JC, Mendoza L, Hoffman AS, Deutsch EW, Koslicki D, Ramsey SA. RTX-KG2: a system for building a semantically standardized knowledge graph for translational biomedicine. BMC Bioinformatics 2022;23:400. [PMID: 36175836 PMCID: PMC9520835 DOI: 10.1186/s12859-022-04932-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 03/25/2022] [Accepted: 09/14/2022] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Biomedical translational science is increasingly using computational reasoning on repositories of structured knowledge (such as UMLS, SemMedDB, ChEMBL, Reactome, DrugBank, and SMPDB) in order to facilitate discovery of new therapeutic targets and modalities. The NCATS Biomedical Data Translator project is working to federate autonomous reasoning agents and knowledge providers within a distributed system for answering translational questions. Within that project and the broader field, there is a need for a framework that can efficiently and reproducibly build an integrated, standards-compliant, and comprehensive biomedical knowledge graph that can be downloaded in standard serialized form or queried via a public application programming interface (API).

RESULTS

To create a knowledge provider system within the Translator project, we have developed RTX-KG2, an open-source software system for building (and hosting a web API for querying) a biomedical knowledge graph that uses an Extract-Transform-Load approach to integrate 70 knowledge sources (including the aforementioned core six sources) into a knowledge graph with provenance information including (where available) citations. The semantic layer and schema for RTX-KG2 follow the standard Biolink model to maximize interoperability. RTX-KG2 is currently being used by multiple Translator reasoning agents, both in its downloadable form and via its SmartAPI-registered interface. Serializations of RTX-KG2 are available for download in both the pre-canonicalized form and in canonicalized form (in which synonyms are merged). The current canonicalized version (KG2.7.3) of RTX-KG2 contains 6.4M nodes and 39.3M edges with a hierarchy of 77 relationship types from Biolink.

CONCLUSION

RTX-KG2 is the first knowledge graph that integrates UMLS, SemMedDB, ChEMBL, DrugBank, Reactome, SMPDB, and 64 additional knowledge sources within a knowledge graph that conforms to the Biolink standard for its semantic layer and schema. RTX-KG2 is publicly available for querying via its API at arax.rtx.ai/api/rtxkg2/v1.2/openapi.json. The code to build RTX-KG2 is publicly available at github:RTXteam/RTX-KG2.


Affiliation(s)

E C Wood School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Amy K Glen School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA.
Lindsey G Kvarfordt School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Finn Womack Computer Science and Engineering, Penn State University, State College, PA, USA
Liliana Acevedo School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Timothy S Yoon School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Chunyu Ma Huck Institutes of the Life Sciences, Penn State University, State College, PA, USA
Veronica Flores School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Meghamala Sinha School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Yodsawalai Chodpathumwan King Mongkut's University of Technology North Bangkok, Bangkok, Thailand
Arash Termehchy School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Jared C Roach Institute for Systems Biology, Seattle, WA, USA
Luis Mendoza Institute for Systems Biology, Seattle, WA, USA
Andrew S Hoffman Interdisciplinary Hub for Digitalization and Society, Radboud University, Nijmegen, The Netherlands
Eric W Deutsch Institute for Systems Biology, Seattle, WA, USA
David Koslicki Computer Science and Engineering, Penn State University, State College, PA, USA; Huck Institutes of the Life Sciences, Penn State University, State College, PA, USA; Department of Biology, Penn State University, State College, PA, USA
Stephen A Ramsey School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA; Department of Biomedical Sciences, Oregon State University, Corvallis, OR, USA

The Representation of Causality and Causation with Ontologies: A Systematic Literature Review. Online J Public Health Inform 2022;14:e4. [PMID: 36120162 PMCID: PMC9473331 DOI: 10.5210/ojphi.v14i1.12577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/11/2022] Open

Unni DR, Moxon SAT, Bada M, Brush M, Bruskiewich R, Caufield JH, Clemons PA, Dancik V, Dumontier M, Fecho K, Glusman G, Hadlock JJ, Harris NL, Joshi A, Putman T, Qin G, Ramsey SA, Shefchek KA, Solbrig H, Soman K, Thessen AE, Haendel MA, Bizon C, Mungall CJ. Biolink Model: A universal schema for knowledge graphs in clinical, biomedical, and translational science. Clin Transl Sci 2022;15:1848-1855. [PMID: 36125173 PMCID: PMC9372416 DOI: 10.1111/cts.13302] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Received: 03/04/2022] [Revised: 04/27/2022] [Accepted: 05/02/2022] [Indexed: 12/12/2022] Open

Abstract

Within clinical, biomedical, and translational science, an increasing number of projects are adopting graphs for knowledge representation. Graph-based data models elucidate the interconnectedness among core biomedical concepts, enable data structures to be easily updated, and support intuitive queries, visualizations, and inference algorithms. However, knowledge discovery across these "knowledge graphs" (KGs) has remained difficult. Data set heterogeneity and complexity; the proliferation of ad hoc data formats; poor compliance with guidelines on findability, accessibility, interoperability, and reusability; and, in particular, the lack of a universally accepted, open-access model for standardization across biomedical KGs has left the task of reconciling data sources to downstream consumers. Biolink Model is an open-source data model that can be used to formalize the relationships between data structures in translational science. It incorporates object-oriented classification and graph-oriented features. The core of the model is a set of hierarchical, interconnected classes (or categories) and relationships between them (or predicates) representing biomedical entities such as gene, disease, chemical, anatomic structure, and phenotype. The model provides class and edge attributes and associations that guide how entities should relate to one another. Here, we highlight the need for a standardized data model for KGs, describe Biolink Model, and compare it with other models. We demonstrate the utility of Biolink Model in various initiatives, including the Biomedical Data Translator Consortium and the Monarch Initiative, and show how it has supported easier integration and interoperability of biomedical KGs, bringing together knowledge from multiple sources and helping to realize the goals of translational science.

Affiliation(s)

Deepak R. Unni Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Sierra A. T. Moxon Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Michael Bada Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
Matthew Brush Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
Richard Bruskiewich Star Informatics, Sooke, British Columbia, Canada
J. Harry Caufield Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Paul A. Clemons Chemical Biology and Therapeutics Science Program, Broad Institute, Cambridge, Massachusetts, USA
Vlado Dancik Chemical Biology and Therapeutics Science Program, Broad Institute, Cambridge, Massachusetts, USA
Michel Dumontier Institute of Data Science, Maastricht University, Maastricht, The Netherlands
Karamarie Fecho Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Gustavo Glusman Institute for Systems Biology, Seattle, Washington, USA
Jennifer J. Hadlock Institute for Systems Biology, Seattle, Washington, USA
Nomi L. Harris Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Arpita Joshi Institute for Systems Biology, Seattle, Washington, USA
Tim Putman Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
Guangrong Qin Institute for Systems Biology, Seattle, Washington, USA
Stephen A. Ramsey Department of Biomedical Sciences, Oregon State University, Corvallis, Oregon, USA
Kent A. Shefchek Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
Harold Solbrig Johns Hopkins University, Baltimore, Maryland, USA
Karthik Soman Department of Neurology, University of California San Francisco, San Francisco, California, USA
Anne E. Thessen Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
Melissa A. Haendel Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
Chris Bizon Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Christopher J. Mungall Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
The Biomedical Data Translator Consortium

Blagec K, Barbosa-Silva A, Ott S, Samwald M. A curated, ontology-based, large-scale knowledge graph of artificial intelligence tasks and benchmarks. Sci Data 2022;9:322. [PMID: 35715466 PMCID: PMC9205953 DOI: 10.1038/s41597-022-01435-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 10/19/2021] [Accepted: 05/30/2022] [Indexed: 11/22/2022] Open

Deagen ME, McCusker JP, Fateye T, Stouffer S, Brinson LC, McGuinness DL, Schadler LS. FAIR and Interactive Data Graphics from a Scientific Knowledge Graph. Sci Data 2022;9:239. [PMID: 35624233 PMCID: PMC9142568 DOI: 10.1038/s41597-022-01352-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2021] [Accepted: 04/26/2022] [Indexed: 11/16/2022] Open

Umberfield EE, Stansbury C, Ford K, Jiang Y, Kardia SLR, Thomer AK, Harris MR. Evaluating and Extending the Informed Consent Ontology for Representing Permissions from the Clinical Domain. APPLIED ONTOLOGY 2022;17:321-336. [PMID: 36312514 PMCID: PMC9616177 DOI: 10.3233/ao-210260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/16/2023]

van der Velde KJ, Singh G, Kaliyaperumal R, Liao X, de Ridder S, Rebers S, Kerstens HHD, de Andrade F, van Reeuwijk J, De Gruyter FE, Hiltemann S, Ligtvoet M, Weiss MM, van Deutekom HWM, Jansen AML, Stubbs AP, Vissers LELM, Laros JFJ, van Enckevort E, Stemkens D, 't Hoen PAC, Beliën JAM, van Gijn ME, Swertz MA. FAIR Genomes metadata schema promoting Next Generation Sequencing data reuse in Dutch healthcare and research. Sci Data 2022;9:169. [PMID: 35418585 PMCID: PMC9008059 DOI: 10.1038/s41597-022-01265-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 10/20/2021] [Accepted: 03/25/2022] [Indexed: 11/08/2022] Open

Affiliation(s)

K Joeri van der Velde University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands; University of Groningen and University Medical Center Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
Gurnoor Singh Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands
Rajaram Kaliyaperumal Leiden University Medical Center, Department of Human Genetics, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
XiaoFeng Liao Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands
Sander de Ridder Amsterdam University Medical Center, University of Amsterdam, Department of Pathology, Meibergdreef 9, 1105 AZ, Amsterdam, The Netherlands
Susanne Rebers The Netherlands Cancer Institute, Division of Molecular Pathology, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
Hindrik H D Kerstens Prinses Máxima Center for Pediatric Oncology, Kemmeren group, Heidelberglaan 25, 3584 CS, Utrecht, The Netherlands
Fernanda de Andrade University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
Jeroen van Reeuwijk Radboud University Medical Center, Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
Fini E De Gruyter University Medical Center Utrecht, Department of Genetics, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
Saskia Hiltemann Erasmus Medical Center, Department of Pathology, Doctor Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
Maarten Ligtvoet Nictiz - Dutch competence centre for electronic exchange of health and care information, Oude Middenweg 55, 2491 AC, The Hague, The Netherlands
Marjan M Weiss Radboud University Medical Center, Department of Human Genetics, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
Hanneke W M van Deutekom University Medical Center Utrecht, Department of Genetics, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
Anne M L Jansen University Medical Center Utrecht, Department of Pathology, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
Andrew P Stubbs Erasmus Medical Center, Department of Pathology, Doctor Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
Lisenka E L M Vissers Radboud University Medical Center, Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
Jeroen F J Laros Leiden University Medical Center, Department of Human Genetics, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands; Leiden University Medical Center, Department of Clinical Genetics, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands; Rijksinstituut voor Volksgezondheid en Milieu, Antonie van Leeuwenhoeklaan 9, 3721 MA, Bilthoven, The Netherlands
Esther van Enckevort University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
Daphne Stemkens VSOP - Patient Alliance for Rare and Genetic Diseases The Netherlands, Koninginnelaan 23, 3762 DA, Soest, The Netherlands
Peter A C 't Hoen Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands
Jeroen A M Beliën Amsterdam University Medical Center, Vrije Universiteit Amsterdam, Department of Pathology, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
Mariëlle E van Gijn University of Groningen and University Medical Center Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
Morris A Swertz University of Groningen and University Medical Center Groningen, Genomics Coordination Center, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands; University of Groningen and University Medical Center Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands

Marchesin S, Silvello G. TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction. BMC Bioinformatics 2022;23:111. [PMID: 35361129 PMCID: PMC8973894 DOI: 10.1186/s12859-022-04646-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 10/29/2021] [Accepted: 03/22/2022] [Indexed: 01/12/2023] Open

Strömert P, Hunold J, Castro A, Neumann S, Koepler O. Ontologies4Chem: the landscape of ontologies in chemistry. PURE APPL CHEM 2022. [DOI: 10.1515/pac-2021-2007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/15/2022]

Kaliyaperumal R, Wilkinson MD, Moreno PA, Benis N, Cornet R, Dos Santos Vieira B, Dumontier M, Bernabé CH, Jacobsen A, Le Cornec CMA, Godoy MP, Queralt-Rosinach N, Schultze Kool LJ, Swertz MA, van Damme P, van der Velde KJ, Lalout N, Zhang S, Roos M. Semantic modelling of common data elements for rare disease registries, and a prototype workflow for their deployment over registry data. J Biomed Semantics 2022;13:9. [PMID: 35292119 PMCID: PMC8922780 DOI: 10.1186/s13326-022-00264-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 07/26/2021] [Accepted: 02/23/2022] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The European Platform on Rare Disease Registration (EU RD Platform) aims to address the fragmentation of European rare disease (RD) patient data, scattered among hundreds of independent and non-coordinating registries, by establishing standards for integration and interoperability. The first practical output of this effort was a set of 16 Common Data Elements (CDEs) that should be implemented by all RD registries. Interoperability, however, requires decisions beyond data elements - including data models, formats, and semantics. Within the European Joint Programme on Rare Diseases (EJP RD), we aim to further the goals of the EU RD Platform by generating reusable RD semantic model templates that follow the FAIR Data Principles.

RESULTS

Through a team-based iterative approach, we created semantically grounded models to represent each of the CDEs, using the SemanticScience Integrated Ontology as the core framework for representing the entities and their relationships. Within that framework, we mapped the concepts represented in the CDEs, and their possible values, into domain ontologies such as the Orphanet Rare Disease Ontology, Human Phenotype Ontology and National Cancer Institute Thesaurus. Finally, we created an exemplar, reusable ETL pipeline that we will be deploying over these non-coordinating data repositories to assist them in creating model-compliant FAIR data without requiring site-specific coding nor expertise in Linked Data or FAIR.

CONCLUSIONS

Within the EJP RD project, we determined that creating reusable, expert-designed templates reduced or eliminated the requirement for our participating biomedical domain experts and rare disease data hosts to understand OWL semantics. This enabled them to publish highly expressive FAIR data using tools and approaches that were already familiar to them.

Affiliation(s)

Rajaram Kaliyaperumal Leiden University Medical Center, Leiden, The Netherlands
Mark D Wilkinson Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Centro de Biotecnología y Genómica de Plantas (CBGP), Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Pozuelo de Alarcón, Madrid, ES, Spain
Pablo Alarcón Moreno Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Centro de Biotecnología y Genómica de Plantas (CBGP), Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Pozuelo de Alarcón, Madrid, ES, Spain
Nirupama Benis Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
Ronald Cornet Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
Bruna Dos Santos Vieira Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands; Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Nijmegen, The Netherlands
Michel Dumontier Institute of Data Science, Paul-Henri Spaaklaan 1, Maastricht University, 6229EN, Maastricht, The Netherlands
César Henrique Bernabé Leiden University Medical Center, Leiden, The Netherlands
Annika Jacobsen Leiden University Medical Center, Leiden, The Netherlands
Clémence M A Le Cornec Division of Paediatric Nephrology, Centre for Paediatrics and Adolescent Medicine, University of Heidelberg, Heidelberg, Germany
Mario Prieto Godoy Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Centro de Biotecnología y Genómica de Plantas (CBGP), Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Pozuelo de Alarcón, Madrid, ES, Spain
Núria Queralt-Rosinach Leiden University Medical Center, Leiden, The Netherlands
Leo J Schultze Kool Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands
Morris A Swertz University of Groningen and University Medical Center Groningen, Genomics Coordination Center and Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
Philip van Damme Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
K Joeri van der Velde University of Groningen and University Medical Center Groningen, Genomics Coordination Center and Department of Genetics, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
Nawel Lalout Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Nijmegen, The Netherlands; Duchenne Parent Project, Veenendaal, The Netherlands
Shuxin Zhang Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam UMC, University of Amsterdam, Meibergdreef 9, Amsterdam, The Netherlands
Marco Roos Leiden University Medical Center, Leiden, The Netherlands

Mortensen HM, Martens M, Senn J, Levey T, Evelo CT, Willighagen EL, Exner T. The AOP-DB RDF: Applying FAIR Principles to the Semantic Integration of AOP Data Using the Research Description Framework. FRONTIERS IN TOXICOLOGY 2022;4:803983. [PMID: 35295213 PMCID: PMC8915825 DOI: 10.3389/ftox.2022.803983] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/28/2021] [Accepted: 01/13/2022] [Indexed: 01/12/2023] Open

Kim S, Cheng T, He S, Thiessen PA, Li Q, Gindulyte A, Bolton EE. PubChem Protein, Gene, Pathway, and Taxonomy Data Collections: Bridging Biology and Chemistry through Target-Centric Views of PubChem Data. J Mol Biol 2022;434:167514. [DOI: 10.1016/j.jmb.2022.167514] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 11/24/2021] [Revised: 02/17/2022] [Accepted: 02/22/2022] [Indexed: 12/21/2022]

Ławrynowicz A, Wróblewska A, Adrian WT, Kulczyński B, Gramza-Michałowska A. Food Recipe Ingredient Substitution Ontology Design Pattern. SENSORS 2022;22:s22031095. [PMID: 35161841 PMCID: PMC8837940 DOI: 10.3390/s22031095] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/31/2021] [Revised: 01/27/2022] [Accepted: 01/28/2022] [Indexed: 11/29/2022]

Dealing with the Ambiguity of Glycan Substructure Search. MOLECULES (BASEL, SWITZERLAND) 2021;27:molecules27010065. [PMID: 35011294 PMCID: PMC8746581 DOI: 10.3390/molecules27010065] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/02/2021] [Revised: 12/17/2021] [Accepted: 12/17/2021] [Indexed: 01/15/2023]

Wan L, Song J, He V, Roman J, Whah G, Peng S, Zhang L, He Y. Development of the International Classification of Diseases Ontology (ICDO) and its application for COVID-19 diagnostic data analysis. BMC Bioinformatics 2021;22:508. [PMID: 34663204 PMCID: PMC8522253 DOI: 10.1186/s12859-021-04402-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/09/2021] [Accepted: 09/24/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The 10th and 9th revisions of the International Statistical Classification of Diseases and Related Health Problems (ICD10 and ICD9) have been adopted worldwide as a well-recognized norm to share codes for diseases, signs and symptoms, abnormal findings, etc. The international Consortium for Clinical Characterization of COVID-19 by EHR (4CE) website stores COVID-19 diagnosis data using ICD10 and ICD9 codes. However, the ICD systems are difficult to decode due to their many shortcomings, which can be addressed using ontology.

METHODS

An ICD ontology (ICDO) was developed to logically and scientifically represent ICD terms and the relations among them. ICDO is also aligned with the Basic Formal Ontology (BFO) and reuses terms from existing ontologies. As a use case, the ICD10 and ICD9 diagnosis data from the 4CE website were extracted, mapped to ICDO, and analyzed using ICDO.

RESULTS

We have developed the ICDO to ontologize the ICD terms and relations. Different from existing disease ontologies, all ICD diseases in ICDO are defined as disease processes to describe their occurrence with other properties. The ICDO decomposes each disease term into different components, including anatomic entities, process profiles, etiological causes, output phenotype, etc. Over 900 ICD terms have been represented in ICDO. Many ICDO terms are presented in both English and Chinese. The ICD10/ICD9-based diagnosis data of over 27,000 COVID-19 patients from 5 countries were extracted from the 4CE. A total of 917 COVID-19-related disease codes, each of which was associated with 1 or more cases in the 4CE dataset, were mapped to ICDO and further analyzed using the ICDO logical annotations. Our study showed that COVID-19 targeted multiple systems and organs such as the lung, heart, and kidney. Different acute and chronic kidney phenotypes were identified. Some kidney diseases appeared to result from other diseases, such as diabetes. Some of the findings could only be easily found using ICDO instead of ICD9/10.

CONCLUSIONS

ICDO was developed to ontologize ICD9/10 codes and applied to study COVID-19 patient diagnosis data. Our findings showed that ICDO provides a semantic platform for more accurate detection of disease profiles.

Delmas M, Filangi O, Paulhe N, Vinson F, Duperier C, Garrier W, Saunier PE, Pitarch Y, Jourdan F, Giacomoni F, Frainay C. FORUM: Building a Knowledge Graph from public databases and scientific literature to extract associations between chemicals and diseases. Bioinformatics 2021;37:3896-3904. [PMID: 34478489 PMCID: PMC8570811 DOI: 10.1093/bioinformatics/btab627] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 03/02/2021] [Revised: 08/16/2021] [Accepted: 09/01/2021] [Indexed: 11/22/2022] Open

Galgonek J, Vondrášek J. IDSM ChemWebRDF: SPARQLing small-molecule datasets. J Cheminform 2021;13:38. [PMID: 33980298 PMCID: PMC8117646 DOI: 10.1186/s13321-021-00515-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 01/06/2021] [Accepted: 04/23/2021] [Indexed: 11/12/2022] Open

Abstract

The Resource Description Framework (RDF), together with well-defined ontologies, significantly increases data interoperability and usability. The SPARQL query language was introduced to retrieve requested RDF data and to explore links between them. Among other useful features, SPARQL supports federated queries that combine multiple independent data source endpoints. This allows users to obtain insights that are not possible using only a single data source. Owing to all of these useful features, many biological and chemical databases present their data in RDF, and support SPARQL querying. In our project, we primarily focused on PubChem, ChEMBL and ChEBI small-molecule datasets. These datasets are already being exported to RDF by their creators. However, none of them has an official and currently supported SPARQL endpoint. This omission makes it difficult to construct complex or federated queries that could access all of the datasets, thus underutilising the main advantage of the availability of RDF data. Our goal is to address this gap by integrating the datasets into one database called the Integrated Database of Small Molecules (IDSM) that will be accessible through a SPARQL endpoint. Beyond that, we will also focus on increasing mutual interoperability of the datasets. To realise the endpoint, we decided to implement an in-house developed SPARQL engine based on the PostgreSQL relational database for data storage. In our approach, data are stored in the traditional relational form, and the SPARQL engine translates incoming SPARQL queries into equivalent SQL queries. An important feature of the engine is that it optimises the resulting SQL queries. Together with optimisations performed by PostgreSQL, this allows efficient evaluations of SPARQL queries. The endpoint provides not only querying in the dataset, but also the compound substructure and similarity search supported by our Sachem project. Although the endpoint is accessible from an internet browser, it is mainly intended to be used for programmatic access by other services, for example as a part of federated queries. For regular users, we offer a rich web application called ChemWebRDF using the endpoint. The application is publicly available at https://idsm.elixir-czech.cz/chemweb/.

A resource to explore the discovery of rare diseases and their causative genes. Sci Data 2021;8:124. [PMID: 33947870 PMCID: PMC8096966 DOI: 10.1038/s41597-021-00905-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 10/12/2020] [Accepted: 03/26/2021] [Indexed: 12/28/2022] Open

Kamdar MR, Musen MA. An empirical meta-analysis of the life sciences linked open data on the web. Sci Data 2021;8:24. [PMID: 33479214 PMCID: PMC7819992 DOI: 10.1038/s41597-021-00797-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/04/2020] [Accepted: 12/04/2020] [Indexed: 01/29/2023] Open

Irshad O, Ghani Khan MU. Formalization and Semantic Integration of Heterogeneous Omics Annotations for Exploratory Searches. Curr Bioinform 2021. [DOI: 10.2174/1574893615666200127122818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]

Abstract

AIM

To facilitate researchers and practitioners for unveiling the mysterious functional aspects of human cellular system through performing exploratory searching on semantically integrated heterogeneous and geographically dispersed omics annotations.

BACKGROUND

Improving health standards of life is one of the motives which continuously instigates researchers and practitioners to strive for uncovering the mysterious aspects of human cellular system. Inferring new knowledge from known facts always requires reasonably large amount of data in well-structured, integrated and unified form. Due to the advent of especially high throughput and sensor technologies, biological data is growing heterogeneously and geographically at astronomical rate. Several data integration systems have been deployed to cope with the issues of data heterogeneity and global dispersion. Systems based on semantic data integration models are more flexible and expandable than syntax-based ones but still lack aspect-based data integration, persistence and querying. Furthermore, these systems do not fully support to warehouse biological entities in the form of semantic associations as naturally possessed by the human cell.

OBJECTIVE

To develop aspect-oriented formal data integration model for semantically integrating heterogeneous and geographically dispersed omics annotations for providing exploratory querying on integrated data.

METHOD

We propose an aspect-oriented formal data integration model which uses web semantics standards to formally specify its each construct. Proposed model supports aspect-oriented representation of biological entities while addressing the issues of data heterogeneity and global dispersion. It associates and warehouses biological entities in the way they relate with

RESULT

To show the significance of proposed model, we developed a data warehouse and information retrieval system based on proposed model compliant multi-layered and multi-modular software architecture. Results show that our model supports well for gathering, associating, integrating, persisting and querying each entity with respect to its all possible aspects within or across the various associated omics layers.

CONCLUSION

Formal specifications better facilitate for addressing data integration issues by providing formal means for understanding omics data based on meaning instead of syntax

van der Velde KJ, van den Hoek S, van Dijk F, Hendriksen D, van Diemen CC, Johansson LF, Abbott KM, Deelen P, Sikkema‐Raddatz B, Swertz MA. A pipeline-friendly software tool for genome diagnostics to prioritize genes by matching patient symptoms to literature. Advanced Genetics (Hoboken, N.J.) 2020;1:e10023. [PMID: 36619248 PMCID: PMC9744518 DOI: 10.1002/ggn2.10023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 10/18/2019] [Revised: 02/12/2020] [Accepted: 03/20/2020] [Indexed: 04/11/2023]

Affiliation(s)

K. Joeri van der Velde Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands; Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Sander van den Hoek Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Freerk van Dijk Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands; Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands; Prinses Maxima Center for Child Oncology, Utrecht, The Netherlands
Dennis Hendriksen Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Cleo C. van Diemen Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Lennart F. Johansson Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands; Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Kristin M. Abbott Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Patrick Deelen Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands; Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Birgit Sikkema‐Raddatz Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands
Morris A. Swertz Genomics Coordination Center, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands; Department of Genetics, University of Groningen and University Medical Center Groningen, Groningen, The Netherlands

Rashid SM, McCusker JP, Pinheiro P, Bax MP, Santos H, Stingone JA, Das AK, McGuinness DL. The Semantic Data Dictionary - An Approach for Describing and Annotating Data. Data Intelligence 2020;2:443-486. [PMID: 33103120 DOI: 10.1162/dint_a_00058] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Indexed: 11/04/2022]

Brinson LC, Deagen M, Chen W, McCusker J, McGuinness DL, Schadler LS, Palmeri M, Ghumman U, Lin A, Hu B. Polymer Nanocomposite Data: Curation, Frameworks, Access, and Potential for Discovery and Design. ACS Macro Lett 2020;9:1086-1094. [PMID: 35653211 DOI: 10.1021/acsmacrolett.0c00264] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Indexed: 12/22/2022]

Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J, Ronzano F, Centeno E, Sanz F, Furlong LI. The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res 2020;48:D845-D855. [PMID: 31680165 PMCID: PMC7145631 DOI: 10.1093/nar/gkz1021] [Citation(s) in RCA: 819] [Impact Index Per Article: 204.8] [Received: 09/14/2019] [Revised: 10/14/2019] [Accepted: 10/18/2019] [Indexed: 02/07/2023]

Vos RA, Katayama T, Mishima H, Kawano S, Kawashima S, Kim JD, Moriya Y, Tokimatsu T, Yamaguchi A, Yamamoto Y, Wu H, Amstutz P, Antezana E, Aoki NP, Arakawa K, Bolleman JT, Bolton E, Bonnal RJP, Bono H, Burger K, Chiba H, Cohen KB, Deutsch EW, Fernández-Breis JT, Fu G, Fujisawa T, Fukushima A, García A, Goto N, Groza T, Hercus C, Hoehndorf R, Itaya K, Juty N, Kawashima T, Kim JH, Kinjo AR, Kotera M, Kozaki K, Kumagai S, Kushida T, Lütteke T, Matsubara M, Miyamoto J, Mohsen A, Mori H, Naito Y, Nakazato T, Nguyen-Xuan J, Nishida K, Nishida N, Nishide H, Ogishima S, Ohta T, Okuda S, Paten B, Perret JL, Prathipati P, Prins P, Queralt-Rosinach N, Shinmachi D, Suzuki S, Tabata T, Takatsuki T, Taylor K, Thompson M, Uchiyama I, Vieira B, Wei CH, Wilkinson M, Yamada I, Yamanaka R, Yoshitake K, Yoshizawa AC, Dumontier M, Kosaki K, Takagi T. BioHackathon 2015: Semantics of data for life sciences and reproducible research. F1000Res 2020;9:136. [PMID: 32308977 PMCID: PMC7141167 DOI: 10.12688/f1000research.18236.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Accepted: 02/05/2020] [Indexed: 01/08/2023]

Affiliation(s)

Rutger A. Vos Institute of Biology Leiden, Leiden University, Leiden, The Netherlands; Naturalis Biodiversity Center, Leiden, The Netherlands
Toshiaki Katayama Database Center for Life Science, Tokyo, Japan
Hiroyuki Mishima Department of Human Genetics, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
Shin Kawano Database Center for Life Science, Tokyo, Japan
Shuichi Kawashima Database Center for Life Science, Tokyo, Japan
Jin-Dong Kim Database Center for Life Science, Tokyo, Japan
Yuki Moriya Database Center for Life Science, Tokyo, Japan
Toshiaki Tokimatsu DDBJ Center, National Institute of Genetics, Mishima, Japan
Atsuko Yamaguchi Database Center for Life Science, Tokyo, Japan
Yasunori Yamamoto Database Center for Life Science, Tokyo, Japan
Hongyan Wu Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Peter Amstutz Curoverse, Somerville, USA
Erick Antezana Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
Nobuyuki P. Aoki Faculty of Science and Engineering, SOKA University, Tokyo, Japan
Kazuharu Arakawa Institute for Advanced Biosciences, Keio University, Tokyo, Japan
Jerven T. Bolleman SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Lausanne, Switzerland
Evan Bolton National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Raoul J. P. Bonnal Istituto Nazionale Genetica Molecolare, Romeo ed Enrica Invernizzi, Milan, Italy
Hidemasa Bono Database Center for Life Science, Tokyo, Japan
Kees Burger Dutch Techcentre for Life Sciences, Utrecht, The Netherlands
Hirokazu Chiba National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Kevin B. Cohen Computational Bioscience Program, University of Colorado School of Medicine, Denver, USA; Université Paris-Saclay, LIMSI, CNRS, Paris, France
Eric W. Deutsch Institute for Systems Biology, Seattle, USA
Jesualdo T. Fernández-Breis Universidad de Murcia, IMIB-Arrixaca, Murcia, Spain
Gang Fu National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Takatomo Fujisawa National Institute of Genetics, Mishima, Japan
Atsushi Fukushima RIKEN Center for Sustainable Resource Science, Yokohama, Japan
Alexander García Polytechnic University of Madrid, Madrid, Spain
Naohisa Goto Research Institute for Microbial Diseases, Osaka University, Osaka, Japan
Tudor Groza St Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Darlinghurst, Australia; Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, Australia
Colin Hercus Novocraft Technologies Sdn. Bhd., Selangor, Malaysia
Robert Hoehndorf Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Kotone Itaya Institute for Advanced Biosciences, Keio University, Tokyo, Japan
Nick Juty European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Takeshi Kawashima National Institute of Genetics, Mishima, Japan
Jee-Hyub Kim European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Akira R. Kinjo Institute for Protein Research, Osaka University, Osaka, Japan
Masaaki Kotera School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
Kouji Kozaki The Institute of Scientific and Industrial Research, Osaka University, Osaka, Japan
Sadahiro Kumagai Hitachi Ltd., Tokyo, Japan
Tatsuya Kushida National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan
Thomas Lütteke Institute of Veterinary Physiology and Biochemistry, Justus-Liebig University Giessen, Giessen, Germany; Gesellschaft für innovative Personalwirtschaftssysteme mbH (GIP GmbH), Offenbach, Germany
Masaaki Matsubara The Noguchi Institute, Tokyo, Japan
Joe Miyamoto National Cancer Center Japan, Tokyo, Japan
Attayeb Mohsen National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
Hiroshi Mori Center for Information Biology, National Institute of Genetics, Mishima, Japan
Yuki Naito Database Center for Life Science, Tokyo, Japan
Takeru Nakazato Database Center for Life Science, Tokyo, Japan
Jeremy Nguyen-Xuan Lawrence Berkeley National Laboratory, Berkeley, USA
Kozo Nishida RIKEN Quantitative Biology Center, Osaka, Japan
Naoki Nishida Department of Systems Science, Osaka University, Osaka, Japan
Hiroyo Nishide National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Soichi Ogishima Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
Tazro Ohta Database Center for Life Science, Tokyo, Japan
Shujiro Okuda Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
Benedict Paten UC Santa Cruz Genomics Institute, University of California, Santa Cruz, USA
Jean-Luc Perret INVENesis, Neuchâtel, Switzerland
Philip Prathipati National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
Pjotr Prins University Medical Center Utrecht, Utrecht, The Netherlands; University of Tennessee Health Science Center, Memphis, USA
Núria Queralt-Rosinach Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
Daisuke Shinmachi Faculty of Science and Engineering, SOKA University, Tokyo, Japan
Shinya Suzuki School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
Tsuyosi Tabata Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Terue Takatsuki RIKEN BioResource Center, Ibaraki, Japan
Kieron Taylor European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Mark Thompson Leiden University Medical Center, Leiden, The Netherlands
Ikuo Uchiyama National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Bruno Vieira WurmLab, School of Biological & Chemical Sciences, Queen Mary University of London, London, UK
Chih-Hsuan Wei National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Mark Wilkinson Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, Madrid, Spain
Issaku Yamada The Noguchi Institute, Tokyo, Japan
Ryota Yamanaka Oracle Corporation, Tokyo, Japan
Kazutoshi Yoshitake Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
Akiyasu C. Yoshizawa Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Michel Dumontier Institute of Data Science, Maastricht University, Maastricht, The Netherlands
Kenjiro Kosaki Center for Medical Genetics, Keio University School of Medicine, Tokyo, Japan
Toshihisa Takagi National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan; Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan

Semantic Publication of Agricultural Scientific Literature Using Property Graphs. Applied Sciences-Basel 2020. [DOI: 10.3390/app10030861] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 11/17/2022]

Conford B, Almsaeed A, Buehler S, Childers CP, Ficklin SP, Staton ME, Poelchau MF. Tripal EUtils: a Tripal module to increase exchange and reuse of genome assembly metadata. Database-The Journal of Biological Databases and Curation 2020;2019:5709695. [PMID: 31960040 DOI: 10.1093/database/baz143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/02/2019] [Revised: 11/04/2019] [Accepted: 11/17/2019] [Indexed: 11/13/2022]

Kafkas Ş, Abdelhakim M, Hashish Y, Kulmanov M, Abdellatif M, Schofield PN, Hoehndorf R. PathoPhenoDB, linking human pathogens to their phenotypes in support of infectious disease research. Sci Data 2019;6:79. [PMID: 31160594 PMCID: PMC6546783 DOI: 10.1038/s41597-019-0090-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 12/10/2018] [Accepted: 05/07/2019] [Indexed: 12/11/2022]

Sood AJ, Viner C, Hoffman MM. DNAmod: the DNA modification database. J Cheminform 2019;11:30. [PMID: 31016417 PMCID: PMC6478773 DOI: 10.1186/s13321-019-0349-4] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Received: 09/18/2018] [Accepted: 03/25/2019] [Indexed: 11/10/2022]

Reed SK, Dumontier M. Adding Cognition to the Semanticscience Integrated Ontology. ACTA ACUST UNITED AC 2019. [DOI: 10.33805/2638.8073.116] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 12/14/2022]

Katayama T, Kawashima S, Okamoto S, Moriya Y, Chiba H, Naito Y, Fujisawa T, Mori H, Takagi T. TogoGenome/TogoStanza: modularized Semantic Web genome database. Database-The Journal of Biological Databases and Curation 2019;2019:5277251. [PMID: 30624651 PMCID: PMC6323299 DOI: 10.1093/database/bay132] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 07/08/2018] [Accepted: 11/26/2018] [Indexed: 11/12/2022]

Jonquet C, Toulet A, Dutta B, Emonet V. Harnessing the Power of Unified Metadata in an Ontology Repository: The Case of AgroPortal. Journal on Data Semantics 2018. [DOI: 10.1007/s13740-018-0091-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Indexed: 11/24/2022]

Thompson P, Daikou S, Ueno K, Batista-Navarro R, Tsujii J, Ananiadou S. Annotation and detection of drug effects in text for pharmacovigilance. J Cheminform 2018;10:37. [PMID: 30105604 PMCID: PMC6089860 DOI: 10.1186/s13321-018-0290-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 01/12/2018] [Accepted: 07/20/2018] [Indexed: 02/02/2023]

Abstract

Pharmacovigilance (PV) databases record the benefits and risks of different drugs, as a means to ensure their safe and effective use. Creating and maintaining such resources can be complex, since a particular medication may have divergent effects in different individuals, due to specific patient characteristics and/or interactions with other drugs being administered. Textual information from various sources can provide important evidence to curators of PV databases about the usage and effects of drug targets in different medical subjects. However, the efficient identification of relevant evidence can be challenging, due to the increasing volume of textual data. Text mining (TM) techniques can support curators by automatically detecting complex information, such as interactions between drugs, diseases and adverse effects. This semantic information supports the quick identification of documents containing information of interest (e.g., the different types of patients in which a given adverse drug reaction has been observed to occur). TM tools are typically adapted to different domains by applying machine learning methods to corpora that are manually labelled by domain experts using annotation guidelines to ensure consistency. We present a semantically annotated corpus of 597 MEDLINE abstracts, PHAEDRA, encoding rich information on drug effects and their interactions, whose quality is assured through the use of detailed annotation guidelines and the demonstration of high levels of inter-annotator agreement (e.g., 92.6% F-Score for identifying named entities and 78.4% F-Score for identifying complex events, when relaxed matching criteria are applied). To our knowledge, the corpus is unique in the domain of PV, according to the level of detail of its annotations. To illustrate the utility of the corpus, we have trained TM tools based on its rich labels to recognise drug effects in text automatically. 
The corpus and annotation guidelines are available at: http://www.nactem.ac.uk/PHAEDRA/.

Brandizi M, Singh A, Rawlings C, Hassani-Pak K. Towards FAIRer Biological Knowledge Networks Using a Hybrid Linked Data and Graph Database Approach. J Integr Bioinform 2018;15. [PMID: 30085931 PMCID: PMC6340125 DOI: 10.1515/jib-2018-0023] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 03/16/2018] [Accepted: 06/07/2018] [Indexed: 01/01/2023]

Hu W, Qiu H, Huang J, Dumontier M. BioSearch: a semantic search engine for Bio2RDF. Database-The Journal of Biological Databases and Curation 2018;2017:4079799. [PMID: 29220451 PMCID: PMC5569678 DOI: 10.1093/database/bax059] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 11/28/2016] [Accepted: 07/10/2017] [Indexed: 12/14/2022]

Combining Physical, Virtual, and Mental Actions and Objects. Educational Psychology Review 2018. [DOI: 10.1007/s10648-018-9441-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Indexed: 12/20/2022]

Tang A, Tam R, Cadrin-Chênevert A, Guest W, Chong J, Barfett J, Chepelev L, Cairns R, Mitchell JR, Cicero MD, Poudrette MG, Jaremko JL, Reinhold C, Gallix B, Gray B, Geis R, O'Connell T, Babyn P, Koff D, Ferguson D, Derkatch S, Bilbily A, Shabana W. Canadian Association of Radiologists White Paper on Artificial Intelligence in Radiology. Can Assoc Radiol J 2018;69:120-135. [DOI: 10.1016/j.carj.2018.02.002] [Citation(s) in RCA: 238] [Impact Index Per Article: 39.7] [Received: 02/08/2018] [Accepted: 02/13/2018] [Indexed: 02/07/2023]

Affiliation(s)

An Tang Department of Radiology, Université de Montréal, Montréal, Québec, Canada; Centre de recherche du Centre hospitalier de l'Université de Montréal, Montréal, Québec, Canada
Roger Tam Department of Radiology, University of British Columbia, Vancouver, British Columbia, Canada; School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada
Alexandre Cadrin-Chênevert Department of Medical Imaging, CISSS Lanaudière, Université Laval, Joliette, Québec, Canada
Will Guest Department of Radiology, University of British Columbia, Vancouver, British Columbia, Canada
Jaron Chong Department of Radiology, McGill University Health Center, Montréal, Québec, Canada
Joseph Barfett Department of Medical Imaging, St. Michael's Hospital, University of Toronto, Toronto, Ontario, Canada
Leonid Chepelev Department of Radiology, University of Ottawa, Ottawa, Ontario, Canada
Robyn Cairns Department of Radiology, British Columbia's Children's Hospital, University of British Columbia, Vancouver, British Columbia, Canada
J. Ross Mitchell Department of Research, Mayo Clinic, Phoenix, Arizona, USA
Mark D. Cicero Department of Medical Imaging, St. Michael's Hospital, University of Toronto, Toronto, Ontario, Canada
Manuel Gaudreau Poudrette Department of Radiology, Université de Sherbrooke, Sherbrooke, Québec, Canada
Jacob L. Jaremko Department of Radiology and Diagnostic Imaging, University of Alberta, Edmonton, Alberta, Canada
Caroline Reinhold Department of Radiology, McGill University Health Center, Montréal, Québec, Canada
Benoit Gallix Department of Radiology, McGill University Health Center, Montréal, Québec, Canada
Bruce Gray Department of Medical Imaging, St. Michael's Hospital, University of Toronto, Toronto, Ontario, Canada
Raym Geis Department of Radiology, National Jewish Health, Denver, Colorado, USA
Timothy O'Connell
Paul Babyn
David Koff
Darren Ferguson
Sheldon Derkatch
Alexander Bilbily
Wael Shabana

Opinion: Why we need a centralized repository for isotopic data. Proc Natl Acad Sci U S A 2018;114:2997-3001. [PMID: 28325883 DOI: 10.1073/pnas.1701742114] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Indexed: 11/18/2022]