Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chen C, Huang H, Ross KE, Cowart JE, Arighi CN, Wu CH, Natale DA. Protein ontology on the semantic web for knowledge discovery. Sci Data 2020;7:337. [PMID: 33046717 DOI: 10.1038/s41597-020-00679-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 09/17/2020] [Indexed: 11/26/2022] Open

For:	Chen C, Huang H, Ross KE, Cowart JE, Arighi CN, Wu CH, Natale DA. Protein ontology on the semantic web for knowledge discovery. Sci Data 2020;7:337. [PMID: 33046717 DOI: 10.1038/s41597-020-00679-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 09/17/2020] [Indexed: 11/26/2022] Open

Number

Cited by Other Article(s)

Santangelo BE, Apgar M, Colorado ASB, Martin CG, Sterrett J, Wall E, Joachimiak MP, Hunter LE, Lozupone CA. Integrating biological knowledge for mechanistic inference in the host-associated microbiome. Front Microbiol 2024;15:1351678. [PMID: 38638909 PMCID: PMC11024261 DOI: 10.3389/fmicb.2024.1351678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 02/26/2024] [Indexed: 04/20/2024] Open

Alqaissi E, Alotaibi F, Ramzan MS. Graph data science and machine learning for the detection of COVID-19 infection from symptoms. PeerJ Comput Sci 2023;9:e1333. [PMID: 37346701 PMCID: PMC10280642 DOI: 10.7717/peerj-cs.1333] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 03/16/2023] [Indexed: 06/23/2023]

Abstract

Background

COVID-19 is an infectious disease caused by SARS-CoV-2. The symptoms of COVID-19 vary from mild-to-moderate respiratory illnesses, and it sometimes requires urgent medication. Therefore, it is crucial to detect COVID-19 at an early stage through specific clinical tests, testing kits, and medical devices. However, these tests are not always available during the time of the pandemic. Therefore, this study developed an automatic, intelligent, rapid, and real-time diagnostic model for the early detection of COVID-19 based on its symptoms.

Methods

The COVID-19 knowledge graph (KG) constructed based on literature from heterogeneous data is imported to understand the COVID-19 different relations. We added human disease ontology to the COVID-19 KG and applied a node-embedding graph algorithm called fast random projection to extract an extra feature from the COVID-19 dataset. Subsequently, experiments were conducted using two machine learning (ML) pipelines to predict COVID-19 infection from its symptoms. Additionally, automatic tuning of the model hyperparameters was adopted.

Results

We compared two graph-based ML models, logistic regression (LR) and random forest (RF) models. The proposed graph-based RF model achieved a small error rate = 0.0064 and the best scores on all performance metrics, including specificity = 98.71%, accuracy = 99.36%, precision = 99.65%, recall = 99.53%, and F1-score = 99.59%. Furthermore, the Matthews correlation coefficient achieved by the RF model was higher than that of the LR model. Comparative analysis with other ML algorithms and with studies from the literature showed that the proposed RF model exhibited the best detection accuracy.

Conclusion

The graph-based RF model registered high performance in classifying the symptoms of COVID-19 infection, thereby indicating that the graph data science, in conjunction with ML techniques, helps improve performance and accelerate innovations.

Collapse

Wood EC, Glen AK, Kvarfordt LG, Womack F, Acevedo L, Yoon TS, Ma C, Flores V, Sinha M, Chodpathumwan Y, Termehchy A, Roach JC, Mendoza L, Hoffman AS, Deutsch EW, Koslicki D, Ramsey SA. RTX-KG2: a system for building a semantically standardized knowledge graph for translational biomedicine. BMC Bioinformatics 2022;23:400. [PMID: 36175836 PMCID: PMC9520835 DOI: 10.1186/s12859-022-04932-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 09/14/2022] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Biomedical translational science is increasingly using computational reasoning on repositories of structured knowledge (such as UMLS, SemMedDB, ChEMBL, Reactome, DrugBank, and SMPDB in order to facilitate discovery of new therapeutic targets and modalities. The NCATS Biomedical Data Translator project is working to federate autonomous reasoning agents and knowledge providers within a distributed system for answering translational questions. Within that project and the broader field, there is a need for a framework that can efficiently and reproducibly build an integrated, standards-compliant, and comprehensive biomedical knowledge graph that can be downloaded in standard serialized form or queried via a public application programming interface (API).

RESULTS

To create a knowledge provider system within the Translator project, we have developed RTX-KG2, an open-source software system for building-and hosting a web API for querying-a biomedical knowledge graph that uses an Extract-Transform-Load approach to integrate 70 knowledge sources (including the aforementioned core six sources) into a knowledge graph with provenance information including (where available) citations. The semantic layer and schema for RTX-KG2 follow the standard Biolink model to maximize interoperability. RTX-KG2 is currently being used by multiple Translator reasoning agents, both in its downloadable form and via its SmartAPI-registered interface. Serializations of RTX-KG2 are available for download in both the pre-canonicalized form and in canonicalized form (in which synonyms are merged). The current canonicalized version (KG2.7.3) of RTX-KG2 contains 6.4M nodes and 39.3M edges with a hierarchy of 77 relationship types from Biolink.

CONCLUSION

RTX-KG2 is the first knowledge graph that integrates UMLS, SemMedDB, ChEMBL, DrugBank, Reactome, SMPDB, and 64 additional knowledge sources within a knowledge graph that conforms to the Biolink standard for its semantic layer and schema. RTX-KG2 is publicly available for querying via its API at arax.rtx.ai/api/rtxkg2/v1.2/openapi.json . The code to build RTX-KG2 is publicly available at github:RTXteam/RTX-KG2 .

Collapse

Affiliation(s)

E C Wood School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Amy K Glen School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA.
Lindsey G Kvarfordt School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Finn Womack Computer Science and Engineering, Penn State University, State College, PA, USA
Liliana Acevedo School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Timothy S Yoon School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Chunyu Ma Huck Institutes of the Life Sciences, Penn State University, State College, PA, USA
Veronica Flores School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Meghamala Sinha School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Yodsawalai Chodpathumwan King Mongkut's University of Technology North Bangkok, Bangkok, Thailand
Arash Termehchy School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA
Jared C Roach Institute for Systems Biology, Seattle, WA, USA
Luis Mendoza Institute for Systems Biology, Seattle, WA, USA
Andrew S Hoffman Interdisciplinary Hub for Digitalization and Society, Radboud University, Nijmegen, The Netherlands
Eric W Deutsch Institute for Systems Biology, Seattle, WA, USA
David Koslicki Computer Science and Engineering, Penn State University, State College, PA, USA Huck Institutes of the Life Sciences, Penn State University, State College, PA, USA Department of Biology, Penn State University, State College, PA, USA
Stephen A Ramsey School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA Department of Biomedical Sciences, Oregon State University, Corvallis, OR, USA

Collapse

Rodriguez-Esteban R, Duarte J, Teixeira PC, Richard F, Koltsova S, So WV. Prediction of standard cell types and functional markers from textual descriptions of flow cytometry gating definitions using machine learning. CYTOMETRY. PART B, CLINICAL CYTOMETRY 2022;102:220-227. [PMID: 35253974 DOI: 10.1002/cyto.b.22065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 02/02/2022] [Accepted: 02/28/2022] [Indexed: 06/14/2023]

Chen C, Ross KE, Gavali S, Cowart JE, Wu CH. COVID-19 knowledge graph from semantic integration of biomedical literature and databases. Bioinformatics 2021;37:4597-4598. [PMID: 34613368 PMCID: PMC8513397 DOI: 10.1093/bioinformatics/btab694] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2021] [Revised: 09/26/2021] [Accepted: 10/04/2021] [Indexed: 11/12/2022] Open