1
|
Verma G, Rebholz-Schuhmann D, Madden MG. Enabling personalised disease diagnosis by combining a patient's time-specific gene expression profile with a biomedical knowledge base. BMC Bioinformatics 2024; 25:62. [PMID: 38326757 PMCID: PMC10848462 DOI: 10.1186/s12859-024-05674-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 01/25/2024] [Indexed: 02/09/2024] Open
Abstract
BACKGROUND Recent developments in the domain of biomedical knowledge bases (KBs) open up new ways to exploit biomedical knowledge that is available in the form of KBs. Significant work has been done in the direction of biomedical KB creation and KB completion, specifically, those having gene-disease associations and other related entities. However, the use of such biomedical KBs in combination with patients' temporal clinical data still largely remains unexplored, but has the potential to immensely benefit medical diagnostic decision support systems. RESULTS We propose two new algorithms, LOADDx and SCADDx, to combine a patient's gene expression data with gene-disease association and other related information available in the form of a KB, to assist personalized disease diagnosis. We have tested both of the algorithms on two KBs and on four real-world gene expression datasets of respiratory viral infection caused by Influenza-like viruses of 19 subtypes. We also compare the performance of proposed algorithms with that of five existing state-of-the-art machine learning algorithms (k-NN, Random Forest, XGBoost, Linear SVM, and SVM with RBF Kernel) using two validation approaches: LOOCV and a single internal validation set. Both SCADDx and LOADDx outperform the existing algorithms when evaluated with both validation approaches. SCADDx is able to detect infections with up to 100% accuracy in the cases of Datasets 2 and 3. Overall, SCADDx and LOADDx are able to detect an infection within 72 h of infection with 91.38% and 92.66% average accuracy respectively considering all four datasets, whereas XGBoost, which performed best among the existing machine learning algorithms, can detect the infection with only 86.43% accuracy on an average. CONCLUSIONS We demonstrate how our novel idea of using the most and least differentially expressed genes in combination with a KB can enable identification of the diseases that a patient is most likely to have at a particular time, from a KB with thousands of diseases. Moreover, the proposed algorithms can provide a short ranked list of the most likely diseases for each patient along with their most affected genes, and other entities linked with them in the KB, which can support health care professionals in their decision-making.
Collapse
Affiliation(s)
- Ghanshyam Verma
- Insight Centre for Data Analytics, School of Computer Science, University of Galway, Galway, Ireland.
- School of Computer Science, University of Galway, Galway, Ireland.
| | | | - Michael G Madden
- Insight Centre for Data Analytics, School of Computer Science, University of Galway, Galway, Ireland
- School of Computer Science, University of Galway, Galway, Ireland
| |
Collapse
|
2
|
Gardinassi LG, Souza COS, Sales-Campos H, Fonseca SG. Immune and Metabolic Signatures of COVID-19 Revealed by Transcriptomics Data Reuse. Front Immunol 2020; 11:1636. [PMID: 32670298 PMCID: PMC7332781 DOI: 10.3389/fimmu.2020.01636] [Citation(s) in RCA: 76] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 06/18/2020] [Indexed: 12/21/2022] Open
Abstract
The current pandemic of coronavirus disease 19 (COVID-19) has affected millions of individuals and caused thousands of deaths worldwide. The pathophysiology of the disease is complex and mostly unknown. Therefore, identifying the molecular mechanisms that promote progression of the disease is critical to overcome this pandemic. To address such issues, recent studies have reported transcriptomic profiles of cells, tissues and fluids from COVID-19 patients that mainly demonstrated activation of humoral immunity, dysregulated type I and III interferon expression, intense innate immune responses and inflammatory signaling. Here, we provide novel perspectives on the pathophysiology of COVID-19 using robust functional approaches to analyze public transcriptome datasets. In addition, we compared the transcriptional signature of COVID-19 patients with individuals infected with SARS-CoV-1 and Influenza A (IAV) viruses. We identified a core transcriptional signature induced by the respiratory viruses in peripheral leukocytes, whereas the absence of significant type I interferon/antiviral responses characterized SARS-CoV-2 infection. We also identified the higher expression of genes involved in metabolic pathways including heme biosynthesis, oxidative phosphorylation and tryptophan metabolism. A BTM-driven meta-analysis of bronchoalveolar lavage fluid (BALF) from COVID-19 patients showed significant enrichment for neutrophils and chemokines, which were also significant in data from lung tissue of one deceased COVID-19 patient. Importantly, our results indicate higher expression of genes related to oxidative phosphorylation both in peripheral mononuclear leukocytes and BALF, suggesting a critical role for mitochondrial activity during SARS-CoV-2 infection. Collectively, these data point for immunopathological features and targets that can be therapeutically exploited to control COVID-19.
Collapse
Affiliation(s)
- Luiz G. Gardinassi
- Departamento de Biociências e Tecnologia, Instituto de Patologia Tropical e Saúde Pública, Universidade Federal de Goiás, Goiânia, Brazil
| | - Camila O. S. Souza
- Departamento de Análises Clínicas, Toxicológicas e Bromatológicas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Helioswilton Sales-Campos
- Departamento de Biociências e Tecnologia, Instituto de Patologia Tropical e Saúde Pública, Universidade Federal de Goiás, Goiânia, Brazil
| | - Simone G. Fonseca
- Departamento de Biociências e Tecnologia, Instituto de Patologia Tropical e Saúde Pública, Universidade Federal de Goiás, Goiânia, Brazil
| |
Collapse
|
3
|
Fourati S, Talla A, Mahmoudian M, Burkhart JG, Klén R, Henao R, Yu T, Aydın Z, Yeung KY, Ahsen ME, Almugbel R, Jahandideh S, Liang X, Nordling TEM, Shiga M, Stanescu A, Vogel R, Pandey G, Chiu C, McClain MT, Woods CW, Ginsburg GS, Elo LL, Tsalik EL, Mangravite LM, Sieberts SK. A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection. Nat Commun 2018; 9:4418. [PMID: 30356117 PMCID: PMC6200745 DOI: 10.1038/s41467-018-06735-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Accepted: 09/12/2018] [Indexed: 01/17/2023] Open
Abstract
The response to respiratory viruses varies substantially between individuals, and there are currently no known molecular predictors from the early stages of infection. Here we conduct a community-based analysis to determine whether pre- or early post-exposure molecular factors could predict physiologic responses to viral exposure. Using peripheral blood gene expression profiles collected from healthy subjects prior to exposure to one of four respiratory viruses (H1N1, H3N2, Rhinovirus, and RSV), as well as up to 24 h following exposure, we find that it is possible to construct models predictive of symptomatic response using profiles even prior to viral exposure. Analysis of predictive gene features reveal little overlap among models; however, in aggregate, these genes are enriched for common pathways. Heme metabolism, the most significantly enriched pathway, is associated with a higher risk of developing symptoms following viral exposure. This study demonstrates that pre-exposure molecular predictors can be identified and improves our understanding of the mechanisms of response to respiratory viruses.
Collapse
Affiliation(s)
- Slim Fourati
- Department of Pathology, School of Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Aarthi Talla
- Department of Pathology, School of Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Mehrad Mahmoudian
- Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland
- Department of Future Technologies, University of Turku, FI-20014 Turku, Finland
| | - Joshua G Burkhart
- Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University, Portland, OR, 97239, USA
- Laboratory of Evolutionary Genetics, Institute of Ecology and Evolution, University of Oregon, Eugene, OR, 97403, USA
| | - Riku Klén
- Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland
| | - Ricardo Henao
- Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27708, USA
| | - Thomas Yu
- Sage Bionetworks, Seattle, WA, 98121, USA
| | - Zafer Aydın
- Department of Computer Engineering, Abdullah Gul University, Kayseri, 38080, Turkey
| | - Ka Yee Yeung
- School of Engineering and Technology, University of Washington Tacoma, Tacoma, WA, 98402, USA
| | - Mehmet Eren Ahsen
- Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Reem Almugbel
- School of Engineering and Technology, University of Washington Tacoma, Tacoma, WA, 98402, USA
| | | | - Xiao Liang
- School of Engineering and Technology, University of Washington Tacoma, Tacoma, WA, 98402, USA
| | - Torbjörn E M Nordling
- Department of Mechanical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| | - Motoki Shiga
- Department of Electrical, Electronic and Computer Engineering, Faculty of Engineering, Gifu University, Gifu, 501-1193, Japan
| | - Ana Stanescu
- Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Department of Computer Science, University of West Georgia, Carrolton, GA, 30116, USA
| | - Robert Vogel
- Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- IBM T.J. Watson Research Center, Yorktown Heights, NY, 10598, USA
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Christopher Chiu
- Section of Infectious Diseases and Immunity, Imperial College London, London, W12 0NN, UK
| | - Micah T McClain
- Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
- Medical Service, Durham VA Health Care System, Durham, NC, 27705, USA
- Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Christopher W Woods
- Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
- Medical Service, Durham VA Health Care System, Durham, NC, 27705, USA
- Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Geoffrey S Ginsburg
- Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Laura L Elo
- Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland
| | - Ephraim L Tsalik
- Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
- Emergency Medicine Service, Durham VA Health Care System, Durham, NC, 27705, USA
| | | | | |
Collapse
|
4
|
Bongen E, Vallania F, Utz PJ, Khatri P. KLRD1-expressing natural killer cells predict influenza susceptibility. Genome Med 2018; 10:45. [PMID: 29898768 PMCID: PMC6001128 DOI: 10.1186/s13073-018-0554-1] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 05/24/2018] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND Influenza infects tens of millions of people every year in the USA. Other than notable risk groups, such as children and the elderly, it is difficult to predict what subpopulations are at higher risk of infection. Viral challenge studies, where healthy human volunteers are inoculated with live influenza virus, provide a unique opportunity to study infection susceptibility. Biomarkers predicting influenza susceptibility would be useful for identifying risk groups and designing vaccines. METHODS We applied cell mixture deconvolution to estimate immune cell proportions from whole blood transcriptome data in four independent influenza challenge studies. We compared immune cell proportions in the blood between symptomatic shedders and asymptomatic nonshedders across three discovery cohorts prior to influenza inoculation and tested results in a held-out validation challenge cohort. RESULTS Natural killer (NK) cells were significantly lower in symptomatic shedders at baseline in both discovery and validation cohorts. Hematopoietic stem and progenitor cells (HSPCs) were higher in symptomatic shedders at baseline in discovery cohorts. Although the HSPCs were higher in symptomatic shedders in the validation cohort, the increase was statistically nonsignificant. We observed that a gene associated with NK cells, KLRD1, which encodes CD94, was expressed at lower levels in symptomatic shedders at baseline in discovery and validation cohorts. KLRD1 expression in the blood at baseline negatively correlated with influenza infection symptom severity. KLRD1 expression 8 h post-infection in the nasal epithelium from a rhinovirus challenge study also negatively correlated with symptom severity. CONCLUSIONS We identified KLRD1-expressing NK cells as a potential biomarker for influenza susceptibility. Expression of KLRD1 was inversely correlated with symptom severity. Our results support a model where an early response by KLRD1-expressing NK cells may control influenza infection.
Collapse
Affiliation(s)
- Erika Bongen
- Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305 USA
- Program in Immunology, Stanford University School of Medicine, Stanford, 94305 CA USA
| | - Francesco Vallania
- Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305 USA
- Department of Medicine, Division of Biomedical Informatics Research, Stanford University School of Medicine, Stanford, CA 94305 USA
| | - Paul J. Utz
- Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305 USA
- Program in Immunology, Stanford University School of Medicine, Stanford, 94305 CA USA
- Department of Medicine, Division of Immunology and Rheumatology, Stanford University School of Medicine, Stanford, CA 94305 USA
| | - Purvesh Khatri
- Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305 USA
- Program in Immunology, Stanford University School of Medicine, Stanford, 94305 CA USA
- Department of Medicine, Division of Biomedical Informatics Research, Stanford University School of Medicine, Stanford, CA 94305 USA
| |
Collapse
|
5
|
Barton AJ, Hill J, Pollard AJ, Blohmke CJ. Transcriptomics in Human Challenge Models. Front Immunol 2017; 8:1839. [PMID: 29326715 PMCID: PMC5741696 DOI: 10.3389/fimmu.2017.01839] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 12/05/2017] [Indexed: 12/22/2022] Open
Abstract
Human challenge models, in which volunteers are experimentally infected with a pathogen of interest, provide the opportunity to directly identify both natural and vaccine-induced correlates of protection. In this review, we highlight how the application of transcriptomics to human challenge studies allows for the identification of novel correlates and gives insight into the immunological pathways required to develop functional immunity. In malaria challenge trials for example, innate immune pathways appear to play a previously underappreciated role in conferring protective immunity. Transcriptomic analyses of samples obtained in human challenge studies can also deepen our understanding of the immune responses preceding symptom onset, allowing characterization of innate immunity and early gene signatures, which may influence disease outcome. Influenza challenge studies demonstrate that these gene signatures have diagnostic potential in the context of pandemics, in which presymptomatic diagnosis of at-risk individuals could allow early initiation of antiviral treatment and help limit transmission. Furthermore, gene expression analysis facilitates the identification of host factors contributing to disease susceptibility, such as C4BPA expression in enterotoxigenic Escherichia coli infection. Overall, these studies highlight the exceptional value of transcriptional data generated in human challenge trials and illustrate the broad impact molecular data analysis may have on global health through rational vaccine design and biomarker discovery.
Collapse
Affiliation(s)
- Amber J Barton
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, United Kingdom
| | - Jennifer Hill
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, United Kingdom
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, United Kingdom
| | - Christoph J Blohmke
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, United Kingdom
| |
Collapse
|