1
|
Odriozola I, Rasmussen JA, Gilbert MTP, Limborg MT, Alberdi A. A practical introduction to holo-omics. CELL REPORTS METHODS 2024; 4:100820. [PMID: 38986611 PMCID: PMC11294832 DOI: 10.1016/j.crmeth.2024.100820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 04/17/2024] [Accepted: 06/20/2024] [Indexed: 07/12/2024]
Abstract
Holo-omics refers to the joint study of non-targeted molecular data layers from host-microbiota systems or holobionts, which is increasingly employed to disentangle the complex interactions between the elements that compose them. We navigate through the generation, analysis, and integration of omics data, focusing on the commonalities and main differences to generate and analyze the various types of omics, with a special focus on optimizing data generation and integration. We advocate for careful generation and distillation of data, followed by independent exploration and analyses of the single omic layers to obtain a better understanding of the study system, before the integration of multiple omic layers in a final model is attempted. We highlight critical decision points to achieve this aim and flag the main challenges to address complex biological questions regarding the integrative study of host-microbiota relationships.
Collapse
Affiliation(s)
- Iñaki Odriozola
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Jacob A Rasmussen
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark; University Museum, NTNU, Trondheim, Norway
| | - Morten T Limborg
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Antton Alberdi
- Center for Evolutionary Hologenomics, Globe Institute, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
2
|
Mohr AE, Ortega-Santos CP, Whisner CM, Klein-Seetharaman J, Jasbi P. Navigating Challenges and Opportunities in Multi-Omics Integration for Personalized Healthcare. Biomedicines 2024; 12:1496. [PMID: 39062068 PMCID: PMC11274472 DOI: 10.3390/biomedicines12071496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/25/2024] [Accepted: 06/28/2024] [Indexed: 07/28/2024] Open
Abstract
The field of multi-omics has witnessed unprecedented growth, converging multiple scientific disciplines and technological advances. This surge is evidenced by a more than doubling in multi-omics scientific publications within just two years (2022-2023) since its first referenced mention in 2002, as indexed by the National Library of Medicine. This emerging field has demonstrated its capability to provide comprehensive insights into complex biological systems, representing a transformative force in health diagnostics and therapeutic strategies. However, several challenges are evident when merging varied omics data sets and methodologies, interpreting vast data dimensions, streamlining longitudinal sampling and analysis, and addressing the ethical implications of managing sensitive health information. This review evaluates these challenges while spotlighting pivotal milestones: the development of targeted sampling methods, the use of artificial intelligence in formulating health indices, the integration of sophisticated n-of-1 statistical models such as digital twins, and the incorporation of blockchain technology for heightened data security. For multi-omics to truly revolutionize healthcare, it demands rigorous validation, tangible real-world applications, and smooth integration into existing healthcare infrastructures. It is imperative to address ethical dilemmas, paving the way for the realization of a future steered by omics-informed personalized medicine.
Collapse
Affiliation(s)
- Alex E. Mohr
- Systems Precision Engineering and Advanced Research (SPEAR), Theriome Inc., Phoenix, AZ 85004, USA; (A.E.M.); (C.P.O.-S.); (C.M.W.); (J.K.-S.)
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Biodesign Institute Center for Health Through Microbiomes, Arizona State University, Tempe, AZ 85281, USA
| | - Carmen P. Ortega-Santos
- Systems Precision Engineering and Advanced Research (SPEAR), Theriome Inc., Phoenix, AZ 85004, USA; (A.E.M.); (C.P.O.-S.); (C.M.W.); (J.K.-S.)
- Department of Exercise and Nutrition Sciences, Milken Institute School of Public Health, George Washington University, Washington, DC 20052, USA
| | - Corrie M. Whisner
- Systems Precision Engineering and Advanced Research (SPEAR), Theriome Inc., Phoenix, AZ 85004, USA; (A.E.M.); (C.P.O.-S.); (C.M.W.); (J.K.-S.)
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Biodesign Institute Center for Health Through Microbiomes, Arizona State University, Tempe, AZ 85281, USA
| | - Judith Klein-Seetharaman
- Systems Precision Engineering and Advanced Research (SPEAR), Theriome Inc., Phoenix, AZ 85004, USA; (A.E.M.); (C.P.O.-S.); (C.M.W.); (J.K.-S.)
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- School of Molecular Sciences, Arizona State University, Tempe, AZ 85281, USA
| | - Paniz Jasbi
- Systems Precision Engineering and Advanced Research (SPEAR), Theriome Inc., Phoenix, AZ 85004, USA; (A.E.M.); (C.P.O.-S.); (C.M.W.); (J.K.-S.)
| |
Collapse
|
3
|
Abbasi AF, Asim MN, Ahmed S, Vollmer S, Dengel A. Survival prediction landscape: an in-depth systematic literature review on activities, methods, tools, diseases, and databases. Front Artif Intell 2024; 7:1428501. [PMID: 39021434 PMCID: PMC11252047 DOI: 10.3389/frai.2024.1428501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 06/12/2024] [Indexed: 07/20/2024] Open
Abstract
Survival prediction integrates patient-specific molecular information and clinical signatures to forecast the anticipated time of an event, such as recurrence, death, or disease progression. Survival prediction proves valuable in guiding treatment decisions, optimizing resource allocation, and interventions of precision medicine. The wide range of diseases, the existence of various variants within the same disease, and the reliance on available data necessitate disease-specific computational survival predictors. The widespread adoption of artificial intelligence (AI) methods in crafting survival predictors has undoubtedly revolutionized this field. However, the ever-increasing demand for more sophisticated and effective prediction models necessitates the continued creation of innovative advancements. To catalyze these advancements, it is crucial to bring existing survival predictors knowledge and insights into a centralized platform. The paper in hand thoroughly examines 23 existing review studies and provides a concise overview of their scope and limitations. Focusing on a comprehensive set of 90 most recent survival predictors across 44 diverse diseases, it delves into insights of diverse types of methods that are used in the development of disease-specific predictors. This exhaustive analysis encompasses the utilized data modalities along with a detailed analysis of subsets of clinical features, feature engineering methods, and the specific statistical, machine or deep learning approaches that have been employed. It also provides insights about survival prediction data sources, open-source predictors, and survival prediction frameworks.
Collapse
Affiliation(s)
- Ahtisham Fazeel Abbasi
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, Germany
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Muhammad Nabeel Asim
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Sheraz Ahmed
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Sebastian Vollmer
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, Germany
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Andreas Dengel
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, Germany
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| |
Collapse
|
4
|
Ozaki Y, Broughton P, Abdollahi H, Valafar H, Blenda AV. Integrating Omics Data and AI for Cancer Diagnosis and Prognosis. Cancers (Basel) 2024; 16:2448. [PMID: 39001510 PMCID: PMC11240413 DOI: 10.3390/cancers16132448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 06/27/2024] [Accepted: 07/01/2024] [Indexed: 07/16/2024] Open
Abstract
Cancer is one of the leading causes of death, making timely diagnosis and prognosis very important. Utilization of AI (artificial intelligence) enables providers to organize and process patient data in a way that can lead to better overall outcomes. This review paper aims to look at the varying uses of AI for diagnosis and prognosis and clinical utility. PubMed and EBSCO databases were utilized for finding publications from 1 January 2020 to 22 December 2023. Articles were collected using key search terms such as "artificial intelligence" and "machine learning." Included in the collection were studies of the application of AI in determining cancer diagnosis and prognosis using multi-omics data, radiomics, pathomics, and clinical and laboratory data. The resulting 89 studies were categorized into eight sections based on the type of data utilized and then further subdivided into two subsections focusing on cancer diagnosis and prognosis, respectively. Eight studies integrated more than one form of omics, namely genomics, transcriptomics, epigenomics, and proteomics. Incorporating AI into cancer diagnosis and prognosis alongside omics and clinical data represents a significant advancement. Given the considerable potential of AI in this domain, ongoing prospective studies are essential to enhance algorithm interpretability and to ensure safe clinical integration.
Collapse
Affiliation(s)
- Yousaku Ozaki
- Department of Biomedical Sciences, University of South Carolina School of Medicine Greenville, Greenville, SC 29605, USA; (Y.O.); (P.B.)
| | - Phil Broughton
- Department of Biomedical Sciences, University of South Carolina School of Medicine Greenville, Greenville, SC 29605, USA; (Y.O.); (P.B.)
| | - Hamed Abdollahi
- Department of Computer Science and Engineering, Molinaroli College of Engineering and Computing, Columbia, SC 29208, USA;
| | - Homayoun Valafar
- Department of Computer Science and Engineering, Molinaroli College of Engineering and Computing, Columbia, SC 29208, USA;
| | - Anna V. Blenda
- Department of Biomedical Sciences, University of South Carolina School of Medicine Greenville, Greenville, SC 29605, USA; (Y.O.); (P.B.)
- Prisma Health Cancer Institute, Prisma Health, Greenville, SC 29605, USA
| |
Collapse
|
5
|
Wang FA, Zhuang Z, Gao F, He R, Zhang S, Wang L, Liu J, Li Y. TMO-Net: an explainable pretrained multi-omics model for multi-task learning in oncology. Genome Biol 2024; 25:149. [PMID: 38845006 PMCID: PMC11157742 DOI: 10.1186/s13059-024-03293-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/29/2024] [Indexed: 06/09/2024] Open
Abstract
Cancer is a complex disease composing systemic alterations in multiple scales. In this study, we develop the Tumor Multi-Omics pre-trained Network (TMO-Net) that integrates multi-omics pan-cancer datasets for model pre-training, facilitating cross-omics interactions and enabling joint representation learning and incomplete omics inference. This model enhances multi-omics sample representation and empowers various downstream oncology tasks with incomplete multi-omics datasets. By employing interpretable learning, we characterize the contributions of distinct omics features to clinical outcomes. The TMO-Net model serves as a versatile framework for cross-modal multi-omics learning in oncology, paving the way for tumor omics-specific foundation models.
Collapse
Affiliation(s)
- Feng-Ao Wang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
- Guangzhou National Laboratory, Guangzhou, 510005, China
| | - Zhenfeng Zhuang
- Department of Computer Science at the School of Informatics, Xiamen University, Xiamen, 361005, China
| | - Feng Gao
- Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510655, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200433, China
- Biomedical Innovation Center, The Sixth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510655, China
| | - Ruikun He
- BYHEALTH Institute of Nutrition & Health, Guangzhou, 510000, China
| | - Shaoting Zhang
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200433, China
| | - Liansheng Wang
- Department of Computer Science at the School of Informatics, Xiamen University, Xiamen, 361005, China.
| | - Junwei Liu
- Guangzhou National Laboratory, Guangzhou, 510005, China.
| | - Yixue Li
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China.
- Guangzhou National Laboratory, Guangzhou, 510005, China.
- Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, 200030, China.
- GZMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, Guangzhou, 511436, China.
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
- Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai, 200433, China.
- Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai, 200032, China.
| |
Collapse
|
6
|
Madhan S, Kalaiselvan A. Omics data classification using constitutive artificial neural network optimized with single candidate optimizer. NETWORK (BRISTOL, ENGLAND) 2024:1-25. [PMID: 38736309 DOI: 10.1080/0954898x.2024.2348726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 04/23/2024] [Indexed: 05/14/2024]
Abstract
Recent technical advancements enable omics-based biological study of molecules with very high throughput and low cost, such as genomic, proteomic, and microbionics'. To overcome this drawback, Omics Data Classification using Constitutive Artificial Neural Network Optimized with Single Candidate Optimizer (ODC-ZOA-CANN-SCO) is proposed in this manuscript. The input data is pre-processing by using Adaptive variational Bayesian filtering (AVBF) to replace missing values. The pre-processing data is fed to Zebra Optimization Algorithm (ZOA) for dimensionality reduction. Then, the Constitutive Artificial Neural Network (CANN) is employed to classify omics data. The weight parameter is optimized by Single Candidate Optimizer (SCO). The proposed ODC-ZOA-CANN-SCO method attains 25.36%, 21.04%, 22.18%, 26.90%, and 28.12% higher accuracy when analysed to the existing methods like multi-omics data integration utilizing adaptive graph learning and attention mode for patient categorization with biomarker identification (MOD-AGL-AM-PABI), deep learning method depending upon multi-omics data integration to create risk stratification prediction mode for skin cutaneous melanoma (DL-MODI-RSP-SCM), Deep belief network-base model for identifying Alzheimer's disease utilizing multi-omics data (DDN-DAD-MOD), hybrid cancer prediction depending upon multi-omics data and reinforcement learning state action reward state action (HCP-MOD-RL-SARSA), machine learning basis method under omics data including biological knowledge database for cancer clinical endpoint prediction (ML-ODBKD-CCEP) methods, respectively.
Collapse
Affiliation(s)
- Subramaniam Madhan
- Department of Computer Science and Engineering, University College of Engineering, Thirukkuvalai (A Constituent College of Anna University Chennai), Nagapattinam, Tamilnadu, India
| | - Anbarasan Kalaiselvan
- Department of Science and Humanities, University College of Engineering, Thirukkuvalai (A Constituent College of Anna University Chennai), Nagapattinam, Tamilnadu, India
| |
Collapse
|
7
|
Acharya D, Mukhopadhyay A. A comprehensive review of machine learning techniques for multi-omics data integration: challenges and applications in precision oncology. Brief Funct Genomics 2024:elae013. [PMID: 38600757 DOI: 10.1093/bfgp/elae013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 03/12/2024] [Accepted: 03/22/2024] [Indexed: 04/12/2024] Open
Abstract
Multi-omics data play a crucial role in precision medicine, mainly to understand the diverse biological interaction between different omics. Machine learning approaches have been extensively employed in this context over the years. This review aims to comprehensively summarize and categorize these advancements, focusing on the integration of multi-omics data, which includes genomics, transcriptomics, proteomics and metabolomics, alongside clinical data. We discuss various machine learning techniques and computational methodologies used for integrating distinct omics datasets and provide valuable insights into their application. The review emphasizes both the challenges and opportunities present in multi-omics data integration, precision medicine and patient stratification, offering practical recommendations for method selection in various scenarios. Recent advances in deep learning and network-based approaches are also explored, highlighting their potential to harmonize diverse biological information layers. Additionally, we present a roadmap for the integration of multi-omics data in precision oncology, outlining the advantages, challenges and implementation difficulties. Hence this review offers a thorough overview of current literature, providing researchers with insights into machine learning techniques for patient stratification, particularly in precision oncology. Contact: anirban@klyuniv.ac.in.
Collapse
Affiliation(s)
- Debabrata Acharya
- Department of Computer Science & Engineering, University of Kalyani, Kalyani-741235, West Bengal, India
| | - Anirban Mukhopadhyay
- Department of Computer Science & Engineering, University of Kalyani, Kalyani-741235, West Bengal, India
| |
Collapse
|
8
|
Anwar MA, Keshteli AH, Yang H, Wang W, Li X, Messier HM, Cullis PR, Borchers CH, Fraser R, Wishart DS. Blood-Based Multiomics-Guided Detection of a Precancerous Pancreatic Tumor. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2024; 28:182-192. [PMID: 38634790 DOI: 10.1089/omi.2023.0278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
Over a decade ago, longitudinal multiomics analysis was pioneered for early disease detection and individually tailored precision health interventions. However, high sample processing costs, expansive multiomics measurements along with complex data analysis have made this approach to precision/personalized medicine impractical. Here we describe in a case report, a more practical approach that uses fewer measurements, annual sampling, and faster decision making. We also show how this approach offers promise to detect an exceedingly rare and potentially fatal condition before it fully manifests. Specifically, we describe in the present case report how longitudinal multiomics monitoring (LMOM) helped detect a precancerous pancreatic tumor and led to a successful surgical intervention. The patient, enrolled in an annual blood-based LMOM since 2018, had dramatic changes in the June 2021 and 2022 annual metabolomics and proteomics results that prompted further clinical diagnostic testing for pancreatic cancer. Using abdominal magnetic resonance imaging, a 2.6 cm lesion in the tail of the patient's pancreas was detected. The tumor fluid from an aspiration biopsy had 10,000 times that of normal carcinoembryonic antigen levels. After the tumor was surgically resected, histopathological findings confirmed it was a precancerous pancreatic tumor. Postoperative omics testing indicated that most metabolite and protein levels returned to patient's 2018 levels. This case report illustrates the potentials of blood LMOM for precision/personalized medicine, and new ways of thinking medical innovation for a potentially life-saving early diagnosis of pancreatic cancer. Blood LMOM warrants future programmatic translational research with the goals of precision medicine, and individually tailored cancer diagnoses and treatments.
Collapse
Affiliation(s)
| | | | - Haiyan Yang
- Molecular You Corporation, Vancouver, British Columbia, Canada
| | - Windy Wang
- Molecular You Corporation, Vancouver, British Columbia, Canada
| | - Xukun Li
- Molecular You Corporation, Vancouver, British Columbia, Canada
| | - Helen M Messier
- Molecular You Corporation, Vancouver, British Columbia, Canada
- Fountain Life, Naples, Florida, USA
| | - Pieter R Cullis
- Molecular You Corporation, Vancouver, British Columbia, Canada
- Life Sciences Centre, University of British Columbia, Vancouver, British Columbia, Canada
- Department of Biochemistry and Molecular Biology, Faculty of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Christoph H Borchers
- Gerald Bronfman Department of Oncology, Jewish General Hospital, McGill University, Montreal, Quebec, Canada
- Segal Cancer Proteomics Centre, Lady Davis Institute, Jewish General Hospital, McGill University, Montreal, Quebec, Canada
- Division of Experimental Medicine, McGill University, Montreal, Quebec, Canada
- Department of Pathology, McGill University, Montreal, Quebec, Canada
| | - Robert Fraser
- Molecular You Corporation, Vancouver, British Columbia, Canada
| | - David S Wishart
- Molecular You Corporation, Vancouver, British Columbia, Canada
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada
- Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, Alberta, Canada
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
9
|
Kumar V, Prabha C, Sharma P, Mittal N, Askar SS, Abouhawwash M. Unified deep learning models for enhanced lung cancer prediction with ResNet-50-101 and EfficientNet-B3 using DICOM images. BMC Med Imaging 2024; 24:63. [PMID: 38500083 PMCID: PMC10946139 DOI: 10.1186/s12880-024-01241-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 03/07/2024] [Indexed: 03/20/2024] Open
Abstract
Significant advancements in machine learning algorithms have the potential to aid in the early detection and prevention of cancer, a devastating disease. However, traditional research methods face obstacles, and the amount of cancer-related information is rapidly expanding. The authors have developed a helpful support system using three distinct deep-learning models, ResNet-50, EfficientNet-B3, and ResNet-101, along with transfer learning, to predict lung cancer, thereby contributing to health and reducing the mortality rate associated with this condition. This offer aims to address the issue effectively. Using a dataset of 1,000 DICOM lung cancer images from the LIDC-IDRI repository, each image is classified into four different categories. Although deep learning is still making progress in its ability to analyze and understand cancer data, this research marks a significant step forward in the fight against cancer, promoting better health outcomes and potentially lowering the mortality rate. The Fusion Model, like all other models, achieved 100% precision in classifying Squamous Cells. The Fusion Model and ResNet-50 achieved a precision of 90%, closely followed by EfficientNet-B3 and ResNet-101 with slightly lower precision. To prevent overfitting and improve data collection and planning, the authors implemented a data extension strategy. The relationship between acquiring knowledge and reaching specific scores was also connected to advancing and addressing the issue of imprecise accuracy, ultimately contributing to advancements in health and a reduction in the mortality rate associated with lung cancer.
Collapse
Affiliation(s)
- Vinod Kumar
- Department of Computer Science and Engineering, Chandigarh University, Mohali, Punjab, India
| | - Chander Prabha
- Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
| | - Preeti Sharma
- Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
| | - Nitin Mittal
- Skill Faculty of Engineering and Technology, Shri Vishwakarma Skill University, Palwal, Haryana, India.
| | - S S Askar
- Department of Statistics and Operations Research, College of Science, King Saud University, P.O. Box 2455, 11451, Riyadh, Saudi Arabia
| | - Mohamed Abouhawwash
- Department of Mathematics, Faculty of Science, Mansoura University, Mansoura, 35516, Egypt
| |
Collapse
|
10
|
Abbasi EY, Deng Z, Ali Q, Khan A, Shaikh A, Reshan MSA, Sulaiman A, Alshahrani H. A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction. Heliyon 2024; 10:e25369. [PMID: 38352790 PMCID: PMC10862685 DOI: 10.1016/j.heliyon.2024.e25369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 12/13/2023] [Accepted: 01/25/2024] [Indexed: 02/16/2024] Open
Abstract
In recent years, scientific data on cancer has expanded, providing potential for a better understanding of malignancies and improved tailored care. Advances in Artificial Intelligence (AI) processing power and algorithmic development position Machine Learning (ML) and Deep Learning (DL) as crucial players in predicting Leukemia, a blood cancer, using integrated multi-omics technology. However, realizing these goals demands novel approaches to harness this data deluge. This study introduces a novel Leukemia diagnosis approach, analyzing multi-omics data for accuracy using ML and DL algorithms. ML techniques, including Random Forest (RF), Naive Bayes (NB), Decision Tree (DT), Logistic Regression (LR), Gradient Boosting (GB), and DL methods such as Recurrent Neural Networks (RNN) and Feedforward Neural Networks (FNN) are compared. GB achieved 97 % accuracy in ML, while RNN outperformed by achieving 98 % accuracy in DL. This approach filters unclassified data effectively, demonstrating the significance of DL for leukemia prediction. The testing validation was based on 17 different features such as patient age, sex, mutation type, treatment methods, chromosomes, and others. Our study compares ML and DL techniques and chooses the best technique that gives optimum results. The study emphasizes the implications of high-throughput technology in healthcare, offering improved patient care.
Collapse
Affiliation(s)
- Erum Yousef Abbasi
- State Key Laboratory of Wireless Network Positioning and Communication Engineering Integration Research, School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing, China
| | - Zhongliang Deng
- State Key Laboratory of Wireless Network Positioning and Communication Engineering Integration Research, School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing, China
| | - Qasim Ali
- Department of Software Engineering, Mehran University of Engineering and Technology, Jamshoro, Pakistan
| | - Adil Khan
- State Key Laboratory of Wireless Network Positioning and Communication Engineering Integration Research, School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing, China
| | - Asadullah Shaikh
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
| | - Mana Saleh Al Reshan
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
- Scientific and Engineering Research Centre, Najran University, Najran, 61441, Saudi Arabia
| | - Adel Sulaiman
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
| | - Hani Alshahrani
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
| |
Collapse
|
11
|
Muller E, Shiryan I, Borenstein E. Multi-omic integration of microbiome data for identifying disease-associated modules. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.03.547607. [PMID: 37461534 PMCID: PMC10349976 DOI: 10.1101/2023.07.03.547607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
The human gut microbiome is a complex ecosystem with profound implications for health and disease. This recognition has led to a surge in multi-omic microbiome studies, employing various molecular assays to elucidate the microbiome's role in diseases across multiple functional layers. However, despite the clear value of these multi-omic datasets, rigorous integrative analysis of such data poses significant challenges, hindering a comprehensive understanding of microbiome-disease interactions. Perhaps most notably, multiple approaches, including univariate and multivariate analyses, as well as machine learning, have been applied to such data to identify disease-associated markers, namely, specific features (e.g., species, pathways, metabolites) that are significantly altered in disease state. These methods, however, often yield extensive lists of features associated with the disease without effectively capturing the multi-layered structure of multi-omic data or offering clear, interpretable hypotheses about underlying microbiome-disease mechanisms. Here, we address this challenge by introducing MintTea - an intermediate integration-based method for analyzing multi-omic microbiome data. MintTea combines a canonical correlation analysis (CCA) extension, consensus analysis, and an evaluation protocol to robustly identify disease-associated multi-omic modules. Each such module consists of a set of features from the various omics that both shift in concord, and collectively associate with the disease. Applying MintTea to diverse case-control cohorts with multi-omic data, we show that this framework is able to capture modules with high predictive power for disease, significant cross-omic correlations, and alignment with known microbiome-disease associations. For example, analyzing samples from a metabolic syndrome (MS) study, we found a MS-associated module comprising of a highly correlated cluster of serum glutamate- and TCA cycle-related metabolites, as well as bacterial species previously implicated in insulin resistance. In another cohort, we identified a module associated with late-stage colorectal cancer, featuring Peptostreptococcus and Gemella species and several fecal amino acids, in agreement with these species' reported role in the metabolism of these amino acids and their coordinated increase in abundance during disease development. Finally, comparing modules identified in different datasets, we detected multiple significant overlaps, suggesting common interactions between microbiome features. Combined, this work serves as a proof of concept for the potential benefits of advanced integration methods in generating integrated multi-omic hypotheses underlying microbiome-disease interactions and a promising avenue for researchers seeking systems-level insights into coherent mechanisms governing microbiome-related diseases.
Collapse
|
12
|
Pathak RK, Kim JM. Veterinary systems biology for bridging the phenotype-genotype gap via computational modeling for disease epidemiology and animal welfare. Brief Bioinform 2024; 25:bbae025. [PMID: 38343323 PMCID: PMC10859662 DOI: 10.1093/bib/bbae025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 01/02/2024] [Accepted: 01/15/2024] [Indexed: 02/15/2024] Open
Abstract
Veterinary systems biology is an innovative approach that integrates biological data at the molecular and cellular levels, allowing for a more extensive understanding of the interactions and functions of complex biological systems in livestock and veterinary science. It has tremendous potential to integrate multi-omics data with the support of vetinformatics resources for bridging the phenotype-genotype gap via computational modeling. To understand the dynamic behaviors of complex systems, computational models are frequently used. It facilitates a comprehensive understanding of how a host system defends itself against a pathogen attack or operates when the pathogen compromises the host's immune system. In this context, various approaches, such as systems immunology, network pharmacology, vaccinology and immunoinformatics, can be employed to effectively investigate vaccines and drugs. By utilizing this approach, we can ensure the health of livestock. This is beneficial not only for animal welfare but also for human health and environmental well-being. Therefore, the current review offers a detailed summary of systems biology advancements utilized in veterinary sciences, demonstrating the potential of the holistic approach in disease epidemiology, animal welfare and productivity.
Collapse
Affiliation(s)
- Rajesh Kumar Pathak
- Department of Animal Science and Technology, Chung-Ang University, Anseong-si, Gyeonggi-do 17546, Republic of Korea
| | - Jun-Mo Kim
- Department of Animal Science and Technology, Chung-Ang University, Anseong-si, Gyeonggi-do 17546, Republic of Korea
| |
Collapse
|
13
|
Wang Q, Chang Z, Liu X, Wang Y, Feng C, Ping Y, Feng X. Predictive Value of Machine Learning for Platinum Chemotherapy Responses in Ovarian Cancer: Systematic Review and Meta-Analysis. J Med Internet Res 2024; 26:e48527. [PMID: 38252469 PMCID: PMC10845031 DOI: 10.2196/48527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 11/23/2023] [Accepted: 11/24/2023] [Indexed: 01/23/2024] Open
Abstract
BACKGROUND Machine learning is a potentially effective method for predicting the response to platinum-based treatment for ovarian cancer. However, the predictive performance of various machine learning methods and variables is still a matter of controversy and debate. OBJECTIVE This study aims to systematically review relevant literature on the predictive value of machine learning for platinum-based chemotherapy responses in patients with ovarian cancer. METHODS Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, we systematically searched the PubMed, Embase, Web of Science, and Cochrane databases for relevant studies on predictive models for platinum-based therapies for the treatment of ovarian cancer published before April 26, 2023. The Prediction Model Risk of Bias Assessment tool was used to evaluate the risk of bias in the included articles. Concordance index (C-index), sensitivity, and specificity were used to evaluate the performance of the prediction models to investigate the predictive value of machine learning for platinum chemotherapy responses in patients with ovarian cancer. RESULTS A total of 1749 articles were examined, and 19 of them involving 39 models were eligible for this study. The most commonly used modeling methods were logistic regression (16/39, 41%), Extreme Gradient Boosting (4/39, 10%), and support vector machine (4/39, 10%). The training cohort reported C-index in 39 predictive models, with a pooled value of 0.806; the validation cohort reported C-index in 12 predictive models, with a pooled value of 0.831. Support vector machine performed well in both the training and validation cohorts, with a C-index of 0.942 and 0.879, respectively. The pooled sensitivity was 0.890, and the pooled specificity was 0.790 in the training cohort. CONCLUSIONS Machine learning can effectively predict how patients with ovarian cancer respond to platinum-based chemotherapy and may provide a reference for the development or updating of subsequent scoring systems.
Collapse
Affiliation(s)
- Qingyi Wang
- Department of First Clinical Medical College, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Zhuo Chang
- Basic Medical College, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Xiaofang Liu
- Department of First Clinical Medical College, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Yunrui Wang
- Department of First Clinical Medical College, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Chuwen Feng
- Department of First Clinical Medical College, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Yunlu Ping
- Department of First Clinical Medical College, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Xiaoling Feng
- Department of Gynecology, First Affiliated Hospital of Heilongjiang University of Chinese Medicine, Harbin, China
| |
Collapse
|
14
|
Tsagiopoulou M, Gut IG. Machine learning and multi-omics data in chronic lymphocytic leukemia: the future of precision medicine? Front Genet 2024; 14:1304661. [PMID: 38283149 PMCID: PMC10811210 DOI: 10.3389/fgene.2023.1304661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/27/2023] [Indexed: 01/30/2024] Open
Abstract
Chronic lymphocytic leukemia is a complex and heterogeneous hematological malignancy. The advance of high-throughput multi-omics technologies has significantly influenced chronic lymphocytic leukemia research and paved the way for precision medicine approaches. In this review, we explore the role of machine learning in the analysis of multi-omics data in this hematological malignancy. We discuss recent literature on different machine learning models applied to single omic studies in chronic lymphocytic leukemia, with a special focus on the potential contributions to precision medicine. Finally, we highlight the recently published machine learning applications in multi-omics data in this area of research as well as their potential and limitations.
Collapse
Affiliation(s)
| | - Ivo G. Gut
- Centro Nacional de Analisis Genomico (CNAG), Barcelona, Spain
- Universitat de Barcelona (UB), Barcelona, Spain
| |
Collapse
|
15
|
Qi B, Trakadis YJ. Advancing Clinical Psychiatry: Integration of Clinical and Omics Data Using Machine Learning. Biol Psychiatry 2023; 94:908-909. [PMID: 37968028 DOI: 10.1016/j.biopsych.2023.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 09/05/2023] [Indexed: 11/17/2023]
Affiliation(s)
- Bill Qi
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada
| | - Yannis J Trakadis
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada; Department of Specialized Medicine, Division of Medical Genetics, McGill University Health Center, Montreal, QC, Canada; Department of Psychiatry, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
16
|
Mead EC, Wang CA, Phung J, Fu JY, Williams SM, Merialdi M, Jacobsson B, Lye S, Menon R, Pennell CE. The Role of Genetics in Preterm Birth. Reprod Sci 2023; 30:3410-3427. [PMID: 37450251 PMCID: PMC10692032 DOI: 10.1007/s43032-023-01287-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 06/09/2023] [Indexed: 07/18/2023]
Abstract
Preterm birth (PTB), defined as the birth of a child before 37 completed weeks gestation, affects approximately 11% of live births and is the leading cause of death in children under 5 years. PTB is a complex disease with multiple risk factors including genetic variation. Much research has aimed to establish the biological mechanisms underlying PTB often through identification of genetic markers for PTB risk. The objective of this review is to present a comprehensive and updated summary of the published data relating to the field of PTB genetics. A literature search in PubMed was conducted and English studies related to PTB genetics were included. Genetic studies have identified genes within inflammatory, immunological, tissue remodeling, endocrine, metabolic, and vascular pathways that may be involved in PTB. However, a substantial proportion of published data have been largely inconclusive and multiple studies had limited power to detect associations. On the contrary, a few large hypothesis-free approaches have identified and replicated multiple novel variants associated with PTB in different cohorts. Overall, attempts to predict PTB using single "-omics" datasets including genomic, transcriptomic, and epigenomic biomarkers have been mostly unsuccessful and have failed to translate to the clinical setting. Integration of data from multiple "-omics" datasets has yielded the most promising results.
Collapse
Affiliation(s)
- Elyse C Mead
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, 2308, Australia
| | - Carol A Wang
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, 2308, Australia
- Hunter Medical Research Institute, Newcastle, NSW, 2305, Australia
| | - Jason Phung
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, 2308, Australia
- Hunter Medical Research Institute, Newcastle, NSW, 2305, Australia
- Department of Maternity and Gynaecology, John Hunter Hospital, Newcastle, NSW, 2305, Australia
| | - Joanna Yx Fu
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, 2308, Australia
| | - Scott M Williams
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Mario Merialdi
- Maternal Newborn Health Innovations, Geneva, PBC, Switzerland
| | - Bo Jacobsson
- Department of Obstetrics and Gynaecology, Institute of Clinical Science, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Department of Obstetrics and Gynaecology, Region Västra Götaland, Sahlgrenska University Hospital, Gothenburg, Sweden
- Department of Genetics and Bioinformatics, Domain of Health Data and Digitalization, Institute of Public Health, Oslo, Norway
| | - Stephen Lye
- Lunenfeld Tanenbaum Research Institute, Toronto, Ontario, Canada
| | - Ramkumar Menon
- Department of Obstetrics and Gynecology, Division of Basic Science and Translational Research, University of Texas Medical Branch, Galveston, TX, USA
| | - Craig E Pennell
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, 2308, Australia.
- Hunter Medical Research Institute, Newcastle, NSW, 2305, Australia.
- Department of Maternity and Gynaecology, John Hunter Hospital, Newcastle, NSW, 2305, Australia.
| |
Collapse
|
17
|
Erawijantari PP, Kartal E, Liñares-Blanco J, Laajala TD, Feldman LE, Carmona-Saez P, Shigdel R, Claesson MJ, Bertelsen RJ, Gomez-Cabrero D, Minot S, Albrecht J, Chung V, Inouye M, Jousilahti P, Schultz JH, Friederich HC, Knight R, Salomaa V, Niiranen T, Havulinna AS, Saez-Rodriguez J, Levinson RT, Lahti L. Microbiome-based risk prediction in incident heart failure: a community challenge. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.10.12.23296829. [PMID: 37873403 PMCID: PMC10593042 DOI: 10.1101/2023.10.12.23296829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Heart failure (HF) is a major public health problem. Early identification of at-risk individuals could allow for interventions that reduce morbidity or mortality. The community-based FINRISK Microbiome DREAM challenge (synapse.org/finrisk) evaluated the use of machine learning approaches on shotgun metagenomics data obtained from fecal samples to predict incident HF risk over 15 years in a population cohort of 7231 Finnish adults (FINRISK 2002, n=559 incident HF cases). Challenge participants used synthetic data for model training and testing. Final models submitted by seven teams were evaluated in the real data. The two highest-scoring models were both based on Cox regression but used different feature selection approaches. We aggregated their predictions to create an ensemble model. Additionally, we refined the models after the DREAM challenge by eliminating phylum information. Models were also evaluated at intermediate timepoints and they predicted 10-year incident HF more accurately than models for 5- or 15-year incidence. We found that bacterial species, especially those linked to inflammation, are predictive of incident HF. This highlights the role of the gut microbiome as a potential driver of inflammation in HF pathophysiology. Our results provide insights into potential modeling strategies of microbiome data in prospective cohort studies. Overall, this study provides evidence that incorporating microbiome information into incident risk models can provide important biological insights into the pathogenesis of HF.
Collapse
Affiliation(s)
| | - Ece Kartal
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - José Liñares-Blanco
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- GENYO. Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government, PTS Granada, Avenida de la Ilustración 114, 18016, Granada, Spain
- Department of Statistics and Operations Research, University of Granada, Spain
| | - Teemu D Laajala
- Department of Mathematics and Statistics, Faculty of Science, University of Turku, Finland
- Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Lily Elizabeth Feldman
- Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Pedro Carmona-Saez
- GENYO. Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government, PTS Granada, Avenida de la Ilustración 114, 18016, Granada, Spain
- Department of Statistics and Operations Research, University of Granada, Spain
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Marcus Joakim Claesson
- APC Microbiome Ireland, University College Cork, T12 YT20 Cork, Ireland
- School of Microbiology, University College Cork, T12 YT20 Cork, Ireland
| | | | - David Gomez-Cabrero
- Translational Bioinformatics Unit, Navarrabiomed, Public University of Navarra, IDISNA, Pamplona, Spain
- Biological and Environmental Sciences & Engineering Division, King Abdullah University of Science & Technology, Thuwal, Kingdom of Saudi Arabia
| | - Samuel Minot
- Data Core, Shared Resources, Fred Hutchinson Cancer Center. Seattle, WA. USA
| | | | | | - Michael Inouye
- Cambridge Baker Systems Genomics Initiative, Baker Heart & Diabetes Institute, Melbourne, Victoria, Australia
- Cambridge Baker Systems Genomics Initiative, Department of Public Health and Primary Care, Cambridge University, Cambridge, UK
- Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, UK
| | - Pekka Jousilahti
- Department of Public Health and Welfare, Finnish Institute for Health and Welfare, Helsinki, Finland
| | - Jobst-Hendrik Schultz
- Department of General Internal Medicine & Psychosomatics, Heidelberg University Hospital, Heidelberg, Germany
| | - Hans-Christoph Friederich
- Department of General Internal Medicine & Psychosomatics, Heidelberg University Hospital, Heidelberg, Germany
| | - Rob Knight
- Jacobs School of Engineering, University of California San Diego, La Jolla, CA. USA
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA. USA
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA. USA
- Department of Computer Science & Engineering, University of California San Diego, La Jolla, CA. USA
| | - Veikko Salomaa
- Department of Public Health and Welfare, Finnish Institute for Health and Welfare, Helsinki, Finland
| | - Teemu Niiranen
- Department of Public Health and Welfare, Finnish Institute for Health and Welfare, Helsinki, Finland
- Division of Medicine, Turku University Hospital, Turku, Finland
- Department of Internal Medicine, University of Turku, Turku, Finland
| | - Aki S Havulinna
- Department of Public Health and Welfare, Finnish Institute for Health and Welfare, Helsinki, Finland
- Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, Helsinki, Finland
| | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Rebecca T Levinson
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Department of General Internal Medicine & Psychosomatics, Heidelberg University Hospital, Heidelberg, Germany
| | - Leo Lahti
- Department of Computing, Faculty of Technology, University of Turku, Turku, Finland
| |
Collapse
|
18
|
D’Elia D, Truu J, Lahti L, Berland M, Papoutsoglou G, Ceci M, Zomer A, Lopes MB, Ibrahimi E, Gruca A, Nechyporenko A, Frohme M, Klammsteiner T, Pau ECDS, Marcos-Zambrano LJ, Hron K, Pio G, Simeon A, Suharoschi R, Moreno-Indias I, Temko A, Nedyalkova M, Apostol ES, Truică CO, Shigdel R, Telalović JH, Bongcam-Rudloff E, Przymus P, Jordamović NB, Falquet L, Tarazona S, Sampri A, Isola G, Pérez-Serrano D, Trajkovik V, Klucar L, Loncar-Turukalo T, Havulinna AS, Jansen C, Bertelsen RJ, Claesson MJ. Advancing microbiome research with machine learning: key findings from the ML4Microbiome COST action. Front Microbiol 2023; 14:1257002. [PMID: 37808321 PMCID: PMC10558209 DOI: 10.3389/fmicb.2023.1257002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/05/2023] [Indexed: 10/10/2023] Open
Abstract
The rapid development of machine learning (ML) techniques has opened up the data-dense field of microbiome research for novel therapeutic, diagnostic, and prognostic applications targeting a wide range of disorders, which could substantially improve healthcare practices in the era of precision medicine. However, several challenges must be addressed to exploit the benefits of ML in this field fully. In particular, there is a need to establish "gold standard" protocols for conducting ML analysis experiments and improve interactions between microbiome researchers and ML experts. The Machine Learning Techniques in Human Microbiome Studies (ML4Microbiome) COST Action CA18131 is a European network established in 2019 to promote collaboration between discovery-oriented microbiome researchers and data-driven ML experts to optimize and standardize ML approaches for microbiome analysis. This perspective paper presents the key achievements of ML4Microbiome, which include identifying predictive and discriminatory 'omics' features, improving repeatability and comparability, developing automation procedures, and defining priority areas for the novel development of ML methods targeting the microbiome. The insights gained from ML4Microbiome will help to maximize the potential of ML in microbiome research and pave the way for new and improved healthcare practices.
Collapse
Affiliation(s)
- Domenica D’Elia
- Department of Biomedical Sciences, National Research Council, Institute for Biomedical Technologies, Bari, Italy
| | - Jaak Truu
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Leo Lahti
- Department of Computing, University of Turku, Turku, Finland
| | - Magali Berland
- Université Paris-Saclay, INRAE, MetaGenoPolis, Jouy-en-Josas, France
| | - Georgios Papoutsoglou
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece
- Department of Computer Science, University of Crete, Heraklion, Greece
| | - Michelangelo Ceci
- Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
| | - Aldert Zomer
- Department of Biomolecular Health Sciences (Infectious Diseases and Immunology), Faculty of Veterinary Medicine, Utrecht University, Utrecht, Netherlands
| | - Marta B. Lopes
- Center for Mathematics and Applications (NOVA Math), NOVA School of Science and Technology, Caparica, Portugal
- UNIDEMI, Department of Mechanical and Industrial Engineering, NOVA School of Science and Technology, Caparica, Portugal
| | - Eliana Ibrahimi
- Department of Biology, University of Tirana, Tirana, Albania
| | - Aleksandra Gruca
- Department of Computer Networks and Systems, Silesian University of Technology, Gliwice, Poland
| | - Alina Nechyporenko
- Systems Engineering Department, Kharkiv National University of Radio Electronics, Kharkiv, Ukraine
- Department of Molecular Biotechnology and Functional Genomics, Technical University of Applied Sciences Wildau, Wildau, Germany
| | - Marcus Frohme
- Department of Molecular Biotechnology and Functional Genomics, Technical University of Applied Sciences Wildau, Wildau, Germany
| | - Thomas Klammsteiner
- Department of Microbiology, Universität Innsbruck, Innsbruck, Austria
- Department of Ecology, Universität Innsbruck, Innsbruck, Austria
| | - Enrique Carrillo-de Santa Pau
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, CEI UAM+CSIC, Madrid, Spain
| | - Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, CEI UAM+CSIC, Madrid, Spain
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czechia
| | - Gianvito Pio
- Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
| | - Andrea Simeon
- BioSense Institute, University of Novi Sad, Novi Sad, Serbia
| | - Ramona Suharoschi
- Molecular Nutrition and Proteomics Research Laboratory, Department of Food Science, University of Agricultural Sciences and Veterinary Medicine of Cluj-Napoca, Cluj-Napoca, Romania
| | - Isabel Moreno-Indias
- Department of Endocrinology and Nutrition, Virgen de la Victoria University Hospital, the Biomedical Research Institute of Malaga and Platform in Nanomedicine (IBIMA-BIONAND Platform), University of Malaga, Malaga, Spain
| | - Andriy Temko
- Department of Electrical and Electronic Engineering, University College Cork, Cork, Ireland
| | | | - Elena-Simona Apostol
- Computer Science and Engineering Department, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Ciprian-Octavian Truică
- Computer Science and Engineering Department, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Jasminka Hasić Telalović
- Department of Computer Science, University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
| | - Erik Bongcam-Rudloff
- Swedish University of Agricultural Sciences, Department of Animal Breeding and Genetics, Uppsala, Sweden
| | | | - Naida Babić Jordamović
- Computational Biology, International Centre for Genetic Engineering and Biotechnology, Trieste, Italy
- Verlab Research Institute for BIomedical Engineering, Medical Devices and Artificial Intelligence, Sarajevo, Bosnia and Herzegovina
| | - Laurent Falquet
- University of Fribourg and Swiss Institute of Bioinformatics, Fribourg, Switzerland
| | - Sonia Tarazona
- Department of Applied Statistics and Operations Research and Quality, Universitat Politècnica de València, València, Spain
| | - Alexia Sampri
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, United Kingdom
| | - Gaetano Isola
- Department of General Surgery and Surgical-Medical Specialties, School of Dentistry, University of Catania, Catania, Italy
| | - David Pérez-Serrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, CEI UAM+CSIC, Madrid, Spain
| | | | - Lubos Klucar
- Institute of Molecular Biology, Slovak Academy of Sciences, Bratislava, Slovakia
| | | | - Aki S. Havulinna
- Finnish Institute for Health and Welfare, Helsinki, Finland
- Institute for Molecular Medicine Finland, FIMM-HiLIFE, Helsinki, Finland
| | - Christian Jansen
- Biome Diagnostics GmbH, Vienna, Austria
- Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
| | | | | |
Collapse
|
19
|
Astrologo NCN, Gaudillo JD, Albia JR, Roxas-Villanueva RML. Genetic risk assessment based on association and prediction studies. Sci Rep 2023; 13:15230. [PMID: 37709797 PMCID: PMC10502006 DOI: 10.1038/s41598-023-41862-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 09/01/2023] [Indexed: 09/16/2023] Open
Abstract
The genetic basis of phenotypic emergence provides valuable information for assessing individual risk. While association studies have been pivotal in identifying genetic risk factors within a population, complementing it with insights derived from predictions studies that assess individual-level risk offers a more comprehensive approach to understanding phenotypic expression. In this study, we established personalized risk assessment models using single-nucleotide polymorphism (SNP) data from 200 Korean patients, of which 100 experienced hepatitis B surface antigen (HBsAg) seroclearance and 100 patients demonstrated high levels of HBsAg. The risk assessment models determined the predictive power of the following: (1) genome-wide association study (GWAS)-identified candidate biomarkers considered significant in a reference study and (2) machine learning (ML)-identified candidate biomarkers with the highest feature importance scores obtained by using random forest (RF). While utilizing all features yielded 64% model accuracy, using relevant biomarkers achieved higher model accuracies: 82% for 52 GWAS-identified candidate biomarkers, 71% for three GWAS-identified biomarkers, and 80% for 150 ML-identified candidate biomarkers. Findings highlight that the joint contributions of relevant biomarkers significantly influence phenotypic emergence. On the other hand, combining ML-identified candidate biomarkers into the pool of GWAS-identified candidate biomarkers resulted in the improved predictive accuracy of 90%, demonstrating the capability of ML as an auxiliary analysis to GWAS. Furthermore, some of the ML-identified candidate biomarkers were found to be linked with hepatocellular carcinoma (HCC), reinforcing previous claims that HCC can still occur despite the absence of HBsAg.
Collapse
Affiliation(s)
- Nicole Cathlene N Astrologo
- Data Analytics Research Laboratory (DARELab), Institute of Mathematical Sciences and Physics, University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines
- Computational Interdisciplinary Research Laboratory (CINTERLabs), University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines
| | - Joverlyn D Gaudillo
- Data Analytics Research Laboratory (DARELab), Institute of Mathematical Sciences and Physics, University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines.
- Computational Interdisciplinary Research Laboratory (CINTERLabs), University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines.
- Domingo AI Research Center (DARC Labs), 1606, Pasig, Philippines.
| | - Jason R Albia
- Domingo AI Research Center (DARC Labs), 1606, Pasig, Philippines
- Venn Biosciences Corporation Dba InterVenn Biosciences, Metro Manila, Pasig, Philippines
- Graduate School, University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines
| | - Ranzivelle Marianne L Roxas-Villanueva
- Data Analytics Research Laboratory (DARELab), Institute of Mathematical Sciences and Physics, University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines
- Computational Interdisciplinary Research Laboratory (CINTERLabs), University of the Philippines Los Baños, 4031, Los Baños, Laguna, Philippines
| |
Collapse
|
20
|
Hegarty C, Neto N, Cahill P, Floudas A. Computational approaches in rheumatic diseases - Deciphering complex spatio-temporal cell interactions. Comput Struct Biotechnol J 2023; 21:4009-4020. [PMID: 37649712 PMCID: PMC10462794 DOI: 10.1016/j.csbj.2023.08.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 08/04/2023] [Accepted: 08/04/2023] [Indexed: 09/01/2023] Open
Abstract
Inflammatory arthritis, including rheumatoid (RA), and psoriatic (PsA) arthritis, are clinically and immunologically heterogeneous diseases with no identified cure. Chronic inflammation of the synovial tissue ushers loss of function of the joint that severely impacts the patient's quality of life, eventually leading to disability and life-threatening comorbidities. The pathogenesis of synovial inflammation is the consequence of compounded immune and stromal cell interactions influenced by genetic and environmental factors. Deciphering the complexity of the synovial cellular landscape has accelerated primarily due to the utilisation of bulk and single cell RNA sequencing. Particularly the capacity to generate cell-cell interaction networks could reveal evidence of previously unappreciated processes leading to disease. However, there is currently a lack of universal nomenclature as a result of varied experimental and technological approaches that discombobulates the study of synovial inflammation. While spatial transcriptomic analysis that combines anatomical information with transcriptomic data of synovial tissue biopsies promises to provide more insights into disease pathogenesis, in vitro functional assays with single-cell resolution will be required to validate current bioinformatic applications. In order to provide a comprehensive approach and translate experimental data to clinical practice, a combination of clinical and molecular data with machine learning has the potential to enhance patient stratification and identify individuals at risk of arthritis that would benefit from early therapeutic intervention. This review aims to provide a comprehensive understanding of the effect of computational approaches in deciphering synovial inflammation pathogenesis and discuss the impact that further experimental and novel computational tools may have on therapeutic target identification and drug development.
Collapse
Affiliation(s)
- Ciara Hegarty
- Translational Immunology lab, School of Biotechnology, Dublin City University, Dublin, Ireland
| | - Nuno Neto
- Trinity Centre for Biomedical Engineering, Trinity College Dublin, Ireland
| | - Paul Cahill
- Vascular Biology lab, School of Biotechnology, Dublin City University, Dublin, Ireland
| | - Achilleas Floudas
- Translational Immunology lab, School of Biotechnology, Dublin City University, Dublin, Ireland
| |
Collapse
|
21
|
Lallah PN, Laite C, Bangash AB, Chooah O, Jiang C. The Use of Artificial Intelligence for Detecting and Predicting Atrial Arrhythmias Post Catheter Ablation. Rev Cardiovasc Med 2023; 24:215. [PMID: 39076714 PMCID: PMC11266764 DOI: 10.31083/j.rcm2408215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 03/01/2023] [Accepted: 03/07/2023] [Indexed: 07/31/2024] Open
Abstract
Catheter ablation (CA) is considered as one of the most effective methods technique for eradicating persistent and abnormal cardiac arrhythmias. Nevertheless, in some cases, these arrhythmias are not treated properly, resulting in their recurrences. If left untreated, they may result in complications such as strokes, heart failure, or death. Until recently, the primary techniques for diagnosing recurrent arrhythmias following CA were the findings predisposing to the changes caused by the arrhythmias on cardiac imaging and electrocardiograms during follow-up visits, or if patients reported having palpitations or chest discomfort after the ablation. However, these follow-ups may be time-consuming and costly, and they may not always determine the root cause of the recurrences. With the introduction of artificial intelligence (AI), these follow-up visits can be effectively shortened, and improved methods for predicting the likelihood of recurring arrhythmias after their ablation procedures can be developed. AI can be divided into two categories: machine learning (ML) and deep learning (DL), the latter of which is a subset of ML. ML and DL models have been used in several studies to demonstrate their ability to predict and identify cardiac arrhythmias using clinical variables, electrophysiological characteristics, and trends extracted from imaging data. AI has proven to be a valuable aid for cardiologists due to its ability to compute massive amounts of data and detect subtle changes in electric signals and cardiac images, which may potentially increase the risk of recurrent arrhythmias after CA. Despite the fact that these studies involving AI have generated promising outcomes comparable to or superior to human intervention, they have primarily focused on atrial fibrillation while atrial flutter (AFL) and atrial tachycardia (AT) were the subjects of relatively few AI studies. Therefore, the aim of this review is to investigate the interaction of AI algorithms, electrophysiological characteristics, imaging data, risk score calculators, and clinical variables in predicting cardiac arrhythmias following an ablation procedure. This review will also discuss the implementation of these algorithms to enable the detection and prediction of AFL and AT recurrences following CA.
Collapse
Affiliation(s)
- Poojesh Nikhil Lallah
- Department of Cardiology, Sir Run Run Shaw Hospital, School of Medicine,
Zhejiang University, 310016 Hangzhou, Zhejiang, China
| | - Chen Laite
- Department of Cardiology, Sir Run Run Shaw Hospital, School of Medicine,
Zhejiang University, 310016 Hangzhou, Zhejiang, China
| | - Abdul Basit Bangash
- Department of Cardiology, Sir Run Run Shaw Hospital, School of Medicine,
Zhejiang University, 310016 Hangzhou, Zhejiang, China
| | - Outesh Chooah
- Department of Radiology, Sir Run Run Shaw Hospital, School of Medicine,
Zhejiang University, 310016 Hangzhou, Zhejiang, China
| | - Chenyang Jiang
- Department of Cardiology, Sir Run Run Shaw Hospital, School of Medicine,
Zhejiang University, 310016 Hangzhou, Zhejiang, China
| |
Collapse
|
22
|
Güldener U, Kessler T, von Scheidt M, Hawe JS, Gerhard B, Maier D, Lachmann M, Laugwitz KL, Cassese S, Schömig AW, Kastrati A, Schunkert H. Machine Learning Identifies New Predictors on Restenosis Risk after Coronary Artery Stenting in 10,004 Patients with Surveillance Angiography. J Clin Med 2023; 12:jcm12082941. [PMID: 37109283 PMCID: PMC10142067 DOI: 10.3390/jcm12082941] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 03/31/2023] [Accepted: 04/15/2023] [Indexed: 04/29/2023] Open
Abstract
OBJECTIVE Machine learning (ML) approaches have the potential to uncover regular patterns in multi-layered data. Here we applied self-organizing maps (SOMs) to detect such patterns with the aim to better predict in-stent restenosis (ISR) at surveillance angiography 6 to 8 months after percutaneous coronary intervention with stenting. METHODS In prospectively collected data from 10,004 patients receiving percutaneous coronary intervention (PCI) for 15,004 lesions, we applied SOMs to predict ISR angiographically 6-8 months after index procedure. SOM findings were compared with results of conventional uni- and multivariate analyses. The predictive value of both approaches was assessed after random splitting of patients into training and test sets (50:50). RESULTS Conventional multivariate analyses revealed 10, mostly known, predictors for restenosis after coronary stenting: balloon-to-vessel ratio, complex lesion morphology, diabetes mellitus, left main stenting, stent type (bare metal vs. first vs. second generation drug eluting stent), stent length, stenosis severity, vessel size reduction, and prior bypass surgery. The SOM approach identified all these and nine further predictors, including chronic vessel occlusion, lesion length, and prior PCI. Moreover, the SOM-based model performed well in predicting ISR (AUC under ROC: 0.728); however, there was no meaningful advantage in predicting ISR at surveillance angiography in comparison with the conventional multivariable model (0.726, p = 0.3). CONCLUSIONS The agnostic SOM-based approach identified-without clinical knowledge-even more contributors to restenosis risk. In fact, SOMs applied to a large prospectively sampled cohort identified several novel predictors of restenosis after PCI. However, as compared with established covariates, ML technologies did not improve identification of patients at high risk for restenosis after PCI in a clinically relevant fashion.
Collapse
Affiliation(s)
- Ulrich Güldener
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
| | - Thorsten Kessler
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Munich Heart Alliance, 80802 Munich, Germany
| | - Moritz von Scheidt
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Munich Heart Alliance, 80802 Munich, Germany
| | - Johann S Hawe
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
| | | | - Dieter Maier
- Biomax, Robert-Koch-Str. 2, 82152 Planegg, Germany
| | - Mark Lachmann
- Department of Cardiology, Klinikum Rechts der Isar, Technische Universität München, 81675 Munich, Germany
| | - Karl-Ludwig Laugwitz
- DZHK (German Center for Cardiovascular Research), Partner Site Munich Heart Alliance, 80802 Munich, Germany
- Department of Cardiology, Klinikum Rechts der Isar, Technische Universität München, 81675 Munich, Germany
| | - Salvatore Cassese
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
| | - Albert W Schömig
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
| | - Adnan Kastrati
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Munich Heart Alliance, 80802 Munich, Germany
| | - Heribert Schunkert
- Department of Cardiology, Deutsches Herzzentrum München, Technische Universität München, 80636 Munich, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Munich Heart Alliance, 80802 Munich, Germany
| |
Collapse
|