1
|
Chafai N, Bonizzi L, Botti S, Badaoui B. Emerging applications of machine learning in genomic medicine and healthcare. Crit Rev Clin Lab Sci 2024; 61:140-163. [PMID: 37815417 DOI: 10.1080/10408363.2023.2259466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 09/12/2023] [Indexed: 10/11/2023]
Abstract
The integration of artificial intelligence technologies has propelled the progress of clinical and genomic medicine in recent years. The significant increase in computing power has facilitated the ability of artificial intelligence models to analyze and extract features from extensive medical data and images, thereby contributing to the advancement of intelligent diagnostic tools. Artificial intelligence (AI) models have been utilized in the field of personalized medicine to integrate clinical data and genomic information of patients. This integration allows for the identification of customized treatment recommendations, ultimately leading to enhanced patient outcomes. Notwithstanding the notable advancements, the application of artificial intelligence (AI) in the field of medicine is impeded by various obstacles such as the limited availability of clinical and genomic data, the diversity of datasets, ethical implications, and the inconclusive interpretation of AI models' results. In this review, a comprehensive evaluation of multiple machine learning algorithms utilized in the fields of clinical and genomic medicine is conducted. Furthermore, we present an overview of the implementation of artificial intelligence (AI) in the fields of clinical medicine, drug discovery, and genomic medicine. Finally, a number of constraints pertaining to the implementation of artificial intelligence within the healthcare industry are examined.
Collapse
Affiliation(s)
- Narjice Chafai
- Laboratory of Biodiversity, Ecology, and Genome, Faculty of Sciences, Department of Biology, Mohammed V University in Rabat, Rabat, Morocco
| | - Luigi Bonizzi
- Department of Biomedical, Surgical and Dental Science, University of Milan, Milan, Italy
| | - Sara Botti
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy
| | - Bouabid Badaoui
- Laboratory of Biodiversity, Ecology, and Genome, Faculty of Sciences, Department of Biology, Mohammed V University in Rabat, Rabat, Morocco
- African Sustainable Agriculture Research Institute (ASARI), Mohammed VI Polytechnic University (UM6P), Laâyoune, Morocco
| |
Collapse
|
2
|
Robust SNP-based prediction of rheumatoid arthritis through machine-learning-optimized polygenic risk score. J Transl Med 2023; 21:92. [PMID: 36750873 PMCID: PMC9903430 DOI: 10.1186/s12967-023-03939-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 01/28/2023] [Indexed: 02/09/2023] Open
Abstract
BACKGROUND The popular statistics-based Genome-wide association studies (GWAS) have provided deep insights into the field of complex disorder genetics. However, its clinical applicability to predict disease/trait outcomes remains unclear as statistical models are not designed to make predictions. This study employs statistics-free machine-learning (ML)-optimized polygenic risk score (PRS) to complement existing GWAS and bring the prediction of disease/trait outcomes closer to clinical application. Rheumatoid Arthritis (RA) was selected as a model disease to demonstrate the robustness of ML in disease prediction as RA is a prevalent chronic inflammatory joint disease with high mortality rates, affecting adults at the economic prime. Early identification of at-risk individuals may facilitate measures to mitigate the effects of the disease. METHODS This study employs a robust ML feature selection algorithm to identify single nucleotide polymorphisms (SNPs) that can predict RA from a set of training data comprising RA patients and population control samples. Thereafter, selected SNPs were evaluated for their predictive performances across 3 independent, unseen test datasets. The selected SNPs were subsequently used to generate PRS which was also evaluated for its predictive capacity as a sole feature. RESULTS Through robust ML feature selection, 9 SNPs were found to be the minimum number of features for excellent predictive performance (AUC > 0.9) in 3 independent, unseen test datasets. PRS based on these 9 SNPs was significantly associated with (P < 1 × 10-16) and predictive (AUC > 0.9) of RA in the 3 unseen datasets. A RA ML-PRS calculator of these 9 SNPs was developed ( https://xistance.shinyapps.io/prs-ra/ ) to facilitate individualized clinical applicability. The majority of the predictive SNPs are protective, reside in non-coding regions, and are either predicted to be potentially functional SNPs (pfSNPs) or in high linkage disequilibrium (r2 > 0.8) with un-interrogated pfSNPs. CONCLUSIONS These findings highlight the promise of this ML strategy to identify useful genetic features that can robustly predict disease and amenable to translation for clinical application.
Collapse
|
3
|
Sastry NC, Roy D, Banerjee A. Stability of sensorimotor network sculpts the dynamic repertoire of resting state over lifespan. Cereb Cortex 2023; 33:1246-1262. [PMID: 35368068 PMCID: PMC9930636 DOI: 10.1093/cercor/bhac133] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 02/08/2022] [Accepted: 03/07/2022] [Indexed: 11/14/2022] Open
Abstract
Temporally stable patterns of neural coordination among distributed brain regions are crucial for survival. Recently, many studies highlight association between healthy aging and modifications in organization of functional brain networks, across various time-scales. Nonetheless, quantitative characterization of temporal stability of functional brain networks across healthy aging remains unexplored. This study introduces a data-driven unsupervised approach to capture high-dimensional dynamic functional connectivity (dFC) via low-dimensional patterns and subsequent estimation of temporal stability using quantitative metrics. Healthy aging related changes in temporal stability of dFC were characterized across resting-state, movie-viewing, and sensorimotor tasks (SMT) on a large (n = 645) healthy aging dataset (18-88 years). Prominent results reveal that (1) whole-brain temporal dynamics of dFC movie-watching task is closer to resting-state than to SMT with an overall trend of highest temporal stability observed during SMT followed by movie-watching and resting-state, invariant across lifespan aging, (2) in both tasks conditions stability of neurocognitive networks in young adults is higher than older adults, and (3) temporal stability of whole brain resting-state follows a U-shaped curve along lifespan-a pattern shared by sensorimotor network stability indicating their deeper relationship. Overall, the results can be applied generally for studying cohorts of neurological disorders using neuroimaging tools.
Collapse
Affiliation(s)
- Nisha Chetana Sastry
- Cognitive Brain Dynamics Laboratory, National Brain Research Centre, NH 8, Manesar, Gurgaon 122052, India
| | - Dipanjan Roy
- School of Artificial Intelligence & Data Science, Centre for Brain Science & Applications, Indian Institute of Technology, Jodhpur NH 62, Surpura Bypass Rd, Karwar, Rajasthan 342030, India
| | - Arpan Banerjee
- Cognitive Brain Dynamics Laboratory, National Brain Research Centre, NH 8, Manesar, Gurgaon 122052, India
| |
Collapse
|
5
|
Mendonca-Neto R, Li Z, Fenyo D, Silva CT, Nakamura FG, Nakamura EF. A Gene Selection Method Based on Outliers for Breast Cancer Subtype Classification. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2547-2559. [PMID: 34860652 DOI: 10.1109/tcbb.2021.3132339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Breast cancer is the second most common cancer type and is the leading cause of cancer-related deaths worldwide. Since it is a heterogeneous disease, subtyping breast cancer plays an important role in performing a specific treatment. Gene expression data is a viable alternative to be employed on cancer subtype classification, as they represent the state of a cell at the molecular level, but generally has a relatively small number of samples compared to a large number of genes. Gene selection is a promising approach that addresses this uneven high-dimensional matrix of genes versus samples and plays an important role in the development of efficient cancer subtype classification. In this work, an innovative outlier-based gene selection (OGS) method is proposed to select relevant genes for efficiently and effectively classify breast cancer subtypes. Experiments show that our strategy presents an F1 score of 1.0 for basal and 0.86 for her 2, the two subtypes with the worst prognoses, respectively. Compared to other methods, our proposed method outperforms in the F1 score using 80% less genes. In general, our method selects only a few highly relevant genes, speeding up the classification, and significantly improving the classifier's performance.
Collapse
|
6
|
Stenzinger A, Alber M, Allgäuer M, Jurmeister P, Bockmayr M, Budczies J, Lennerz J, Eschrich J, Kazdal D, Schirmacher P, Wagner AH, Tacke F, Capper D, Müller KR, Klauschen F. Artificial intelligence and pathology: From principles to practice and future applications in histomorphology and molecular profiling. Semin Cancer Biol 2021; 84:129-143. [PMID: 33631297 DOI: 10.1016/j.semcancer.2021.02.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 01/29/2021] [Accepted: 02/16/2021] [Indexed: 02/07/2023]
Abstract
The complexity of diagnostic (surgical) pathology has increased substantially over the last decades with respect to histomorphological and molecular profiling. Pathology has steadily expanded its role in tumor diagnostics and beyond from disease entity identification via prognosis estimation to precision therapy prediction. It is therefore not surprising that pathology is among the disciplines in medicine with high expectations in the application of artificial intelligence (AI) or machine learning approaches given their capabilities to analyze complex data in a quantitative and standardized manner to further enhance scope and precision of diagnostics. While an obvious application is the analysis of histological images, recent applications for the analysis of molecular profiling data from different sources and clinical data support the notion that AI will enhance both histopathology and molecular pathology in the future. At the same time, current literature should not be misunderstood in a way that pathologists will likely be replaced by AI applications in the foreseeable future. Although AI will transform pathology in the coming years, recent studies reporting AI algorithms to diagnose cancer or predict certain molecular properties deal with relatively simple diagnostic problems that fall short of the diagnostic complexity pathologists face in clinical routine. Here, we review the pertinent literature of AI methods and their applications to pathology, and put the current achievements and what can be expected in the future in the context of the requirements for research and routine diagnostics.
Collapse
Affiliation(s)
- Albrecht Stenzinger
- Institute of Pathology, University Hospital Heidelberg, Im Neuenheimer Feld 224, Heidelberg, 69120, Germany; German Cancer Consortium (DKTK), Partner Site Heidelberg, and German Cancer Research Center (DKFZ), Heidelberg, Germany; German Center for Lung Research (DZL), Partner Site Heidelberg, Heidelberg, Germany.
| | - Maximilian Alber
- Institute of Pathology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, Berlin, Germany; Aignostics GmbH, Schumannstr. 17, Berlin, 10117, Germany
| | - Michael Allgäuer
- Institute of Pathology, University Hospital Heidelberg, Im Neuenheimer Feld 224, Heidelberg, 69120, Germany
| | - Philipp Jurmeister
- Institute of Pathology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, Berlin, Germany; German Cancer Consortium (DKTK), Partner Site Berlin, and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Michael Bockmayr
- Institute of Pathology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, Berlin, Germany; Department of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany; Research Institute, Children's Cancer Center Hamburg, Hamburg, Germany
| | - Jan Budczies
- Institute of Pathology, University Hospital Heidelberg, Im Neuenheimer Feld 224, Heidelberg, 69120, Germany; German Cancer Consortium (DKTK), Partner Site Heidelberg, and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jochen Lennerz
- Department of Pathology, Center for Integrated Diagnostics, Harvard Medical School, Massachusetts General Hospital, Boston, MA, USA
| | - Johannes Eschrich
- Department of Hepatology & Gastroenterology, Charité University Medical Center, Berlin, Germany
| | - Daniel Kazdal
- Institute of Pathology, University Hospital Heidelberg, Im Neuenheimer Feld 224, Heidelberg, 69120, Germany; German Center for Lung Research (DZL), Partner Site Heidelberg, Heidelberg, Germany
| | - Peter Schirmacher
- Institute of Pathology, University Hospital Heidelberg, Im Neuenheimer Feld 224, Heidelberg, 69120, Germany; German Cancer Consortium (DKTK), Partner Site Heidelberg, and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Alex H Wagner
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, 43205, USA; Department of Pediatrics, The Ohio State University, Columbus, OH, 43210, USA
| | - Frank Tacke
- Department of Hepatology & Gastroenterology, Charité University Medical Center, Berlin, Germany
| | - David Capper
- German Cancer Consortium (DKTK), Partner Site Berlin, and German Cancer Research Center (DKFZ), Heidelberg, Germany; Department of Neuropathology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, Berlin, 10587, Germany; Department of Artificial Intelligence, Korea University, Seoul, 136-713, South Korea; Max-Planck-Institute for Informatics, Saarland Informatics Campus E1 4, Saarbrücken, 66123, Germany; Google Research, Brain Team, Berlin, Germany.
| | - Frederick Klauschen
- Institute of Pathology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, Berlin, Germany; German Cancer Consortium (DKTK), Partner Site Berlin, and German Cancer Research Center (DKFZ), Heidelberg, Germany; Institute of Pathology, Ludwig-Maximilians-Universität München, Thalkirchner Strasse 36, München, 80337, Germany.
| |
Collapse
|
7
|
Kanza S, Bird CL, Niranjan M, McNeill W, Frey JG. The AI for Scientific Discovery Network . PATTERNS (NEW YORK, N.Y.) 2021; 2:100162. [PMID: 33511363 PMCID: PMC7815949 DOI: 10.1016/j.patter.2020.100162] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The Artificial Intelligence and Augmented Intelligence for Automated Investigation for Scientific Discovery Network+ (AI3SD) was established in response to the UK Engineering and Physical Sciences Research Council (EPSRC) late-2017 call for a Network+ to promote cutting-edge research in artificial intelligence to accelerate groundbreaking scientific discoveries. This article provides the philosophical, scientific, and technical underpinnings of the Network+, the history of the different domains represented in the Network+, and the specific focus of the Network+. The activities, collaborations, and research covered in the first year of the Network+ have highlighted the significant challenges in the chemistry and augmented and artificial intelligence space. These challenges are shaping the future directions of the Network+. The article concludes with a summary of the lessons learned in running this Network+ and introduces our plans for the future in a landscape redrawn by COVID-19, including rebranding into the AI 4 Scientific Discovery Network (www.ai4science.network).
Collapse
Affiliation(s)
- Samantha Kanza
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, UK
| | - Colin Leonard Bird
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, UK
| | - Mahesan Niranjan
- School of Electronics and Computer Science and University of Southampton, Southampton SO17 1BJ, UK
| | - William McNeill
- School of Humanities, University of Southampton, Southampton SO17 1BJ, UK
| | - Jeremy Graham Frey
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, UK
| |
Collapse
|