1
|
Vakili Ojarood M, Torabi H, Soltani A, Farzan R, Farhadi B. Machine learning as a hopeful indicator for prediction of complications and mortality in burn patients. Burns 2024; 50:1942-1946. [PMID: 38821726 DOI: 10.1016/j.burns.2024.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2024] [Accepted: 05/02/2024] [Indexed: 06/02/2024]
Affiliation(s)
| | - Hossein Torabi
- Department of General Surgery, Poursina Medical and Educational Center, School of Medicine, Guilan University of Medical Sciences, Rasht, Iran.
| | - Azadeh Soltani
- Department of Information Technology Engineering, Mehrastan University, Astaneh Ashrafieh, Iran.
| | - Ramyar Farzan
- Department of Plastic & Reconstructive Surgery, School of Medicine, Guilan University of Medical Sciences, Rasht, Iran.
| | - Bahar Farhadi
- School of Medicine, Islamic Azad University, Mashhad Branch, Mashhad, Iran.
| |
Collapse
|
2
|
Rashidi HH, Ikram A, Dang LT, Bashir A, Zohra T, Ali A, Tanvir H, Mudassar M, Ravindran R, Akhtar N, Sikandar RI, Umer M, Akhter N, Butt R, Fennell BD, Khan IH. Comparing machine learning screening approaches using clinical data and cytokine profiles for COVID-19 in resource-limited and resource-abundant settings. Sci Rep 2024; 14:14892. [PMID: 38937503 PMCID: PMC11211475 DOI: 10.1038/s41598-024-63707-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 05/31/2024] [Indexed: 06/29/2024] Open
Abstract
Accurate screening of COVID-19 infection status for symptomatic patients is a critical public health task. Although molecular and antigen tests now exist for COVID-19, in resource-limited settings, screening tests are often not available. Furthermore, during the early stages of the pandemic tests were not available in any capacity. We utilized an automated machine learning (ML) approach to train and evaluate thousands of models on a clinical dataset consisting of commonly available clinical and laboratory data, along with cytokine profiles for patients (n = 150). These models were then further tested for generalizability on an out-of-sample secondary dataset (n = 120). We were able to develop a ML model for rapid and reliable screening of patients as COVID-19 positive or negative using three approaches: commonly available clinical and laboratory data, a cytokine profile, and a combination of the common data and cytokine profile. Of the tens of thousands of models automatically tested for the three approaches, all three approaches demonstrated > 92% sensitivity and > 88 specificity while our highest performing model achieved 95.6% sensitivity and 98.1% specificity. These models represent a potential effective deployable solution for COVID-19 status classification for symptomatic patients in resource-limited settings and provide proof-of-concept for rapid development of screening tools for novel emerging infectious diseases.
Collapse
Affiliation(s)
- Hooman H Rashidi
- Computational Pathology and AI Center of Excellence (CPACE), University of Pittsburgh Medical Center, and University of Pittsburgh School of Medicine, Pittsburgh, USA.
| | - Aamer Ikram
- National Institutes of Health, Islamabad, Pakistan
| | - Luke T Dang
- Department of Pathology and Laboratory Medicine, University of California, 4400 V Street, DavisSacramento, CA, 95817, USA
| | - Adnan Bashir
- Health Information Systems Program (HISP), Islamabad, Pakistan
| | | | - Amna Ali
- National Institutes of Health, Islamabad, Pakistan
| | - Hamza Tanvir
- National Institutes of Health, Islamabad, Pakistan
| | | | - Resmi Ravindran
- Department of Pathology and Laboratory Medicine, University of California, 4400 V Street, DavisSacramento, CA, 95817, USA
| | - Nasim Akhtar
- Pakistan Institute of Medical Sciences, Islamabad, Pakistan
| | | | - Mohammed Umer
- Rawalpindi Medical University-Rawalpindi, Rawalpindi, Pakistan
| | - Naeem Akhter
- Rawalpindi Medical University-Rawalpindi, Rawalpindi, Pakistan
| | - Rafi Butt
- Isolation Hospital and Infectious Treatment Centre, Islamabad, Pakistan
| | - Brandon D Fennell
- Department of Medicine, University of California, San Francisco, USA
| | - Imran H Khan
- Department of Pathology and Laboratory Medicine, University of California, 4400 V Street, DavisSacramento, CA, 95817, USA.
| |
Collapse
|
3
|
Çubukçu HC, Topcu Dİ, Yenice S. Machine learning-based clinical decision support using laboratory data. Clin Chem Lab Med 2024; 62:793-823. [PMID: 38015744 DOI: 10.1515/cclm-2023-1037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 11/17/2023] [Indexed: 11/30/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) are becoming vital in laboratory medicine and the broader context of healthcare. In this review article, we summarized the development of ML models and how they contribute to clinical laboratory workflow and improve patient outcomes. The process of ML model development involves data collection, data cleansing, feature engineering, model development, and optimization. These models, once finalized, are subjected to thorough performance assessments and validations. Recently, due to the complexity inherent in model development, automated ML tools were also introduced to streamline the process, enabling non-experts to create models. Clinical Decision Support Systems (CDSS) use ML techniques on large datasets to aid healthcare professionals in test result interpretation. They are revolutionizing laboratory medicine, enabling labs to work more efficiently with less human supervision across pre-analytical, analytical, and post-analytical phases. Despite contributions of the ML tools at all analytical phases, their integration presents challenges like potential model uncertainties, black-box algorithms, and deskilling of professionals. Additionally, acquiring diverse datasets is hard, and models' complexity can limit clinical use. In conclusion, ML-based CDSS in healthcare can greatly enhance clinical decision-making. However, successful adoption demands collaboration among professionals and stakeholders, utilizing hybrid intelligence, external validation, and performance assessments.
Collapse
Affiliation(s)
- Hikmet Can Çubukçu
- General Directorate of Health Services, Rare Diseases Department, Turkish Ministry of Health, Ankara, Türkiye
- Hacettepe University Institute of Informatics, Ankara, Türkiye
| | - Deniz İlhan Topcu
- Health Sciences University İzmir Tepecik Education and Research Hospital, Medical Biochemistry, İzmir, Türkiye
| | - Sedef Yenice
- Florence Nightingale Hospital, Istanbul, Türkiye
| |
Collapse
|
4
|
Rashidi HH, Pepper J, Howard T, Klein K, May L, Albahra S, Phinney B, Salemi MR, Tran NK. Comparative performance of two automated machine learning platforms for COVID-19 detection by MALDI-TOF-MS. PLoS One 2022; 17:e0263954. [PMID: 35905092 PMCID: PMC9337631 DOI: 10.1371/journal.pone.0263954] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 06/25/2022] [Indexed: 11/19/2022] Open
Abstract
The 2019 novel coronavirus infectious disease (COVID-19) pandemic has resulted in an unsustainable need for diagnostic tests. Currently, molecular tests are the accepted standard for the detection of SARS-CoV-2. Mass spectrometry (MS) enhanced by machine learning (ML) has recently been postulated to serve as a rapid, high-throughput, and low-cost alternative to molecular methods. Automated ML is a novel approach that could move mass spectrometry techniques beyond the confines of traditional laboratory settings. However, it remains unknown how different automated ML platforms perform for COVID-19 MS analysis. To this end, the goal of our study is to compare algorithms produced by two commercial automated ML platforms (Platforms A and B). Our study consisted of MS data derived from 361 subjects with molecular confirmation of COVID-19 status including SARS-CoV-2 variants. The top optimized ML model with respect to positive percent agreement (PPA) within Platforms A and B exhibited an accuracy of 94.9%, PPA of 100%, negative percent agreement (NPA) of 93%, and an accuracy of 91.8%, PPA of 100%, and NPA of 89%, respectively. These results illustrate the MS method's robustness against SARS-CoV-2 variants and highlight similarities and differences in automated ML platforms in producing optimal predictive algorithms for a given dataset.
Collapse
Affiliation(s)
- Hooman H. Rashidi
- Robert. J. Tomsich Pathology and Laboratory Medicine Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - John Pepper
- Spectra Pass LLC & Allegiant Airlines, Las Vegas, Nevada, United States of America
| | - Taylor Howard
- Dept. of Pathology and Laboratory Medicine, UC Davis, Davis, California, United States of America
| | - Karina Klein
- Dept. of Emergency Medicine, UC Davis, Davis, California, United States of America
| | - Larissa May
- Dept. of Emergency Medicine, UC Davis, Davis, California, United States of America
| | - Samer Albahra
- Robert. J. Tomsich Pathology and Laboratory Medicine Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Brett Phinney
- Proteomics Core, UC Davis, Davis, California, United States of America
| | | | - Nam K. Tran
- Dept. of Pathology and Laboratory Medicine, UC Davis, Davis, California, United States of America
| |
Collapse
|
5
|
Rashidi H, Khan I, Dang L, Albahra S, Ratan U, Chadderwala N, To W, Srinivas P, Wajda J, Tran N. Prediction of tuberculosis using an automated machine learning platform for models trained on synthetic data. J Pathol Inform 2022; 13:10. [PMID: 35136677 PMCID: PMC8794034 DOI: 10.4103/jpi.jpi_75_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 11/18/2021] [Accepted: 11/30/2021] [Indexed: 11/15/2022] Open
Abstract
High-quality medical data is critical to the development and implementation of machine learning (ML) algorithms in healthcare; however, security, and privacy concerns continue to limit access. We sought to determine the utility of “synthetic data” in training ML algorithms for the detection of tuberculosis (TB) from inflammatory biomarker profiles. A retrospective dataset (A) comprised of 278 patients was used to generate synthetic datasets (B, C, and D) for training models prior to secondary validation on a generalization dataset. ML models trained and validated on the Dataset A (real) demonstrated an accuracy of 90%, a sensitivity of 89% (95% CI, 83–94%), and a specificity of 100% (95% CI, 81–100%). Models trained using the optimal synthetic dataset B showed an accuracy of 91%, a sensitivity of 93% (95% CI, 87–96%), and a specificity of 77% (95% CI, 50–93%). Synthetic datasets C and D displayed diminished performance measures (respective accuracies of 71% and 54%). This pilot study highlights the promise of synthetic data as an expedited means for ML algorithm development.
Collapse
|
6
|
Rashidi HH, Dang LT, Albahra S, Ravindran R, Khan IH. Automated machine learning for endemic active tuberculosis prediction from multiplex serological data. Sci Rep 2021; 11:17900. [PMID: 34504228 PMCID: PMC8429671 DOI: 10.1038/s41598-021-97453-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 08/25/2021] [Indexed: 11/09/2022] Open
Abstract
Serological diagnosis of active tuberculosis (TB) is enhanced by detection of multiple antibodies due to variable immune responses among patients. Clinical interpretation of these complex datasets requires development of suitable algorithms, a time consuming and tedious undertaking addressed by the automated machine learning platform MILO (Machine Intelligence Learning Optimizer). MILO seamlessly integrates data processing, feature selection, model training, and model validation to simultaneously generate and evaluate thousands of models. These models were then further tested for generalizability on out-of-sample secondary and tertiary datasets. Out of 31 antigens evaluated, a 23-antigen model was the most robust on both the secondary dataset (TB vs healthy) and the tertiary dataset (TB vs COPD) with sensitivity of 90.5% and respective specificities of 100.0% and 74.6%. MILO represents a user-friendly, end-to-end solution for automated generation and deployment of optimized models, ideal for applications where rapid clinical implementation is critical such as emerging infectious diseases.
Collapse
Affiliation(s)
- Hooman H Rashidi
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V Street, Sacramento, CA, 95817, USA.
| | - Luke T Dang
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V Street, Sacramento, CA, 95817, USA
| | - Samer Albahra
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V Street, Sacramento, CA, 95817, USA
| | - Resmi Ravindran
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V Street, Sacramento, CA, 95817, USA
| | - Imran H Khan
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V Street, Sacramento, CA, 95817, USA.
| |
Collapse
|
7
|
Leditzke K, Wagner MEH, Neunaber C, Clausen JD, Winkelmann M. Neutrophil Gelatinase-associated Lipocalin Predicts Post-traumatic Acute Kidney Injury in Severely Injured Patients. In Vivo 2021; 35:2755-2762. [PMID: 34410965 DOI: 10.21873/invivo.12560] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Revised: 07/02/2021] [Accepted: 07/08/2021] [Indexed: 12/12/2022]
Abstract
BACKGROUND Early detection of acute kidney injury (AKI) is crucial in the management of multiple-organ dysfunction syndrome in severely injured patients. Standard laboratory parameters usually increase with temporal delay. Therefore, we evaluated neutrophil gelatinase-associated lipocalin (NGAL) as an early marker for acute kidney injury. PATIENTS AND METHODS We retrospectively evaluated patients admitted to a level 1 trauma center. We collected clinicodemographic data and measured kidney-related factors and plasma cytokines. RESULTS A total of 39 patients were included. Patients with AKI had significantly higher levels not only of serum creatinine and urea, but also of NGAL (all p<0.001) than patients without AKI. The optimal NGAL cut-off value was determined to be 177 ng/ml, showing significant correlation with imminent or manifest AKI (p<0.001). Other independent markers correlated with AKI included pre-existing chronic kidney disease, use of catecholamines, and severe injury (p<0.001). CONCLUSION The serum level of NGAL is feasible early predictor of AKI.
Collapse
|
8
|
Tran NK, Howard T, Walsh R, Pepper J, Loegering J, Phinney B, Salemi MR, Rashidi HH. Novel application of automated machine learning with MALDI-TOF-MS for rapid high-throughput screening of COVID-19: a proof of concept. Sci Rep 2021; 11:8219. [PMID: 33859233 PMCID: PMC8050054 DOI: 10.1038/s41598-021-87463-w] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 03/23/2021] [Indexed: 01/05/2023] Open
Abstract
The 2019 novel coronavirus infectious disease (COVID-19) pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has created an unsustainable need for molecular diagnostic testing. Molecular approaches such as reverse transcription (RT) polymerase chain reaction (PCR) offers highly sensitive and specific means to detect SARS-CoV-2 RNA, however, despite it being the accepted "gold standard", molecular platforms often require a tradeoff between speed versus throughput. Matrix assisted laser desorption ionization (MALDI)-time of flight (TOF)-mass spectrometry (MS) has been proposed as a potential solution for COVID-19 testing and finding a balance between analytical performance, speed, and throughput, without relying on impacted supply chains. Combined with machine learning (ML), this MALDI-TOF-MS approach could overcome logistical barriers encountered by current testing paradigms. We evaluated the analytical performance of an ML-enhanced MALDI-TOF-MS method for screening COVID-19. Residual nasal swab samples from adult volunteers were used for testing and compared against RT-PCR. Two optimized ML models were identified, exhibiting accuracy of 98.3%, positive percent agreement (PPA) of 100%, negative percent agreement (NPA) of 96%, and accuracy of 96.6%, PPA of 98.5%, and NPA of 94% respectively. Machine learning enhanced MALDI-TOF-MS for COVID-19 testing exhibited performance comparable to existing commercial SARS-CoV-2 tests.
Collapse
Affiliation(s)
- Nam K Tran
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V St., Sacramento, CA, 95817, USA.
| | - Taylor Howard
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V St., Sacramento, CA, 95817, USA
| | - Ryan Walsh
- Shimadzu North America/Shimadzu Scientific Instruments, Inc., Baltimore, USA
| | - John Pepper
- Spectra Pass, LLC and Allegiant Airlines, Las Vegas, USA
| | - Julia Loegering
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V St., Sacramento, CA, 95817, USA
| | - Brett Phinney
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V St., Sacramento, CA, 95817, USA
| | - Michelle R Salemi
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V St., Sacramento, CA, 95817, USA
| | - Hooman H Rashidi
- Department of Pathology and Laboratory Medicine, University of California Davis, 4400 V St., Sacramento, CA, 95817, USA.
| |
Collapse
|