1
|
Cao Y, Li Y, Wang M, Wang L, Fang Y, Wu Y, Liu Y, Liu Y, Hao Z, Kang H, Gao H. INTERPRETABLE MACHINE LEARNING FOR PREDICTING RISK OF INVASIVE FUNGAL INFECTION IN CRITICALLY ILL PATIENTS IN THE INTENSIVE CARE UNIT: A RETROSPECTIVE COHORT STUDY BASED ON MIMIC-IV DATABASE. Shock 2024; 61:817-827. [PMID: 38407989 DOI: 10.1097/shk.0000000000002312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2024]
Abstract
ABSTRACT The delayed diagnosis of invasive fungal infection (IFI) is highly correlated with poor prognosis in patients. Early identification of high-risk patients with invasive fungal infections and timely implementation of targeted measures is beneficial for patients. The objective of this study was to develop a machine learning-based predictive model for invasive fungal infection in patients during their intensive care unit (ICU) stay. Retrospective data was extracted from adult patients in the MIMIC-IV database who spent a minimum of 48 h in the ICU. Feature selection was performed using LASSO regression, and the dataset was balanced using the BL-SMOTE approach. Predictive models were built using six machine learning algorithms. The Shapley additive explanation algorithm was used to assess the impact of various clinical features in the optimal model, enhancing interpretability. The study included 26,346 ICU patients, of whom 379 (1.44%) were diagnosed with invasive fungal infection. The predictive model was developed using 20 risk factors, and the dataset was balanced using the borderline-SMOTE (BL-SMOTE) algorithm. The BL-SMOTE random forest model demonstrated the highest predictive performance (area under curve = 0.88, 95% CI = 0.84-0.91). Shapley additive explanation analysis revealed that the three most influential clinical features in the BL-SMOTE random forest model were dialysis treatment, APSIII scores, and liver disease. The machine learning model provides a reliable tool for predicting the occurrence of IFI in ICU patients. The BL-SMOTE random forest model, based on 20 risk factors, exhibited superior predictive performance and can assist clinicians in early assessment of IFI occurrence in ICU patients. Importance: Invasive fungal infections are characterized by high incidence and high mortality rates characteristics. In this study, we developed a clinical prediction model for invasive fungal infections in critically ill patients based on machine learning algorithms. The results show that the machine learning model based on 20 clinical features has good predictive value.
Collapse
Affiliation(s)
- Yuan Cao
- Emergency Department, The Second Hospital of Hebei Medical University, Shijiazhuang, China
| | | | | | | | - Yuan Fang
- Department of Critical Care Medicine, the First Medical Centre, Chinese PLA General Hospital, Beijing, China
| | | | | | - Yixuan Liu
- Emergency Department, The Second Hospital of Hebei Medical University, Shijiazhuang, China
| | - Ziqian Hao
- Emergency Department, The Second Hospital of Hebei Medical University, Shijiazhuang, China
| | - Hongjun Kang
- Department of Critical Care Medicine, the First Medical Centre, Chinese PLA General Hospital, Beijing, China
| | - Hengbo Gao
- Emergency Department, The Second Hospital of Hebei Medical University, Shijiazhuang, China
| |
Collapse
|
2
|
Giacobbe DR, Marelli C, Mora S, Guastavino S, Russo C, Brucci G, Limongelli A, Vena A, Mikulska M, Tayefi M, Peluso S, Signori A, Di Biagio A, Marchese A, Campi C, Giacomini M, Bassetti M. Early diagnosis of candidemia with explainable machine learning on automatically extracted laboratory and microbiological data: results of the AUTO-CAND project. Ann Med 2023; 55:2285454. [PMID: 38010342 PMCID: PMC10836245 DOI: 10.1080/07853890.2023.2285454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 11/13/2023] [Indexed: 11/29/2023] Open
Abstract
BACKGROUND Candidemia is associated with a heavy burden of morbidity and mortality in hospitalized patients. The availability of blood culture results could require up to 48-72 h after blood draw; thus, early treatment decisions are made in the absence of a definite diagnosis. METHODS In this retrospective study, we assessed the performance of different supervised machine learning algorithms for the early differential diagnosis of candidemia and bacteremia in adult patients on a large dataset automatically extracted within the AUTO-CAND project. RESULTS Overall, 12,483 episodes of candidemia (1275; 10%) or bacteremia (11,208; 90%) were included in the analysis. A random forest classifier achieved the best diagnostic performance for candidemia, with sensitivity 0.98 and specificity 0.65 on the training set (true skill statistic [TSS] = 0.63) and sensitivity 0.74 and specificity 0.57 on the test set (TSS = 0.31). Then, the random classifier was trained in the subgroup of patients with available serum β-D-glucan (BDG) and procalcitonin (PCT) values by exploiting the feature ranking learned in the entire dataset. Although no statistically significant differences were observed from the performance measures obtained by employing BDG and PCT alone, the performance measures of the classifier that included the features selected in the entire dataset, plus BDG and PCT, were the highest in most cases. CONCLUSIONS Random forest classifiers trained on large datasets of automatically extracted data have the potential to improve current diagnostic algorithms for candidemia. However, further development through implementation of automatically extracted clinical features may be necessary to achieve crucial improvements.
Collapse
Affiliation(s)
- Daniele Roberto Giacobbe
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Cristina Marelli
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Sara Mora
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
| | | | - Chiara Russo
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Giorgia Brucci
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Alessandro Limongelli
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Antonio Vena
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Malgorzata Mikulska
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Maryam Tayefi
- Norwegian Centre for E-Health Research, Tromsø, Norway
| | - Stefano Peluso
- Department of Statistics and Quantitative Methods, University of Milan - Bicocca, Milan, Italy
| | - Alessio Signori
- Section of Biostatistics, Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
| | - Antonio Di Biagio
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Anna Marchese
- Department of Surgical Sciences and Integrated Diagnostics (DISC), University of Genoa, Genoa, Italy
- Microbiology Unit, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Cristina Campi
- Department of Mathematics (DIMA), University of Genoa, Genoa, Italy
- Life Science Computational Laboratory (LISCOMP), IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Mauro Giacomini
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
| | - Matteo Bassetti
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| |
Collapse
|
3
|
Fernandes GJ, Choi A, Schauer JM, Pfammatter AF, Spring BJ, Darwiche A, Alshurafa NI. An Explainable Artificial Intelligence Software Tool for Weight Management Experts (PRIMO): Mixed Methods Study. J Med Internet Res 2023; 25:e42047. [PMID: 37672333 PMCID: PMC10512114 DOI: 10.2196/42047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 01/27/2023] [Accepted: 04/20/2023] [Indexed: 09/07/2023] Open
Abstract
BACKGROUND Predicting the likelihood of success of weight loss interventions using machine learning (ML) models may enhance intervention effectiveness by enabling timely and dynamic modification of intervention components for nonresponders to treatment. However, a lack of understanding and trust in these ML models impacts adoption among weight management experts. Recent advances in the field of explainable artificial intelligence enable the interpretation of ML models, yet it is unknown whether they enhance model understanding, trust, and adoption among weight management experts. OBJECTIVE This study aimed to build and evaluate an ML model that can predict 6-month weight loss success (ie, ≥7% weight loss) from 5 engagement and diet-related features collected over the initial 2 weeks of an intervention, to assess whether providing ML-based explanations increases weight management experts' agreement with ML model predictions, and to inform factors that influence the understanding and trust of ML models to advance explainability in early prediction of weight loss among weight management experts. METHODS We trained an ML model using the random forest (RF) algorithm and data from a 6-month weight loss intervention (N=419). We leveraged findings from existing explainability metrics to develop Prime Implicant Maintenance of Outcome (PRIMO), an interactive tool to understand predictions made by the RF model. We asked 14 weight management experts to predict hypothetical participants' weight loss success before and after using PRIMO. We compared PRIMO with 2 other explainability methods, one based on feature ranking and the other based on conditional probability. We used generalized linear mixed-effects models to evaluate participants' agreement with ML predictions and conducted likelihood ratio tests to examine the relationship between explainability methods and outcomes for nested models. We conducted guided interviews and thematic analysis to study the impact of our tool on experts' understanding and trust in the model. RESULTS Our RF model had 81% accuracy in the early prediction of weight loss success. Weight management experts were significantly more likely to agree with the model when using PRIMO (χ2=7.9; P=.02) compared with the other 2 methods with odds ratios of 2.52 (95% CI 0.91-7.69) and 3.95 (95% CI 1.50-11.76). From our study, we inferred that our software not only influenced experts' understanding and trust but also impacted decision-making. Several themes were identified through interviews: preference for multiple explanation types, need to visualize uncertainty in explanations provided by PRIMO, and need for model performance metrics on similar participant test instances. CONCLUSIONS Our results show the potential for weight management experts to agree with the ML-based early prediction of success in weight loss treatment programs, enabling timely and dynamic modification of intervention components to enhance intervention effectiveness. Our findings provide methods for advancing the understandability and trust of ML models among weight management experts.
Collapse
Affiliation(s)
- Glenn J Fernandes
- Department of Computer Science, Northwestern University, Evanston, IL, United States
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
| | - Arthur Choi
- Department of Computer Science, Kennesaw State University, Kennesaw, GA, United States
| | - Jacob Michael Schauer
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
| | - Angela F Pfammatter
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
| | - Bonnie J Spring
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
| | - Adnan Darwiche
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, United States
| | - Nabil I Alshurafa
- Department of Computer Science, Northwestern University, Evanston, IL, United States
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
| |
Collapse
|
4
|
Validation of an Automated System for the Extraction of a Wide Dataset for Clinical Studies Aimed at Improving the Early Diagnosis of Candidemia. Diagnostics (Basel) 2023; 13:diagnostics13050961. [PMID: 36900105 PMCID: PMC10001256 DOI: 10.3390/diagnostics13050961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 02/26/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023] Open
Abstract
There is increasing interest in assessing whether machine learning (ML) techniques could further improve the early diagnosis of candidemia among patients with a consistent clinical picture. The objective of the present study is to validate the accuracy of a system for the automated extraction from a hospital laboratory software of a large number of features from candidemia and/or bacteremia episodes as the first phase of the AUTO-CAND project. The manual validation was performed on a representative and randomly extracted subset of episodes of candidemia and/or bacteremia. The manual validation of the random extraction of 381 episodes of candidemia and/or bacteremia, with automated organization in structured features of laboratory and microbiological data resulted in ≥99% correct extractions (with confidence interval < ±1%) for all variables. The final automatically extracted dataset consisted of 1338 episodes of candidemia (8%), 14,112 episodes of bacteremia (90%), and 302 episodes of mixed candidemia/bacteremia (2%). The final dataset will serve to assess the performance of different ML models for the early diagnosis of candidemia in the second phase of the AUTO-CAND project.
Collapse
|
5
|
Kim SH, Mun SJ, Kang JS, Moon C, Kim HT, Lee HY. Multifaceted Evaluation of Antibiotic Therapy as a Factor Associated with Candidemia in Non-Neutropenic Patients. J Fungi (Basel) 2023; 9:jof9020270. [PMID: 36836385 PMCID: PMC9960229 DOI: 10.3390/jof9020270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/04/2023] [Accepted: 02/12/2023] [Indexed: 02/22/2023] Open
Abstract
We aimed to evaluate various aspects of antibiotic therapy as factors associated with candidemia in non-neutropenic patients. A retrospective, matched, case-control study was conducted in two teaching hospitals. Patients with candidemia (cases) were compared to patients without candidemia (controls), matched by age, intensive care unit admission, duration of hospitalization, and type of surgery. Logistic regression analyses were performed to identify factors associated with candidemia. A total of 246 patients were included in the study. Of 123 candidemia patients, 36% had catheter-related bloodstream infections (CRBSIs). Independent factors in the whole population included immunosuppression (adjusted odds ratio [aOR] = 2.195; p = 0.036), total parenteral nutrition (aOR = 3.642; p < 0.001), and anti-methicillin-resistant S. aureus (MRSA) therapy for ≥11 days (aOR = 5.151; p = 0.004). The antibiotic factor in the non-CRBSI population was anti-pseudomonal beta-lactam treatment duration of ≥3 days (aOR = 5.260; p = 0.008). The antibiotic factors in the CRBSI population included anti-MRSA therapy for ≥11 days (aOR = 10.031; p = 0.019). Antimicrobial stewardship that reduces exposure to these antibacterial spectra could help prevent the development of candidemia.
Collapse
Affiliation(s)
- Si-Ho Kim
- Division of Infectious Diseases, Samsung Changwon Hospital, Sungkyunkwan University School of Medicine, Changwon 51353, Republic of Korea
| | - Seok Jun Mun
- Division of Infectious Diseases, Department of Internal Medicine, Inje University Busan Paik Hospital, Inje University College of Medicine, 75 Bokji-ro, Busanjin-gu, Busan 47392, Republic of Korea
- Correspondence: ; Tel.: +82-51-890-6986; Fax: +82-51-890-6341
| | - Jin Suk Kang
- Division of Infectious Diseases, Department of Internal Medicine, Inje University Busan Paik Hospital, Inje University College of Medicine, 75 Bokji-ro, Busanjin-gu, Busan 47392, Republic of Korea
| | - Chisook Moon
- Division of Infectious Diseases, Department of Internal Medicine, Inje University Busan Paik Hospital, Inje University College of Medicine, 75 Bokji-ro, Busanjin-gu, Busan 47392, Republic of Korea
| | - Hyoung-Tae Kim
- Department of Laboratory Medicine, Samsung Changwon Hospital, Sungkyunkwan University School of Medicine, Changwon 51353, Republic of Korea
| | - Ho Young Lee
- Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Inje University Busan Paik Hospital, Inje University College of Medicine, Busan 47392, Republic of Korea
| |
Collapse
|
6
|
Prediction of Prognostic Risk Factors in Patients with Invasive Candidiasis and Cancer: A Single-Centre Retrospective Study. BIOMED RESEARCH INTERNATIONAL 2022; 2022:7896218. [PMID: 35692595 PMCID: PMC9185171 DOI: 10.1155/2022/7896218] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 05/09/2022] [Accepted: 05/16/2022] [Indexed: 11/18/2022]
Abstract
Background Invasive candidiasis is a common cancer-related complication with a high fatality rate. If patients with a high risk of dying in the hospital are identified early and accurately, physicians can make better clinical judgments. However, epidemiological analyses and mortality prediction models of cancer patients with invasive candidiasis remain limited. Method A set of 40 potential risk factors was acquired in a sample of 258 patients with both invasive candidiasis and cancer. To begin, risk factors for Candida albicans vs. non-Candida albicans infections and persistent vs. nonpersistent Candida infections were analysed using classic statistical methods. Then, we applied three machine learning models (random forest, logistic regression, and support vector machine) to identify prognostic indicators related to mortality. Prediction performance of different models was assessed by precision, recall, F1 score, accuracy, and AUC. Results Of the 258 patients both with invasive candidiasis and cancer included in the analysis. The median age of patients was 62 years, and 95 (36.82%) patients were older than 65 years, of which 178 (66.28%) were male. And 186 (72.1%) patients underwent surgery 2 weeks before data collection, 100 (39.1%) patients stayed in ICU during hospitalisation, 99 (38.4%) patients had bacterial blood infection, 85 (32.9%) patients had persistent invasive candidiasis, and 41 (15.9%) patients died within 30 days. The usage of drainage catheter and prolonged length of hospitalisation are the dominant risk factors for non-Candida albicans infections and persistent Candida infections, respectively. Risk factors, such as septic shock, history of surgery within the past 2 weeks, usage of drainage tubes, length of stay in ICU, total parenteral nutrition, serum creatinine level, fungal antigen, stay in ICU during hospitalisation, and total bilirubin level, were significant predictors of death. The RF model outperformed the LR and SVM models. Precision, recall, F1 score, accuracy, and AUC for RF were 64.29%, 75.63%, 69.23%, 89.61%, and 91.28%. Conclusions In this study, the machine learning-based models accurately predicted the prognosis of cancer and invasive candidiasis patients. The algorithm could be used to help clinicians in high-risk patients' early intervention.
Collapse
|
7
|
Synthesis and Study of Antifungal Properties of New Cationic Beta-Glucan Derivatives. Pharmaceuticals (Basel) 2021; 14:ph14090838. [PMID: 34577538 PMCID: PMC8469811 DOI: 10.3390/ph14090838] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 01/12/2023] Open
Abstract
The interaction of positively charged polymers (polycations) with a biological membrane is considered to be the cause of the frequently observed toxicity of these macromolecules. If it is possible to obtain polymers with a predominantly negative effect on bacterial and fungal cells, such systems would have great potential in the treatment of infectious diseases, especially now when reports indicate the growing risk of fungal co-infections in COVID-19 patients. We describe in this article cationic derivatives of natural beta-glucan polymers obtained by reacting the polysaccharide isolated from Saccharomyces boulardii (SB) and Cetraria islandica (CI) with glycidyl trimethyl ammonium chloride (GTMAC). Two synthesis strategies were applied to optimize the product yield. Fungal diseases particularly affect low-income countries, hence the emphasis on the simplicity of the synthesis of such drugs so they can be produced without outside help. The three structures obtained showed selective anti-mycotic properties (against, i.e., Scopulariopsis brevicaulis, Aspergillus brasiliensis, and Fusarium solani), and their toxicity established using fibroblast 3T3-L1 cell line was negligible in a wide range of concentrations. For one of the polymers (SB derivative), using in vivo model of Aspergillus brasiliensis infection in Galleria mellonella insect model, we confirmed the promising results obtained in the preliminary study.
Collapse
|