1
|
Yu Q, Zhang L, Ma Q, Da L, Li J, Li W. Predicting all-cause mortality and premature death using interpretable machine learning among a middle-aged and elderly Chinese population. Heliyon 2024; 10:e36878. [PMID: 39281518 PMCID: PMC11399635 DOI: 10.1016/j.heliyon.2024.e36878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Revised: 08/15/2024] [Accepted: 08/23/2024] [Indexed: 09/18/2024] Open
Abstract
Objective To develop machine learning-based prediction models for all-cause and premature mortality among the middle-aged and elderly population in China. Method Adults aged 45 years or older at baseline of 2011 from the China Health and Retirement Longitudinal Study (CHARLS) were included. The stacked ensemble model was built utilizing five selected machine learning algorithms. These models underwent training and testing using the CHARLS 2011-2015 cohort (derivation cohort) and subsequently underwent external validation using the CHARLS 2015-2018 cohort (validation cohort). SHapley Additive exPlanations (SHAP) was introduced to quantify the importance of risk factors and explain machine learning algorithms. Result In derivation cohort, a total of 10,677 subjects were included, 478 died during the follow-up. The stacked ensemble model demonstrated the highest efficacy in terms of its discrimination capability for predicting all-cause mortality and premature death, with an AUC[95 % CI] of 0.826[0.792-0.859] and 0.773[0.725-0.821], respectively. In validation cohort, the corresponding AUC[95 % CI] were 0.803[0.743-0.864] and 0.791[0.719-0.863], respectively. Risk factors including age, sex, self-reported health, activities of daily living, cognitive function, ever smoker, levels of systolic blood pressure, Cystatin C and low density lipoprotein were strong predictors for both all-cause mortality and premature death. Conclusion Stacked ensemble models performed well in predicting all-cause and premature death in this Chinese cohort. Interpretable techniques can aid in identifying significant risk factors and non-linear relationships between predictors and mortality.
Collapse
Affiliation(s)
- Qi Yu
- Center of Clinical Big Data and Analytics of The Second Affiliated Hospital and Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, Zhejiang, China
| | - Lingzhi Zhang
- Center of Clinical Big Data and Analytics of The Second Affiliated Hospital and Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, Zhejiang, China
| | - Qian Ma
- Center of Clinical Big Data and Analytics of The Second Affiliated Hospital and Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, Zhejiang, China
| | - Lijuan Da
- Center of Clinical Big Data and Analytics of The Second Affiliated Hospital and Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, Zhejiang, China
| | - Jiahui Li
- Center of Clinical Big Data and Analytics of The Second Affiliated Hospital and Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, Zhejiang, China
| | - Wenyuan Li
- Center of Clinical Big Data and Analytics of The Second Affiliated Hospital and Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, Zhejiang, China
| |
Collapse
|
2
|
Yang X, Li Y, Mei T, Duan J, Yan X, McNaughton LR, He Z. Genome-wide association study of exercise-induced skeletal muscle hypertrophy and the construction of predictive model. Physiol Genomics 2024; 56:578-589. [PMID: 38881426 DOI: 10.1152/physiolgenomics.00019.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 05/21/2024] [Accepted: 06/10/2024] [Indexed: 06/18/2024] Open
Abstract
The aim of the current study was to investigate interindividual differences in muscle thickness of the rectus femoris (MTRF) following 12 wk of resistance training (RT) or high-intensity interval training (HIIT) to explore the genetic architecture underlying skeletal muscle hypertrophy and to construct predictive models. We conducted musculoskeletal ultrasound assessments of the MTRF response in 440 physically inactive adults after the 12-wk exercise period. A genome-wide association study was used to identify variants associated with the MTRF response, separately for RT and HIIT. Using the polygenic predictor score (PPS), we estimated the genetic contribution to exercise-induced hypertrophy. Predictive models for the MTRF response were constructed using random forest (RF), support vector mac (SVM), and generalized linear model (GLM) in 10 cross-validated approaches. MTRF increased significantly after both RT (8.8%, P < 0.05) and HIIT (5.3%, P < 0.05), but with considerable interindividual differences (RT: -13.5 to 38.4%, HIIT: -14.2 to 30.7%). Eleven lead single-nucleotide polymorphisms in RT and eight lead single-nucleotide polymorphisms in HIIT were identified at a significance level of P < 1 × 10-5. The PPS was associated with the MTRF response, explaining 47.2% of the variation in response to RT and 38.3% of the variation in response to HIIT. Notably, the GLM and SVM predictive models exhibited superior performance compared with RF models (P < 0.05), and the GLM demonstrated optimal performance with an area under curve of 0.809 (95% confidence interval: 0.669-0.949). Factors such as PPS, baseline MTRF, and exercise protocol exerted influence on the MTRF response to exercise, with PPS being the primary contributor. The GLM and SVM predictive model, incorporating both genetic and phenotypic factors, emerged as promising tools for predicting exercise-induced skeletal muscle hypertrophy.NEW & NOTEWORTHY The interindividual variability induced muscle hypertrophy by resistance training (RT) or high-intensity interval training (HIIT) and the associated genetic architecture remain uncertain. We identified genetic variants that underlie RT- or HIIT-induced muscle hypertrophy and established them as pivotal factors influencing the response regardless of the training type. The genetic-phenotype predictive model developed has the potential to identify nonresponders or individuals with low responsiveness before engaging in exercise training.
Collapse
Affiliation(s)
- Xiaolin Yang
- China Institute of Sport and Health Science, Beijing Sport University, Beijing, China
- Key Laboratory for Performance Training and Recovery of General Administration of Sport, Beijing Sport University, Beijing, China
| | - Yanchun Li
- China Institute of Sport and Health Science, Beijing Sport University, Beijing, China
- Key Laboratory for Performance Training and Recovery of General Administration of Sport, Beijing Sport University, Beijing, China
| | - Tao Mei
- China Institute of Sport and Health Science, Beijing Sport University, Beijing, China
- Key Laboratory for Performance Training and Recovery of General Administration of Sport, Beijing Sport University, Beijing, China
| | - Jiayan Duan
- China Institute of Sport and Health Science, Beijing Sport University, Beijing, China
| | - Xu Yan
- Institute for Health and Sport, Victoria University, Melbourne, Victoria, Australia
- Regenerative Medicine and Stem Cells Program, Australian Institute for Musculoskeletal Science, St Albans, Victoria, Australia
| | - Lars Robert McNaughton
- Sport Performance, Exercise and Nutrition Research Group, Department of Sport and Physical Activity, Edge Hill University, Ormskirk, United Kingdom
| | - Zihong He
- Biology Center, China Institute of Sport Science, Beijing, China
| |
Collapse
|
3
|
Bonikowske AR, Taylor JL, Larson KF, Hardwick J, Ozemek C, Harber MP, Kaminsky LA, Arena R, Lavie CJ. Evaluating current assessment techniques of cardiorespiratory fitness. Expert Rev Cardiovasc Ther 2024; 22:231-241. [PMID: 38855917 DOI: 10.1080/14779072.2024.2363393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 05/30/2024] [Indexed: 06/11/2024]
Abstract
INTRODUCTION Considerable and convincing global data from cohorts across the health spectrum (i.e. apparently healthy to known disease) indicate that cardiorespiratory fitness (CRF) is a major predictor of overall and cardiovascular disease (CVD)-survival, seemingly with greater prognostic resolution compared to other traditional CVD risk factors. Therefore, the assessment of CRF in research and clinical settings is of major importance. AREAS COVERED In this manuscript, we review the technology of measuring CRF assessed by the 'gold standard,' cardiopulmonary exercise testing (CPET), as well as with various other methods (e.g. estimated metabolic equivalents, 6-minute walk tests, shuttle tests, and non-exercise equations that estimate CRF), all of which provide significant prognostic information for CVD- and all-cause survival. The literature through May 2024 has been cited. EXPERT OPINION The promotion of physical activity in efforts to improve levels of CRF is needed throughout the world to improve lifespan and, more importantly, healthspan. The routine assessment of CRF should be considered a vital sign that is routinely assessed in clinical practice.
Collapse
Affiliation(s)
| | - Jenna L Taylor
- School of Human Movement and Nutrition Sciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Kathryn F Larson
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
| | - Joel Hardwick
- Department of Physical Therapy, College of Applied Science, University of Illinois, Chicago, IL, USA
| | - Cemal Ozemek
- Department of Physical Therapy, College of Applied Science, University of Illinois, Chicago, IL, USA
| | - Matthew P Harber
- Clinical Exercise Physiology, Ball State University, Muncie, IN, USA
| | - Lenny A Kaminsky
- Clinical Exercise Physiology, Ball State University, Muncie, IN, USA
| | - Ross Arena
- Department of Physical Therapy, College of Applied Science, University of Illinois, Chicago, IL, USA
- Healthy Living for Pandemic Event Protection (HL - PIVOT) Network, Chicago, IL, USA
| | - Carl J Lavie
- John Ochsner Heart and Vascular Institute, Ochsner Clinical School-the UQ School of Medicine, New Orleans, LA, USA
| |
Collapse
|
4
|
Noble PA, Hamilton BD, Gerber G. Stone decision engine accurately predicts stone removal and treatment complications for shock wave lithotripsy and laser ureterorenoscopy patients. PLoS One 2024; 19:e0301812. [PMID: 38696418 PMCID: PMC11065282 DOI: 10.1371/journal.pone.0301812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 03/24/2024] [Indexed: 05/04/2024] Open
Abstract
Kidney stones form when mineral salts crystallize in the urinary tract. While most stones exit the body in the urine stream, some can block the ureteropelvic junction or ureters, leading to severe lower back pain, blood in the urine, vomiting, and painful urination. Imaging technologies, such as X-rays or ureterorenoscopy (URS), are typically used to detect kidney stones. Subsequently, these stones are fragmented into smaller pieces using shock wave lithotripsy (SWL) or laser URS. Both treatments yield subtly different patient outcomes. To predict successful stone removal and complication outcomes, Artificial Neural Network models were trained on 15,126 SWL and 2,116 URS patient records. These records include patient metrics like Body Mass Index and age, as well as treatment outcomes obtained using various medical instruments and healthcare professionals. Due to the low number of outcome failures in the data (e.g., treatment complications), Nearest Neighbor and Synthetic Minority Oversampling Technique (SMOTE) models were implemented to improve prediction accuracies. To reduce noise in the predictions, ensemble modeling was employed. The average prediction accuracies based on Confusion Matrices for SWL stone removal and treatment complications were 84.8% and 95.0%, respectively, while those for URS were 89.0% and 92.2%, respectively. The average prediction accuracies for SWL based on Area-Under-the-Curve were 74.7% and 62.9%, respectively, while those for URS were 77.2% and 78.9%, respectively. Taken together, the approach yielded moderate to high accurate predictions, regardless of treatment or outcome. These models were incorporated into a Stone Decision Engine web application (http://peteranoble.com/webapps.html) that suggests the best interventions to healthcare providers based on individual patient metrics.
Collapse
Affiliation(s)
- Peter A. Noble
- Department of Microbiology, University of Alabama Birmingham, Birmingham, AL, United States of America
| | - Blake D. Hamilton
- School of Medicine, University of Utah, Salt Lake City, UT, United States of America
| | - Glenn Gerber
- University of Chicago Medical Center, Chicago, IL, United States of America
| |
Collapse
|
5
|
Elshawi R, Sakr S, Al-Mallah MH, Keteyian SJ, Brawner CA, Ehrman JK. FIT calculator: a multi-risk prediction framework for medical outcomes using cardiorespiratory fitness data. Sci Rep 2024; 14:8745. [PMID: 38627439 PMCID: PMC11021455 DOI: 10.1038/s41598-024-59401-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 04/10/2024] [Indexed: 04/19/2024] Open
Abstract
Accurately predicting patients' risk for specific medical outcomes is paramount for effective healthcare management and personalized medicine. While a substantial body of literature addresses the prediction of diverse medical conditions, existing models predominantly focus on singular outcomes, limiting their scope to one disease at a time. However, clinical reality often entails patients concurrently facing multiple health risks across various medical domains. In response to this gap, our study proposes a novel multi-risk framework adept at simultaneous risk prediction for multiple clinical outcomes, including diabetes, mortality, and hypertension. Leveraging a concise set of features extracted from patients' cardiorespiratory fitness data, our framework minimizes computational complexity while maximizing predictive accuracy. Moreover, we integrate a state-of-the-art instance-based interpretability technique into our framework, providing users with comprehensive explanations for each prediction. These explanations afford medical practitioners invaluable insights into the primary health factors influencing individual predictions, fostering greater trust and utility in the underlying prediction models. Our approach thus stands to significantly enhance healthcare decision-making processes, facilitating more targeted interventions and improving patient outcomes in clinical practice. Our prediction framework utilizes an automated machine learning framework, Auto-Weka, to optimize machine learning models and hyper-parameter configurations for the simultaneous prediction of three medical outcomes: diabetes, mortality, and hypertension. Additionally, we employ a local interpretability technique to elucidate predictions generated by our framework. These explanations manifest visually, highlighting key attributes contributing to each instance's prediction for enhanced interpretability. Using automated machine learning techniques, the models simultaneously predict hypertension, mortality, and diabetes risks, utilizing only nine patient features. They achieved an average AUC of 0.90 ± 0.001 on the hypertension dataset, 0.90 ± 0.002 on the mortality dataset, and 0.89 ± 0.001 on the diabetes dataset through tenfold cross-validation. Additionally, the models demonstrated strong performance with an average AUC of 0.89 ± 0.001 on the hypertension dataset, 0.90 ± 0.001 on the mortality dataset, and 0.89 ± 0.001 on the diabetes dataset using bootstrap evaluation with 1000 resamples.
Collapse
Affiliation(s)
- Radwa Elshawi
- Institute of Computer Science, University of Tartu, Tartu, Estonia.
| | - Sherif Sakr
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | | | - Steven J Keteyian
- Division of Cardiovascular Medicine, Henry Ford Hospital, 6525 Second Ave., Detroit, MI, 48202, USA
| | - Clinton A Brawner
- Division of Cardiovascular Medicine, Henry Ford Hospital, 6525 Second Ave., Detroit, MI, 48202, USA
| | - Jonathan K Ehrman
- Division of Cardiovascular Medicine, Henry Ford Hospital, 6525 Second Ave., Detroit, MI, 48202, USA
| |
Collapse
|
6
|
Nazari L, Aslan MF, Sabanci K, Ropelewska E. Integrated transcriptomic meta-analysis and comparative artificial intelligence models in maize under biotic stress. Sci Rep 2023; 13:15899. [PMID: 37741865 PMCID: PMC10517993 DOI: 10.1038/s41598-023-42984-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 09/17/2023] [Indexed: 09/25/2023] Open
Abstract
Biotic stress imposed by pathogens, including fungal, bacterial, and viral, can cause heavy damage leading to yield reduction in maize. Therefore, the identification of resistant genes paves the way to the development of disease-resistant cultivars and is essential for reliable production in maize. Identifying different gene expression patterns can deepen our perception of maize resistance to disease. This study includes machine learning and deep learning-based application for classifying genes expressed under normal and biotic stress in maize. Machine learning algorithms used are Naive Bayes (NB), K-Nearest Neighbor (KNN), Ensemble, Support Vector Machine (SVM), and Decision Tree (DT). A Bidirectional Long Short Term Memory (BiLSTM) based network with Recurrent Neural Network (RNN) architecture is proposed for gene classification with deep learning. To increase the performance of these algorithms, feature selection is made from the raw gene features through the Relief feature selection algorithm. The obtained finding indicated the efficacy of BiLSTM over other machine learning algorithms. Some top genes ((S)-beta-macrocarpene synthase, zealexin A1 synthase, polyphenol oxidase I, chloroplastic, pathogenesis-related protein 10, CHY1, chitinase chem 5, barwin, and uncharacterized LOC100273479 were proved to be differentially upregulated under biotic stress condition.
Collapse
Affiliation(s)
- Leyla Nazari
- Crop and Horticultural Science Research Department, Fars Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education and Extension Organization (AREEO), Shiraz, Iran.
| | - Muhammet Fatih Aslan
- Electrical and Electronics Engineering, Karamanoglu Mehmetbey University, Karaman, Turkey
| | - Kadir Sabanci
- Electrical and Electronics Engineering, Karamanoglu Mehmetbey University, Karaman, Turkey
| | - Ewa Ropelewska
- Fruit and Vegetable Storage and Processing Department, The National Institute of Horticultural Research, Skierniewice, Poland
| |
Collapse
|
7
|
Cauwenberghs N, Sente J, Van Criekinge H, Sabovčik F, Ntalianis E, Haddad F, Claes J, Claessen G, Budts W, Goetschalckx K, Cornelissen V, Kuznetsova T. Integrative Interpretation of Cardiopulmonary Exercise Tests for Cardiovascular Outcome Prediction: A Machine Learning Approach. Diagnostics (Basel) 2023; 13:2051. [PMID: 37370946 DOI: 10.3390/diagnostics13122051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 06/01/2023] [Accepted: 06/10/2023] [Indexed: 06/29/2023] Open
Abstract
Integrative interpretation of cardiopulmonary exercise tests (CPETs) may improve assessment of cardiovascular (CV) risk. Here, we identified patient phenogroups based on CPET summary metrics and evaluated their predictive value for CV events. We included 2280 patients with diverse CV risk who underwent maximal CPET by cycle ergometry. Key CPET indices and information on incident CV events (median follow-up time: 5.3 years) were derived. Next, we applied unsupervised clustering by Gaussian Mixture modeling to subdivide the cohort into four male and four female phenogroups solely based on differences in CPET metrics. Ten of 18 CPET metrics were used for clustering as eight were removed due to high collinearity. In males and females, the phenogroups differed significantly in age, BMI, blood pressure, disease prevalence, medication intake and spirometry. In males, phenogroups 3 and 4 presented a significantly higher risk for incident CV events than phenogroup 1 (multivariable-adjusted hazard ratio: 1.51 and 2.19; p ≤ 0.048). In females, differences in the risk for future CV events between the phenogroups were not significant after adjustment for clinical covariables. Integrative CPET-based phenogrouping, thus, adequately stratified male patients according to CV risk. CPET phenomapping may facilitate comprehensive evaluation of CPET results and steer CV risk stratification and management.
Collapse
Affiliation(s)
- Nicholas Cauwenberghs
- Hypertension and Cardiovascular Epidemiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - Josephine Sente
- Hypertension and Cardiovascular Epidemiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - Hanne Van Criekinge
- Hypertension and Cardiovascular Epidemiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - František Sabovčik
- Hypertension and Cardiovascular Epidemiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - Evangelos Ntalianis
- Hypertension and Cardiovascular Epidemiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - Francois Haddad
- Stanford Cardiovascular Institute and Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Jomme Claes
- Rehabilitation in Internal Disorders, Department of Rehabilitation Sciences, University of Leuven, 3001 Leuven, Belgium
| | - Guido Claessen
- Department of Cardiology, Hartcentrum, Virga Jessa Hospital, 3500 Hasselt, Belgium
- Faculty of Medicine and Life Sciences, Hasselt University, 3500 Hasselt, Belgium
| | - Werner Budts
- Cardiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - Kaatje Goetschalckx
- Cardiovascular Imaging and Dynamics, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| | - Véronique Cornelissen
- Rehabilitation in Internal Disorders, Department of Rehabilitation Sciences, University of Leuven, 3001 Leuven, Belgium
| | - Tatiana Kuznetsova
- Hypertension and Cardiovascular Epidemiology, Department of Cardiovascular Sciences, University of Leuven, 3000 Leuven, Belgium
| |
Collapse
|
8
|
Jha AK, Mithun S, Sherkhane UB, Jaiswar V, Osong B, Purandare N, Kannan S, Prabhash K, Gupta S, Vanneste B, Rangarajan V, Dekker A, Wee L. Systematic review and meta-analysis of prediction models used in cervical cancer. Artif Intell Med 2023; 139:102549. [PMID: 37100501 DOI: 10.1016/j.artmed.2023.102549] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 11/18/2022] [Accepted: 04/04/2023] [Indexed: 04/28/2023]
Abstract
BACKGROUND Cervical cancer is one of the most common cancers in women with an incidence of around 6.5 % of all the cancer in women worldwide. Early detection and adequate treatment according to staging improve the patient's life expectancy. Outcome prediction models might aid treatment decisions, but a systematic review on prediction models for cervical cancer patients is not available. DESIGN We performed a systematic review for prediction models in cervical cancer following PRISMA guidelines. Key features that were used for model training and validation, the endpoints were extracted from the article and data were analyzed. Selected articles were grouped based on prediction endpoints i.e. Group1: Overall survival, Group2: progression-free survival; Group3: recurrence or distant metastasis; Group4: treatment response; Group5: toxicity or quality of life. We developed a scoring system to evaluate the manuscript. As per our criteria, studies were divided into four groups based on scores obtained in our scoring system, the Most significant study (Score > 60 %); Significant study (60 % > Score > 50 %); Moderately Significant study (50 % > Score > 40 %); least significant study (score < 40 %). A meta-analysis was performed for all the groups separately. RESULTS The first line of search selected 1358 articles and finally 39 articles were selected as eligible for inclusion in the review. As per our assessment criteria, 16, 13 and 10 studies were found to be the most significant, significant and moderately significant respectively. The intra-group pooled correlation coefficient for Group1, Group2, Group3, Group4, and Group5 were 0.76 [0.72, 0.79], 0.80 [0.73, 0.86], 0.87 [0.83, 0.90], 0.85 [0.77, 0.90], 0.88 [0.85, 0.90] respectively. All the models were found to be good (prediction accuracy [c-index/AUC/R2] >0.7) in endpoint prediction. CONCLUSIONS Prediction models of cervical cancer toxicity, local or distant recurrence and survival prediction show promising results with reasonable prediction accuracy [c-index/AUC/R2 > 0.7]. These models should also be validated on external data and evaluated in prospective clinical studies.
Collapse
Affiliation(s)
- Ashish Kumar Jha
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands; Department of Nuclear Medicine, Tata Memorial Hospital, Mumbai, Maharashtra, India; Homi Bhabha National Institute, Mumbai, Maharashtra, India.
| | - Sneha Mithun
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands; Department of Nuclear Medicine, Tata Memorial Hospital, Mumbai, Maharashtra, India; Homi Bhabha National Institute, Mumbai, Maharashtra, India
| | - Umeshkumar B Sherkhane
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands; Department of Nuclear Medicine, Tata Memorial Hospital, Mumbai, Maharashtra, India
| | - Vinay Jaiswar
- Department of Nuclear Medicine, Tata Memorial Hospital, Mumbai, Maharashtra, India
| | - Biche Osong
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands
| | - Nilendu Purandare
- Department of Nuclear Medicine, Tata Memorial Hospital, Mumbai, Maharashtra, India; Homi Bhabha National Institute, Mumbai, Maharashtra, India
| | - Sadhana Kannan
- Homi Bhabha National Institute, Mumbai, Maharashtra, India; Advance Centre for Treatment, Research, Education in Cancer, Mumbai, Maharashtra, India
| | - Kumar Prabhash
- Department of Medical Oncology, Tata Memorial Hospital, Mumbai, Maharashtra, India; Homi Bhabha National Institute, Mumbai, Maharashtra, India
| | - Sudeep Gupta
- Department of Medical Oncology, Tata Memorial Hospital, Mumbai, Maharashtra, India; Homi Bhabha National Institute, Mumbai, Maharashtra, India; Advance Centre for Treatment, Research, Education in Cancer, Mumbai, Maharashtra, India
| | - Ben Vanneste
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands
| | - Venkatesh Rangarajan
- Department of Nuclear Medicine, Tata Memorial Hospital, Mumbai, Maharashtra, India; Homi Bhabha National Institute, Mumbai, Maharashtra, India
| | - Andre Dekker
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands
| | - Leonard Wee
- Department of Radiation Oncology (Maastro), GROW School for Oncology, Maastricht University Medical Centre+, Maastricht, the Netherlands
| |
Collapse
|
9
|
Saito S, Sakamoto S, Higuchi K, Sato K, Zhao X, Wakai K, Kanesaka M, Kamada S, Takeuchi N, Sazuka T, Imamura Y, Anzai N, Ichikawa T, Kawakami E. Machine-learning predicts time-series prognosis factors in metastatic prostate cancer patients treated with androgen deprivation therapy. Sci Rep 2023; 13:6325. [PMID: 37072487 PMCID: PMC10113215 DOI: 10.1038/s41598-023-32987-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 04/05/2023] [Indexed: 05/03/2023] Open
Abstract
Machine learning technology is expected to support diagnosis and prognosis prediction in medicine. We used machine learning to construct a new prognostic prediction model for prostate cancer patients based on longitudinal data obtained from age at diagnosis, peripheral blood and urine tests of 340 prostate cancer patients. Random survival forest (RSF) and survival tree were used for machine learning. In the time-series prognostic prediction model for metastatic prostate cancer patients, the RSF model showed better prediction accuracy than the conventional Cox proportional hazards model for almost all time periods of progression-free survival (PFS), overall survival (OS) and cancer-specific survival (CSS). Based on the RSF model, we created a clinically applicable prognostic prediction model using survival trees for OS and CSS by combining the values of lactate dehydrogenase (LDH) before starting treatment and alkaline phosphatase (ALP) at 120 days after treatment. Machine learning provides useful information for predicting the prognosis of metastatic prostate cancer prior to treatment intervention by considering the nonlinear and combined impacts of multiple features. The addition of data after the start of treatment would allow for more precise prognostic risk assessment of patients and would be beneficial for subsequent treatment selection.
Collapse
Affiliation(s)
- Shinpei Saito
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
- Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba, Chiba, Japan
| | - Shinichi Sakamoto
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan.
| | | | - Kodai Sato
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
- Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba, Chiba, Japan
| | - Xue Zhao
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Ken Wakai
- Teikyo University Chiba Medical Center, Ichihara, Chiba, Japan
| | - Manato Kanesaka
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Shuhei Kamada
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Nobuyoshi Takeuchi
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Tomokazu Sazuka
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Yusuke Imamura
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Naohiko Anzai
- Department of Pharmacology, Graduate School of Medicine, Chiba University, Chiba, Chiba, Japan
| | - Tomohiko Ichikawa
- Department of Urology, Graduate School of Medicine, Chiba University, 1-8-1 Inohana, Chuo-Ku, Chiba, Chiba, 260-8670, Japan
| | - Eiryo Kawakami
- Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba, Chiba, Japan
- Advanced Data Science Project (ADSP), RIKEN Information R&D and Strategy Headquarters, RIKEN, Kanagawa, Japan
- Institute for Advanced Academic Research (IAAR), Chiba University, Chiba, Chiba, Japan
| |
Collapse
|
10
|
Prediction of Prednisolone Dose Correction Using Machine Learning. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023; 7:84-103. [PMID: 36910914 PMCID: PMC9995628 DOI: 10.1007/s41666-023-00128-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 11/20/2022] [Accepted: 02/03/2023] [Indexed: 02/17/2023]
Abstract
Wrong dose, a common prescription error, can cause serious patient harm, especially in the case of high-risk drugs like oral corticosteroids. This study aims to build a machine learning model to predict dose-related prescription modifications for oral prednisolone tablets (i.e., highly imbalanced data with very few positive cases). Prescription data were obtained from the electronic medical records at a single institute. Cluster analysis classified the clinical departments into six clusters with similar patterns of prednisolone prescription. Two patterns of training datasets were created with/without preprocessing by the SMOTE method. Five ML models (SVM, KNN, GB, RF, and BRF) and logistic regression (LR) models were constructed by Python. The model was internally validated by five-fold stratified cross-validation and was validated with a 30% holdout test dataset. Eighty-two thousand five hundred fifty-three prescribing data for prednisolone tablets containing 135 dose-corrected positive cases were obtained. In the original dataset (without SMOTE), only the BRF model showed a good performance (in test dataset, ROC-AUC:0.917, recall: 0.951). In the training dataset preprocessed by SMOTE, performance was improved on all models. The highest performance models with SMOTE were SVM (in test dataset, ROC-AUC: 0.820, recall: 0.659) and BRF (ROC-AUC: 0.814, recall: 0.634). Although the prescribing data for dose-related collection are highly imbalanced, various techniques such as the following have allowed us to build high-performance prediction models: data preprocessing by SMOTE, stratified cross-validation, and BRF classifier corresponding to imbalanced data. ML is useful in complicated dose audits such as oral prednisolone. Supplementary Information The online version contains supplementary material available at 10.1007/s41666-023-00128-3.
Collapse
|
11
|
Li Z, Yang N, He L, Wang J, Ping F, Li W, Xu L, Zhang H, Li Y. Development and validation of questionnaire-based machine learning models for predicting all-cause mortality in a representative population of China. Front Public Health 2023; 11:1033070. [PMID: 36778549 PMCID: PMC9911458 DOI: 10.3389/fpubh.2023.1033070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 01/11/2023] [Indexed: 01/28/2023] Open
Abstract
Background Considering that the previously developed mortality prediction models have limited applications to the Chinese population, a questionnaire-based prediction model is of great importance for its accuracy and convenience in clinical practice. Methods Two national cohort, namely, the China Health and Nutrition Survey (8,355 individual older than 18) and the China Health and Retirement Longitudinal Study (12,711 individuals older than 45) were used for model development and validation. One hundred and fifty-nine variables were compiled to generate predictions. The Cox regression model and six machine learning (ML) models were used to predict all-cause mortality. Finally, a simple questionnaire-based ML prediction model was developed using the best algorithm and validated. Results In the internal validation set, all the ML models performed better than the traditional Cox model in predicting 6-year mortality and the random survival forest (RSF) model performed best. The questionnaire-based ML model, which only included 20 variables, achieved a C-index of 0.86 (95%CI: 0.80-0.92). On external validation, the simple questionnaire-based model achieved a C-index of 0.82 (95%CI: 0.77-0.87), 0.77 (95%CI: 0.75-0.79), and 0.79 (95%CI: 0.77-0.81), respectively, in predicting 2-, 9-, and 11-year mortality. Conclusions In this prospective population-based study, a model based on the RSF analysis performed best among all models. Furthermore, there was no significant difference between the prediction performance of the questionnaire-based ML model, which only included 20 variables, and that of the model with all variables (including laboratory variables). The simple questionnaire-based ML prediction model, which needs to be further explored, is of great importance for its accuracy and suitability to the Chinese general population.
Collapse
|
12
|
Ebrahimi A, Wiil UK, Naemi A, Mansourvar M, Andersen K, Nielsen AS. Identification of clinical factors related to prediction of alcohol use disorder from electronic health records using feature selection methods. BMC Med Inform Decis Mak 2022; 22:304. [PMID: 36424597 PMCID: PMC9686074 DOI: 10.1186/s12911-022-02051-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Accepted: 11/16/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND High dimensionality in electronic health records (EHR) causes a significant computational problem for any systematic search for predictive, diagnostic, or prognostic patterns. Feature selection (FS) methods have been indicated to be effective in feature reduction as well as in identifying risk factors related to prediction of clinical disorders. This paper examines the prediction of patients with alcohol use disorder (AUD) using machine learning (ML) and attempts to identify risk factors related to the diagnosis of AUD. METHODS A FS framework consisting of two operational levels, base selectors and ensemble selectors. The first level consists of five FS methods: three filter methods, one wrapper method, and one embedded method. Base selector outputs are aggregated to develop four ensemble FS methods. The outputs of FS method were then fed into three ML algorithms: support vector machine (SVM), K-nearest neighbor (KNN), and random forest (RF) to compare and identify the best feature subset for the prediction of AUD from EHRs. RESULTS In terms of feature reduction, the embedded FS method could significantly reduce the number of features from 361 to 131. In terms of classification performance, RF based on 272 features selected by our proposed ensemble method (Union FS) with the highest accuracy in predicting patients with AUD, 96%, outperformed all other models in terms of AUROC, AUPRC, Precision, Recall, and F1-Score. Considering the limitations of embedded and wrapper methods, the best overall performance was achieved by our proposed Union Filter FS, which reduced the number of features to 223 and improved Precision, Recall, and F1-Score in RF from 0.77, 0.65, and 0.71 to 0.87, 0.81, and 0.84, respectively. Our findings indicate that, besides gender, age, and length of stay at the hospital, diagnosis related to digestive organs, bones, muscles and connective tissue, and the nervous systems are important clinical factors related to the prediction of patients with AUD. CONCLUSION Our proposed FS method could improve the classification performance significantly. It could identify clinical factors related to prediction of AUD from EHRs, thereby effectively helping clinical staff to identify and treat AUD patients and improving medical knowledge of the AUD condition. Moreover, the diversity of features among female and male patients as well as gender disparity were investigated using FS methods and ML techniques.
Collapse
Affiliation(s)
- Ali Ebrahimi
- grid.10825.3e0000 0001 0728 0170SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | - Uffe Kock Wiil
- grid.10825.3e0000 0001 0728 0170SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | - Amin Naemi
- grid.10825.3e0000 0001 0728 0170SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | - Marjan Mansourvar
- grid.10825.3e0000 0001 0728 0170Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - Kjeld Andersen
- grid.10825.3e0000 0001 0728 0170Unit for Clinical Alcohol Research, Clinical Institute, University of Southern Denmark, Odense, Denmark
| | - Anette Søgaard Nielsen
- grid.10825.3e0000 0001 0728 0170Unit for Clinical Alcohol Research, Clinical Institute, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
13
|
Mattogno PP, Caccavella VM, Giordano M, D'Alessandris QG, Chiloiro S, Tariciotti L, Olivi A, Lauretti L. Interpretable Machine Learning-Based Prediction of Intraoperative Cerebrospinal Fluid Leakage in Endoscopic Transsphenoidal Pituitary Surgery: A Pilot Study. J Neurol Surg B Skull Base 2022; 83:485-495. [PMID: 36091632 PMCID: PMC9462964 DOI: 10.1055/s-0041-1740621] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 11/12/2021] [Indexed: 01/18/2023] Open
Abstract
Purpose Transsphenoidal surgery (TSS) for pituitary adenomas can be complicated by the occurrence of intraoperative cerebrospinal fluid (CSF) leakage (IOL). IOL significantly affects the course of surgery predisposing to the development of postoperative CSF leakage, a major source of morbidity and mortality in the postoperative period. The authors trained and internally validated the Random Forest (RF) prediction model to preoperatively identify patients at high risk for IOL. A locally interpretable model-agnostic explanations (LIME) algorithm is employed to elucidate the main drivers behind each machine learning (ML) model prediction. Methods The data of 210 patients who underwent TSS were collected; first, risk factors for IOL were identified via conventional statistical methods (multivariable logistic regression). Then, the authors trained, optimized, and audited a RF prediction model. Results IOL reported in 45 patients (21.5%). The recursive feature selection algorithm identified the following variables as the most significant determinants of IOL: Knosp's grade, sellar Hardy's grade, suprasellar Hardy's grade, tumor diameter (on X, Y, and Z axes), intercarotid distance, and secreting status (nonfunctioning and growth hormone [GH] secreting). Leveraging the predictive values of these variables, the RF prediction model achieved an area under the curve (AUC) of 0.83 (95% confidence interval [CI]: 0.78; 0.86), significantly outperforming the multivariable logistic regression model (AUC = 0.63). Conclusion A RF model that reliably identifies patients at risk for IOL was successfully trained and internally validated. ML-based prediction models can predict events that were previously judged nearly unpredictable; their deployment in clinical practice may result in improved patient care and reduced postoperative morbidity and healthcare costs.
Collapse
Affiliation(s)
- Pier Paolo Mattogno
- Department of Neurosurgery, Fondazione Policlinico Universitario A. Gemell iIstituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| | - Valerio M. Caccavella
- Department of Neurosurgery, Fondazione Policlinico Universitario A. Gemell iIstituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| | - Martina Giordano
- Department of Neurosurgery, Fondazione Policlinico Universitario A. Gemell iIstituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| | - Quintino G. D'Alessandris
- Department of Neurosurgery, Fondazione Policlinico Universitario A. Gemell iIstituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| | - Sabrina Chiloiro
- Department of Endocrinology, Fondazione Policlinico Universitario A. Gemelli Istituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| | - Leonardo Tariciotti
- Unit of Neurosurgery, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico Cà Granda Ospedale Maggiore Policlinico, Milan, Italy
- University of Milan, Milan, Italy
| | - Alessandro Olivi
- Department of Neurosurgery, Fondazione Policlinico Universitario A. Gemell iIstituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| | - Liverana Lauretti
- Department of Neurosurgery, Fondazione Policlinico Universitario A. Gemell iIstituto di Ricovero e Cura a Carattere Scientifico Università Cattolica del Sacro Cuore, Rome, Italy
| |
Collapse
|
14
|
de Souza E Silva CG, Buginga GC, de Souza E Silva EA, Arena R, Rouleau CR, Aggarwal S, Wilton SB, Austford L, Hauer T, Myers J. Prediction of Mortality in Coronary Artery Disease: Role of Machine Learning and Maximal Exercise Capacity. Mayo Clin Proc 2022; 97:1472-1482. [PMID: 35431026 DOI: 10.1016/j.mayocp.2022.01.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 01/07/2022] [Accepted: 01/11/2022] [Indexed: 10/18/2022]
Abstract
OBJECTIVE To develop a prediction model for survival of patients with coronary artery disease (CAD) using health conditions beyond cardiovascular risk factors, including maximal exercise capacity, through the application of machine learning (ML) techniques. METHODS Analysis of data from a retrospective cohort linking clinical, administrative, and vital status databases from 1995 to 2016 was performed. Inclusion criteria were age 18 years or older, diagnosis of CAD, referral to a cardiac rehabilitation program, and available baseline exercise test results. Primary outcome was death from any cause. Feature selection was performed using supervised and unsupervised ML techniques. The final prognostic model used the survival tree (ST) algorithm. RESULTS From the cohort of 13,362 patients (60±11 years; 2400 [18%] women), 1577 died during a median follow-up of 8 years (interquartile range, 4 to 13 years), with an estimated survival of 67% up to 21 years. Feature selection revealed age and peak metabolic equivalents (METs) as the features with the greatest importance for mortality prediction. Using these 2 features, the ST generated a long-term prediction with a C-index of 0.729 by splitting patients in 8 clusters with different survival probabilities (P<.001). The ST root node was split by peak METs of 6.15 or less or more than 6.15, and each patient's subgroup was further split by age or other peak METs cut points. CONCLUSION Applying ML techniques, age and maximal exercise capacity accurately predict mortality in patients with CAD and outperform variables commonly used for decision-making in clinical practice. A novel and simple prognostic model was established, and maximal exercise capacity was further suggested to be one of the most powerful predictors of mortality in CAD.
Collapse
Affiliation(s)
| | - Gabriel C Buginga
- Systems Engineering and Computer Science/COPPE, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Edmundo A de Souza E Silva
- Systems Engineering and Computer Science/COPPE, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Ross Arena
- Department of Physical Therapy, College of Applied Health Sciences, University of Illinois at Chicago; TotalCardiology(TM) Research Network, Calgary, Alberta, Canada
| | - Codie R Rouleau
- TotalCardiology(TM) Research Network, Calgary, Alberta, Canada; Department of Psychology, University of Calgary, Alberta, Canada
| | - Sandeep Aggarwal
- TotalCardiology(TM) Research Network, Calgary, Alberta, Canada; Libin Cardiovascular Institute, University of Calgary, Alberta, Canada
| | - Stephen B Wilton
- Libin Cardiovascular Institute, University of Calgary, Alberta, Canada
| | - Leslie Austford
- TotalCardiology(TM) Research Network, Calgary, Alberta, Canada
| | - Trina Hauer
- TotalCardiology(TM) Research Network, Calgary, Alberta, Canada
| | - Jonathan Myers
- Cardiovascular Division, Veterans Affairs Palo Alto Health Care System, Palo Alto, CA; Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA
| |
Collapse
|
15
|
Mfateneza E, Rutayisire PC, Biracyaza E, Musafiri S, Mpabuka WG. Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014-15 dataset. BMC Pregnancy Childbirth 2022; 22:388. [PMID: 35509018 PMCID: PMC9066935 DOI: 10.1186/s12884-022-04699-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Accepted: 04/18/2022] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Extensive research on infant mortality (IM) exists in developing countries; however, most of the methods applied thus far relied on conventional regression analyses with limited prediction capability. Advanced of Machine Learning (AML) methods provide accurate prediction of IM; however, there is no study conducted using ML methods in Rwanda. This study, therefore, applied Machine Learning Methods for predicting infant mortality in Rwanda. METHODS: A cross-sectional study design was conducted using the 2014-15 Rwanda Demographic and Health Survey. Python software version 3.8 was employed to test and apply ML methods through Random Forest (RF), Decision Tree, Support Vector Machine and Logistic regression. STATA version 13 was used for analysing conventional methods. Evaluation metrics methods specifically confusion matrix, accuracy, precision, recall, F1 score, and Area under the Receiver Operating Characteristics (AUROC) were used to evaluate the performance of predictive models. RESULTS Ability of prediction was between 68.6% and 61.5% for AML. We preferred with the RF model (61.5%) presenting the best performance. The RF model was the best predictive model of IM with accuracy (84.3%), recall (91.3%), precision (80.3%), F1 score (85.5%), and AUROC (84.2%); followed by decision tree model with model accuracy (83%), recall (91%), precision (79%), F1 score (84.67%) and AUROC(82.9%), followed by support vector machine with model accuracy (68.6%), recall (74.9%), precision(67%), F1 score (70.73%) and AUROC (68.6%) and last was a logistic regression with the low accuracy of prediction (61.5%), recall (61.1%), precision (62.2%), F1 score (61.6%) and AUROC (61.5%) compared to other predictive models. Our predictive models showed that marital status, children ever born, birth order and wealth index are the 4 top predictors of IM. CONCLUSIONS In developing a predictive model, ML methods are used to classify certain hidden information that could not be detected by traditional statistical methods. Random Forest was classified as the best classifier to be used for the predictive models of IM.
Collapse
Affiliation(s)
- Emmanuel Mfateneza
- African Centre of Excellence in Data Science, University of Rwanda, Kigali, Rwanda
| | | | | | - Sanctus Musafiri
- Clinical Department of Internal Medicine, University of Rwanda, Kigali, Rwanda
| | | |
Collapse
|
16
|
Rathore N, Jain PK, Parida M. A Sustainable Model for Emergency Medical Services in Developing Countries: A Novel Approach Using Partial Outsourcing and Machine Learning. Risk Manag Healthc Policy 2022; 15:193-218. [PMID: 35173497 PMCID: PMC8841749 DOI: 10.2147/rmhp.s338186] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Accepted: 01/06/2022] [Indexed: 11/23/2022] Open
Abstract
INTRODUCTION Unlike Western countries, many low- and middle-income countries (LMIC), like India, have a de-centralized emergency medical services (EMS) involving both semi-government and non-government organizations. It is alarming that due to the absence of a common ecosystem, the utilization of resources is inefficient, which leads to shortage of available vehicles and larger response time. Fragmentation of emergency supply chain resources motivates us to propose a new vehicle routing and scheduling model equipped with novel features to ensure minimal response time using existing resources. MATERIALS AND METHODS The data set of medical and fire-related emergencies from January 2018 to May 2018 of Uttarakhand State in India was provided by GVK Emergency Management and Research Institute (GVK EMRI) also known as 108 EMSs was used in the study. The proposed model integrates all the available EMS vehicles including partial outsourcing to non-ambulatory vehicles like police vans, taxis, etc., using a novel two-echelon heuristic approach. In the first stage, an offline learning model is developed to yield the deployment strategy for EMS vehicles. Seven well researched machine learning (ML) algorithms were analyzed for parameter prediction namely random forest (RF), convolutional neural network (CNN), k-nearest neighbor (KNN), classification and regression tree (CART), support vector machine (SVM), logistic regression (LR), and linear discriminant analysis (LDA). In the second stage, a real-time routing model is proposed for EMS vehicle routing at the time of emergency, considering partial outsourcing. RESULTS AND DISCUSSION The results indicate that the RF classifier outperforms the LR, LDA, SVM, CNN, CART and NB classifier in terms of both accuracy as well as F-1 score. The proposed vehicle routing and scheduling model for automated decision-making shows an improvement of 42.1%, 54%, 27.9% and 62% in vehicle assignment time, vehicle travel time from base to scene, travel time from scene to hospital, and total response time, respectively, in urban areas.
Collapse
Affiliation(s)
- Nikki Rathore
- Department of Mechanical and Industrial Engineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
| | - Pramod Kumar Jain
- Department of Mechanical and Industrial Engineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
| | - Manoranjan Parida
- Department of Civil Engineering Indian Institute of Technology Roorkee, Roorkee, 247667, India
| |
Collapse
|
17
|
Vazquez-Zapien GJ, Mata-Miranda MM, Garibay-Gonzalez F, Sanchez-Brito M. Artificial intelligence model validation before its application in clinical diagnosis assistance. World J Gastroenterol 2022; 28:602-604. [PMID: 35316961 PMCID: PMC8905022 DOI: 10.3748/wjg.v28.i5.602] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 10/07/2021] [Accepted: 01/17/2022] [Indexed: 02/06/2023] Open
Abstract
The process of selecting an artificial intelligence (AI) model to assist clinical diagnosis of a particular pathology and its validation tests is relevant since the values of accuracy, sensitivity and specificity may not reflect the behavior of the method in a real environment. Here, we provide helpful considerations to increase the success of using an AI model in clinical practice.
Collapse
Affiliation(s)
| | | | | | - Miguel Sanchez-Brito
- Instituto Tecnológico de Zacatepec, Industrial Engineering, TecNM, Zacatepec 62780, Morelos, Mexico
- Instituto Tecnológico de Aguascalientes, Computational Sciences, TecNM, Aguascalientes 20256, Mexico
| |
Collapse
|
18
|
Virtual reality: a powerful technology to provide novel insight into treatment mechanisms of addiction. Transl Psychiatry 2021; 11:617. [PMID: 34873146 PMCID: PMC8648903 DOI: 10.1038/s41398-021-01739-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 10/20/2021] [Accepted: 11/01/2021] [Indexed: 12/31/2022] Open
Abstract
Due to its high ecological validity, virtual reality (VR) technology has emerged as a powerful tool for mental health research. Despite the wide use of VR simulations in research on mental illnesses, the study of addictive processes through the use of VR environments is still at its dawn. In a systematic literature search, we identified 38 reports of research projects using highly immersive head-mounted displays, goggles, or CAVE technologies to provide insight into treatment mechanisms of addictive behaviors. So far, VR research has mainly addressed the roles of craving, psychophysiology, affective states, cognition, and brain activity in addiction. The computer-generated VR environments offer very realistic, dynamic, interactive, and complex real-life simulations requesting active participation. They create a high sense of immersion in users by combining stereoscopic three-dimensional visual, auditory, olfactory, and tactile perceptions, tracking systems responding to user movements, and social interactions. VR is an emerging tool to study how proximal multi-sensorial cues, contextual environmental cues, as well as their interaction (complex cues) modulate addictive behaviors. VR allows for experimental designs under highly standardized, strictly controlled, predictable, and repeatable conditions. Moreover, VR simulations can be personalized. They are currently refined for psychotherapeutic interventions. Embodiment, eye-tracking, and neurobiological factors represent novel future directions. The progress of VR applications has bred auspicious ways to advance the understanding of treatment mechanisms underlying addictions, which researchers have only recently begun to exploit. VR methods promise to yield significant achievements to the addiction field. These are necessary to develop more efficacious and efficient preventive and therapeutic strategies.
Collapse
|
19
|
Della Pepa GM, Caccavella VM, Menna G, Ius T, Auricchio AM, Sabatino G, La Rocca G, Chiesa S, Gaudino S, Marchese E, Olivi A. Machine Learning-Based Prediction of Early Recurrence in Glioblastoma Patients: A Glance Towards Precision Medicine. Neurosurgery 2021; 89:873-883. [PMID: 34459917 DOI: 10.1093/neuros/nyab320] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 06/09/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Ability to thrive and time-to-recurrence following treatment are important parameters to assess in patients with glioblastoma multiforme (GBM), given its dismal prognosis. Though there is an ongoing debate whether it can be considered an appropriate surrogate endpoint for overall survival in clinical trials, progression-free survival (PFS) is routinely used for clinical decision-making. OBJECTIVE To investigate whether machine learning (ML)-based models can reliably stratify newly diagnosed GBM patients into prognostic subclasses on PFS basis, identifying those at higher risk for an early recurrence (≤6 mo). METHODS Data were extracted from a multicentric database, according to the following eligibility criteria: histopathologically verified GBM and follow-up >12 mo: 474 patients met our inclusion criteria and were included in the analysis. Relevant demographic, clinical, molecular, and radiological variables were selected by a feature selection algorithm (Boruta) and used to build a ML-based model. RESULTS Random forest prediction model, evaluated on an 80:20 split ratio, achieved an AUC of 0.81 (95% CI: 0.77; 0.83) demonstrating high discriminative ability. Optimizing the predictive value derived from the linear and nonlinear combinations of the selected input features, our model outperformed across all performance metrics multivariable logistic regression. CONCLUSION A robust ML-based prediction model that identifies patients at high risk for early recurrence was successfully trained and internally validated. Considerable effort remains to integrate these predictions in a patient-centered care context.
Collapse
Affiliation(s)
- Giuseppe Maria Della Pepa
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Valerio Maria Caccavella
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Grazia Menna
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Tamara Ius
- Neurosurgery Unit, Department of Neuroscience, Santa Maria della Misericordia, University Hospital, Udine, Italy
| | - Anna Maria Auricchio
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Giovanni Sabatino
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy.,Department of Neurosurgery, Mater Olbia Hospital, Olbia, Italy
| | - Giuseppe La Rocca
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy.,Department of Neurosurgery, Mater Olbia Hospital, Olbia, Italy
| | - Silvia Chiesa
- Radiotherapy Department, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Simona Gaudino
- Radiology and Neuroradiology Department, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Enrico Marchese
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| | - Alessandro Olivi
- Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy
| |
Collapse
|
20
|
Estimated Artificial Neural Network Modeling of Maximal Oxygen Uptake Based on Multistage 10-m Shuttle Run Test in Healthy Adults. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18168510. [PMID: 34444259 PMCID: PMC8391137 DOI: 10.3390/ijerph18168510] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/11/2021] [Accepted: 08/11/2021] [Indexed: 11/17/2022]
Abstract
We aimed to develop an artificial neural network (ANN) model to estimate the maximal oxygen uptake (VO2max) based on a multistage 10 m shuttle run test (SRT) in healthy adults. For ANN-based VO2max estimation, 118 healthy Korean adults (59 men and 59 women) in their twenties and fifties (38.3 ± 11.8 years, men aged 37.8 ± 12.1 years, and women aged 38.8 ± 11.6 years) participated in this study; data included age, sex, blood pressure (systolic blood pressure (SBP), diastolic blood pressure (DBP)), waist circumference, hip circumference, waist-to-hip ratio (WHR), body composition (weight, height, body mass index (BMI), percent skeletal muscle, and percent body), 10 m SRT parameters (number of round trips and final speed), and VO2max by graded exercise test (GXT) using a treadmill. The best estimation results (R2 = 0.8206, adjusted R2 = 0.7010, root mean square error; RMSE = 3.1301) were obtained in case 3 (using age, sex, height, weight, BMI, waist circumference, hip circumference, WHR, SBP, DBP, number of round trips in 10 m SRT, and final speed in 10 m SRT), while the worst results (R2 = 0.7765, adjusted R2 = 0.7206, RMSE = 3.494) were obtained for case 1 (using age, sex, height, weight, BMI, number of round trips in 10 m SRT, and final speed in 10 m SRT). The estimation results of case 2 (using age, sex, height, weight, BMI, waist circumference, hip circumference, WHR, number of round trips in 10 m SRT, and final speed in 10 m SRT) were lower (R2 = 0.7909, adjusted R2 = 0.7072, RMSE = 3.3798) than those of case 3 and higher than those of case 1. However, all cases showed high performance (R2) in the estimation results. This brief report developed an ANN-based estimation model to predict the VO2max of healthy adults, and the model’s performance was confirmed to be excellent.
Collapse
|
21
|
Kuo CC, Wang HH, Tseng LP. Using data mining technology to predict medication-taking behaviour in women with breast cancer: A retrospective study. Nurs Open 2021; 9:2646-2656. [PMID: 34156764 PMCID: PMC9584494 DOI: 10.1002/nop2.963] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 05/10/2021] [Accepted: 05/27/2021] [Indexed: 11/19/2022] Open
Abstract
Aims Medication‐taking behaviours of breast cancer survivors undergoing adjuvant hormone therapy have received considerable attention. This study aimed to determine factors affecting medication‐taking behaviours in people with breast cancer using data mining. Design A longitudinal observational retrospective cohort study with a hospital‐based survey. Methods A total of 385 subjects were surveyed, analysing existing data from January 2010 to December 2017 in Taiwan. Three data mining approaches—multiple logistic regression, decision tree and artificial neural network—were used to build the prediction models and rank the importance of influencing factors. Accuracy, specificity and sensitivity were used as assessment indicators for the prediction models. Results Multiple logistic regression was the most effective approach, achieving an accuracy of 96.37%, specificity of 96.75% and sensitivity of 96.12%. The duration of adjuvant hormone therapy discontinuation, duration of adjuvant hormone therapy use and age at diagnosis by data mining were the three most critical factors influencing the medication‐taking behaviours of people with breast cancer.
Collapse
Affiliation(s)
- Chen-Chen Kuo
- The Cancer Prevention and Treatment Center, St. Martin De Porres Hospital, Chiayi, Taiwan.,School of Nursing, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Hsiu-Hung Wang
- School of Nursing, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Li-Ping Tseng
- Management Center, St. Martin De Porres Hospital, Chiayi, Taiwan.,Department of Industrial Engineering and Management, National Yunlin University of Science and Technology, Douliu, Taiwan
| |
Collapse
|
22
|
Guo C, Liu M, Lu M. A Dynamic Ensemble Learning Algorithm based on K-means for ICU mortality prediction. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107166] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
23
|
Binkheder S, Aldekhyyel R, Almulhem J. Health informatics publication trends in Saudi Arabia: a bibliometric analysis over the last twenty-four years. J Med Libr Assoc 2021; 109:219-239. [PMID: 34285665 PMCID: PMC8270356 DOI: 10.5195/jmla.2021.1072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
OBJECTIVE Understanding health informatics (HI) publication trends in Saudi Arabia may serve as a framework for future research efforts and contribute toward meeting national "e-Health" goals. The authors' intention was to understand the state of the HI field in Saudi Arabia by exploring publication trends and their alignment with national goals. METHODS A scoping review was performed to identify HI publications from Saudi Arabia in PubMed, Embase, and Web of Science. We analyzed publication trends based on topics, keywords, and how they align with the Ministry of Health's (MOH's) "digital health journey" framework. RESULTS The total number of publications included was 242. We found 1 (0.4%) publication in 1995-1999, 11 (4.5%) publications in 2000-2009, and 230 (95.0%) publications in 2010-2019. We categorized publications into 3 main HI fields and 4 subfields: 73.1% (n=177) of publications were in clinical informatics (85.1%, n=151 medical informatics; 5.6%, n=10 pharmacy informatics; 6.8%, n=12 nursing informatics; 2.3%, n=4 dental informatics); 22.3% (n=54) were in consumer health informatics; and 4.5% (n=11) were in public health informatics. The most common keyword was "medical informatics" (21.5%, n=52). MOH framework-based analysis showed that most publications were categorized as "digitally enabled care" and "digital health foundations." CONCLUSIONS The years of 2000-2009 may be seen as an infancy stage of the HI field in Saudi Arabia. Exploring how the Saudi Arabian MOH's e-Health initiatives may influence research is valuable for advancing the field. Data exchange and interoperability, artificial intelligence, and intelligent health enterprises might be future research directions in Saudi Arabia.
Collapse
Affiliation(s)
- Samar Binkheder
- , Assistant Professor of Biomedical and Health Informatics, College of Medicine, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Raniah Aldekhyyel
- , Assistant Professor of Biomedical and Health Informatics, College of Medicine, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Jwaher Almulhem
- , Assistant Professor of Biomedical and Health Informatics, College of Medicine, King Saud University, Riyadh, Kingdom of Saudi Arabia
| |
Collapse
|
24
|
Bayesian Network as a Decision Tool for Predicting ALS Disease. Brain Sci 2021; 11:brainsci11020150. [PMID: 33498784 PMCID: PMC7912628 DOI: 10.3390/brainsci11020150] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 01/09/2021] [Accepted: 01/20/2021] [Indexed: 12/14/2022] Open
Abstract
Clinical diagnosis of amyotrophic lateral sclerosis (ALS) is difficult in the early period. But blood tests are less time consuming and low cost methods compared to other methods for the diagnosis. The ALS researchers have been used machine learning methods to predict the genetic architecture of disease. In this study we take advantages of Bayesian networks and machine learning methods to predict the ALS patients with blood plasma protein level and independent personal features. According to the comparison results, Bayesian Networks produced best results with accuracy (0.887), area under the curve (AUC) (0.970) and other comparison metrics. We confirmed that sex and age are effective variables on the ALS. In addition, we found that the probability of onset involvement in the ALS patients is very high. Also, a person’s other chronic or neurological diseases are associated with the ALS disease. Finally, we confirmed that the Parkin level may also have an effect on the ALS disease. While this protein is at very low levels in Parkinson’s patients, it is higher in the ALS patients than all control groups.
Collapse
|
25
|
Truong VT, Beyerbach D, Mazur W, Wigle M, Bateman E, Pallerla A, Ngo TNM, Shreenivas S, Tretter JT, Palmer C, Kereiakes DJ, Chung ES. Machine learning method for predicting pacemaker implantation following transcatheter aortic valve replacement. PACING AND CLINICAL ELECTROPHYSIOLOGY: PACE 2021; 44:334-340. [PMID: 33433905 DOI: 10.1111/pace.14163] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Revised: 11/30/2020] [Accepted: 12/13/2020] [Indexed: 12/29/2022]
Abstract
BACKGROUND An accurate assessment of permanent pacemaker implantation (PPI) risk following transcatheter aortic valve replacement (TAVR) is important for clinical decision making. The aims of this study were to investigate the significance and utility of pre- and post-TAVR ECG data and compare machine learning approaches with traditional logistic regression in predicting pacemaker risk following TAVR. METHODS Five hundred fifity seven patients in sinus rhythm undergoing TAVR for severe aortic stenosis (AS) were included in the analysis. Baseline demographics, clinical, pre-TAVR ECG, post-TAVR data, post-TAVR ECGs (24 h following TAVR and before PPI), and echocardiographic data were recorded. A Random Forest (RF) algorithm and logistic regression were used to train models for assessing the likelihood of PPI following TAVR. RESULTS Average age was 80 ± 9 years, with 52% male. PPI after TAVR occurred in 95 patients (17.1%). The optimal cutoff of delta PR (difference between post and pre TAVR PR intervals) to predict PPI was 20 ms with a sensitivity of 0.82, a specificity of 0.66. With regard to delta QRS, the optimal cutoff was 13 ms with a sensitivity of 0.68 and a specificity of 0.59. The RF model that incorporated post-TAVR ECG data (AUC 0.81) more accurately predicted PPI risk compared to the RF model without post-TAVR ECG data (AUC 0.72). Moreover, the RF model performed better than logistic regression model in predicting PPI risk (AUC: 0.81 vs. 0.69). CONCLUSIONS Machine learning using RF methodology is significantly more powerful than traditional logistic regression in predicting PPI risk following TAVR.
Collapse
Affiliation(s)
- Vien T Truong
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA.,The Sue and Bill Butler Research Fellow, The Linder Research Center, Cincinnati, Ohio, USA
| | - Daniel Beyerbach
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Wojciech Mazur
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Matthew Wigle
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Emma Bateman
- University of Kentucky, Lexington, Kentucky, USA
| | | | - Tam N M Ngo
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Satya Shreenivas
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Justin T Tretter
- Heart Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, USA
| | - Cassady Palmer
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Dean J Kereiakes
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| | - Eugene S Chung
- The Christ Hospital Health Network and The Lindner Research Center, Cincinnati, Ohio, USA
| |
Collapse
|
26
|
Adeyinka DA, Muhajarine N. Time series prediction of under-five mortality rates for Nigeria: comparative analysis of artificial neural networks, Holt-Winters exponential smoothing and autoregressive integrated moving average models. BMC Med Res Methodol 2020; 20:292. [PMID: 33267817 PMCID: PMC7712624 DOI: 10.1186/s12874-020-01159-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 11/09/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Accurate forecasting model for under-five mortality rate (U5MR) is essential for policy actions and planning. While studies have used traditional time series modeling techniques (e.g., autoregressive integrated moving average (ARIMA) and Holt-Winters smoothing exponential methods), their appropriateness to predict noisy and non-linear data (such as childhood mortality) has been debated. The objective of this study was to model long-term U5MR with group method of data handling (GMDH)-type artificial neural network (ANN), and compare the forecasts with the commonly used conventional statistical methods-ARIMA regression and Holt-Winters exponential smoothing models. METHODS The historical dataset of annual U5MR in Nigeria from 1964 to 2017 was obtained from the official website of World Bank. The optimal models for each forecasting methods were used for forecasting mortality rates to 2030 (ending of Sustainable Development Goal era). The predictive performances of the three methods were evaluated, based on root mean squared errors (RMSE), root mean absolute error (RMAE) and modified Nash-Sutcliffe efficiency (NSE) coefficient. Statistically significant differences in loss function between forecasts of GMDH-type ANN model compared to each of the ARIMA and Holt-Winters models were assessed with Diebold-Mariano (DM) test and Deming regression. RESULTS The modified NSE coefficient was slightly lower for Holt-Winters methods (96.7%), compared to GMDH-type ANN (99.8%) and ARIMA (99.6%). The RMSE of GMDH-type ANN (0.09) was lower than ARIMA (0.23) and Holt-Winters (2.87). Similarly, RMAE was lowest for GMDH-type ANN (0.25), compared with ARIMA (0.41) and Holt-Winters (1.20). From the DM test, the mean absolute error (MAE) was significantly lower for GMDH-type ANN, compared with ARIMA (difference = 0.11, p-value = 0.0003), and Holt-Winters model (difference = 0.62, p-value< 0.001). Based on the intercepts from Deming regression, the predictions from GMDH-type ANN were more accurate (β0 = 0.004 ± standard error: 0.06; 95% confidence interval: - 0.113 to 0.122). CONCLUSIONS GMDH-type neural network performed better in predicting and forecasting of under-five mortality rates for Nigeria, compared to the ARIMA and Holt-Winters models. Therefore, GMDH-type ANN might be more suitable for data with non-linear or unknown distribution, such as childhood mortality. GMDH-type ANN increases forecasting accuracy of childhood mortalities in order to inform policy actions in Nigeria.
Collapse
Affiliation(s)
- Daniel Adedayo Adeyinka
- Department of Community Health and Epidemiology, College of Medicine, University of Saskatchewan, Saskatoon, SK, S7N 5E5, Canada. .,Department of Public Health, Federal Ministry of Health, Abuja, Nigeria.
| | - Nazeem Muhajarine
- Department of Community Health and Epidemiology, College of Medicine, University of Saskatchewan, Saskatoon, SK, S7N 5E5, Canada.,Saskatchewan Population Health and Evaluation Research Unit, Saskatoon, Saskatchewan, Canada
| |
Collapse
|
27
|
ElShawi R, Sherif Y, Al‐Mallah M, Sakr S. Interpretability in healthcare: A comparative study of local machine learning interpretability techniques. Comput Intell 2020. [DOI: 10.1111/coin.12410] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
28
|
Identification of early liver toxicity gene biomarkers using comparative supervised machine learning. Sci Rep 2020; 10:19128. [PMID: 33154507 PMCID: PMC7645727 DOI: 10.1038/s41598-020-76129-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 10/12/2020] [Indexed: 02/08/2023] Open
Abstract
Screening agrochemicals and pharmaceuticals for potential liver toxicity is required for regulatory approval and is an expensive and time-consuming process. The identification and utilization of early exposure gene signatures and robust predictive models in regulatory toxicity testing has the potential to reduce time and costs substantially. In this study, comparative supervised machine learning approaches were applied to the rat liver TG-GATEs dataset to develop feature selection and predictive testing. We identified ten gene biomarkers using three different feature selection methods that predicted liver necrosis with high specificity and selectivity in an independent validation dataset from the Microarray Quality Control (MAQC)-II study. Nine of the ten genes that were selected with the supervised methods are involved in metabolism and detoxification (Car3, Crat, Cyp39a1, Dcd, Lbp, Scly, Slc23a1, and Tkfc) and transcriptional regulation (Ablim3). Several of these genes are also implicated in liver carcinogenesis, including Crat, Car3 and Slc23a1. Our biomarker gene signature provides high statistical accuracy and a manageable number of genes to study as indicators to potentially accelerate toxicity testing based on their ability to induce liver necrosis and, eventually, liver cancer.
Collapse
|
29
|
Ding X, Li Y, Li D, Li L, Liu X. Using machine-learning approach to distinguish patients with methamphetamine dependence from healthy subjects in a virtual reality environment. Brain Behav 2020; 10:e01814. [PMID: 32862513 PMCID: PMC7667292 DOI: 10.1002/brb3.1814] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/20/2019] [Revised: 08/08/2020] [Accepted: 08/09/2020] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND The aim of this study was to evaluate whether machine learning (ML) can be used to distinguish patients with methamphetamine dependence from healthy controls by using their surface electroencephalography (EEG) and galvanic skin response (GSR) in a drug-simulated virtual reality (VR) environment. METHODS A total of 333 participants with methamphetamine (METH) dependence and 332 healthy control subjects were recruited between January 2018 and January 2019. EEG (five electrodes) and GSR signals were collected under four VR environments: one neutral scenario and three METH-simulated scenarios. Three ML classification techniques were evaluated: random forest (RF), support vector machine (SVM), and logistic regression (LR). RESULTS The MANOVA showed no interaction effects among the two subject groups and the 4 VR scenarios. Taking patient groups as the main effect, the METH user group had significantly lower GSR, lower EEG power in delta (p < .001), and alpha bands (p < .001) than healthy subjects. The EEG power of beta band (p < .001) and gamma band (p < .001) was significantly higher in METH group than the control group. Taking the VR scenarios (Neutral versus METH-VR) as the main effects, the GSR, EEG power in delta, theta, and alpha bands in neutral scenario were significantly higher than in the METH-VR scenario (p < .001). The LR algorithm showed the highest specificity and sensitivity in distinguishing methamphetamine-dependent patients from healthy controls. CONCLUSION The study shows the potential of using machine learning to distinguish methamphetamine-dependent patients from healthy subjects by using EEG and GSR data. The LR algorithm shows the best performance comparing with SVM and RF algorithm.
Collapse
Affiliation(s)
- Xinfang Ding
- School of Medical Humanities, Capital Medical University, Beijing, China
| | - Yuanhui Li
- Adai Technology (Beijing) Ltd., Co, Beijing, China
| | - Dai Li
- Adai Technology (Beijing) Ltd., Co, Beijing, China
| | - Ling Li
- School of Computing, University of Kent, Kent, UK
| | - Xiuyun Liu
- Department of Anesthesiology and Critical Care Medicine, School of Medicine, Johns Hopkins University, Baltimore, MD, USA.,School of Precision Instrument and Optoelectronics Engineering, Tianjin University, Tianjin, China
| |
Collapse
|
30
|
Forecasting mortality rates using hybrid Lee–Carter model, artificial neural network and random forest. COMPLEX INTELL SYST 2020. [DOI: 10.1007/s40747-020-00185-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
AbstractInaccurate prediction would cause the insurance company encounter catastrophic losses and may lead to overpriced premiums where low-earning consumers cannot afford to insure themselves. The ability to forecast mortality rates accurately can allow the insurance company to take preventive measures to introduce new policies with reasonable prices. In this paper, several Lee–Carter (LC) based models are used to forecast the mortality rates in a case study of the Malaysian population. The LC-ARIMA model and also a combination of the LC model with two machine learning (ML) methods, namely the random forest (RF) and artificial neural network (ANN) methods are utilized on the prediction of mortality rates for males and females in Malaysia, whereby the LC-Random Forest (LC-RF) hybrid model is a new model that is introduced in this paper. Seventeen years of mortality data in Malaysia are selected as the dataset for this research. To analyze how the forecasting models perform for other countries, we have determined the model that has the best fit and produced the best forecasted mortality rates for all the other countries that are studied. This research has showed that LC-ANN and LC-ARIMA are the best model in predicting the mortality rates of males and females in Malaysia, respectively. This study has also found that the LC-ARIMA model is the best performing model in forecasting the mortality rates in countries that have longer life expectancy and a good healthcare system such as Sweden, Ireland, Japan, Hong Kong, Norway, Switzerland and Czechia. In contrast, the LC-ANN model is the best performing model in forecasting the mortality rates in countries that have a less efficiency, less accessibility healthcare system, and bad personal behavior such as Malaysia, Canada and Latvia.
Collapse
|
31
|
Wang J, Deng F, Zeng F, Shanahan AJ, Li WV, Zhang L. Predicting long-term multicategory cause of death in patients with prostate cancer: random forest versus multinomial model. Am J Cancer Res 2020; 10:1344-1355. [PMID: 32509383 PMCID: PMC7269775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 04/07/2020] [Indexed: 06/11/2023] Open
Abstract
The majority of patients with prostate cancer die of non-cancer causes of death (COD). It is thus important to accurately predict multi-category COD in these patients. Random forest (RF), a popular machine learning model, has been shown useful for predicting binary cancer-specific deaths. However, its accuracy for predicting multi-category COD in cancer patients is unclear. We included patients in Surveillance, Epidemiology, and End Results-18 cancer registry-program with prostate cancer diagnosed in 2004 (followed-up through 2016). They were randomly divided into training and testing sets with equal sizes. We evaluated prediction accuracies of RF and conventional statistical/multinomial models for 6-category COD by data-encoding types using the 2-fold cross-validation approach. Among 49,864 prostate cancer patients, 29,611 (59.4%) were alive at the end of follow-up, and 5,448 (10.9%) died of cardiovascular disease, 4,607 (9.2%) of prostate cancer, 3,681 (7.4%) of non-prostate cancer, 717 (1.4%) of infection, and 5,800 (11.6%) of other causes. We predicted 6-category COD among these patients with a mean accuracy of 59.1% (n=240, 95% CI, 58.7%-59.4%) in RF models with one-hot encoding, and 50.4% (95% CI, 49.7%-51.0%) in multinomial models. Tumor characteristics, prostate-specific antigen level, and diagnosis confirmation-method were important in RF and multinomial models. In RF models, no statistical differences were found between the accuracies of training versus cross-validation phases, and those of categorical versus one-hot encoding. We here report that RF models can outperform multinomial logistic models (absolute accuracy-difference, 8.7%) in predicting long-term 6-category COD among prostate cancer patients, while pathology diagnosis itself and tumor pathology remain important factors.
Collapse
Affiliation(s)
- Jianwei Wang
- Department of Urology, Beijing Jishuitan Hospital, The Fourth Medical College of Peking UniversityBeijing, China
| | - Fei Deng
- School of Electrical and Electronic Engineering, Shanghai Institute of TechnologyShanghai, China
| | - Fuqing Zeng
- Department of Urology, Wuhan Union Hospital of Tongji Medical Collage, Huazhong University of Science and TechnologyWuhan, China
| | | | - Wei Vivian Li
- Department of Biostatistics and Epidemiology, Rutgers School of Public HealthPiscataway, NJ, USA
| | - Lanjing Zhang
- Department of Pathology, Princeton Medical CenterPlainsboro, NJ, USA
- Department of Biological Sciences, Rutgers UniversityNewark, NJ, USA
- Rutgers Cancer Institute of New JerseyNew Brunswick, NJ, USA
- Department of Chemical Biology, Ernest Mario School of Pharmacy, Rutgers UniversityPiscataway, NJ, USA
| |
Collapse
|
32
|
Gravesteijn BY, Nieboer D, Ercole A, Lingsma HF, Nelson D, van Calster B, Steyerberg EW. Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury. J Clin Epidemiol 2020; 122:95-107. [PMID: 32201256 DOI: 10.1016/j.jclinepi.2020.03.005] [Citation(s) in RCA: 104] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 02/04/2020] [Accepted: 03/09/2020] [Indexed: 12/23/2022]
Abstract
OBJECTIVE We aimed to explore the added value of common machine learning (ML) algorithms for prediction of outcome for moderate and severe traumatic brain injury. STUDY DESIGN AND SETTING We performed logistic regression (LR), lasso regression, and ridge regression with key baseline predictors in the IMPACT-II database (15 studies, n = 11,022). ML algorithms included support vector machines, random forests, gradient boosting machines, and artificial neural networks and were trained using the same predictors. To assess generalizability of predictions, we performed internal, internal-external, and external validation on the recent CENTER-TBI study (patients with Glasgow Coma Scale <13, n = 1,554). Both calibration (calibration slope/intercept) and discrimination (area under the curve) was quantified. RESULTS In the IMPACT-II database, 3,332/11,022 (30%) died and 5,233(48%) had unfavorable outcome (Glasgow Outcome Scale less than 4). In the CENTER-TBI study, 348/1,554(29%) died and 651(54%) had unfavorable outcome. Discrimination and calibration varied widely between the studies and less so between the studied algorithms. The mean area under the curve was 0.82 for mortality and 0.77 for unfavorable outcomes in the CENTER-TBI study. CONCLUSION ML algorithms may not outperform traditional regression approaches in a low-dimensional setting for outcome prediction after moderate or severe traumatic brain injury. Similar to regression-based prediction models, ML algorithms should be rigorously validated to ensure applicability to new populations.
Collapse
Affiliation(s)
- Benjamin Y Gravesteijn
- Departments of Public Health, Erasmus MC - University Medical Centre Rotterdam, Postbus 2040, 3000 CA, Rotterdam, the Netherlands.
| | - Daan Nieboer
- Departments of Public Health, Erasmus MC - University Medical Centre Rotterdam, Rotterdam, the Netherlands
| | - Ari Ercole
- Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
| | - Hester F Lingsma
- Departments of Public Health, Erasmus MC - University Medical Centre Rotterdam, Rotterdam, the Netherlands
| | - David Nelson
- Department of Physiology and Pharmacology, Section of Perioperative Medicine and Intensive Care, Karolinska Institutet, Stockholm, Sweden
| | - Ben van Calster
- Department of Development and Regeneration, KU Leuven, Belgium; Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, the Netherlands
| | - Ewout W Steyerberg
- Departments of Public Health, Erasmus MC - University Medical Centre Rotterdam, Rotterdam, the Netherlands; Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, the Netherlands
| | | |
Collapse
|
33
|
Guo C, Lu M, Chen J. An evaluation of time series summary statistics as features for clinical prediction tasks. BMC Med Inform Decis Mak 2020; 20:48. [PMID: 32138733 PMCID: PMC7059727 DOI: 10.1186/s12911-020-1063-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 02/23/2020] [Indexed: 11/23/2022] Open
Abstract
Background Clinical prediction tasks such as patient mortality, length of hospital stay, and disease diagnosis are highly important in critical care research. The existing studies for clinical prediction mainly used simple summary statistics to summarize information from physiological time series. However, this lack of statistics leads to a lack of information. In addition, using only maximum and minimum statistics to indicate patient features fails to provide an adequate explanation. Few studies have evaluated which summary statistics best represent physiological time series. Methods In this paper, we summarize 14 statistics describing the characteristics of physiological time series, including the central tendency, dispersion tendency, and distribution shape. Then, we evaluate the use of summary statistics of physiological time series as features for three clinical prediction tasks. To find the combinations of statistics that yield the best performances under different tasks, we use a cross-validation-based genetic algorithm to approximate the optimal statistical combination. Results By experiments using the EHRs of 6,927 patients, we obtained prediction results based on both single statistics and commonly used combinations of statistics under three clinical prediction tasks. Based on the results of an embedded cross-validation genetic algorithm, we obtained 25 optimal sets of statistical combinations and then tested their prediction results. By comparing the performances of prediction with single statistics and commonly used combinations of statistics with quantitative analyses of the optimal statistical combinations, we found that some statistics play central roles in patient representation and different prediction tasks have certain commonalities. Conclusion Through an in-depth analysis of the results, we found many practical reference points that can provide guidance for subsequent related research. Statistics that indicate dispersion tendency, such as min, max, and range, are more suitable for length of stay prediction tasks, and they also provide information for short-term mortality prediction. Mean and quantiles that reflect the central tendency of physiological time series are more suitable for mortality and disease prediction. Skewness and kurtosis perform poorly when used separately for prediction but can be used as supplementary statistics to improve the overall prediction effect.
Collapse
Affiliation(s)
- Chonghui Guo
- Institute of Systems Engineering, Dalian University of Technology, No. 2 Linggong Road, Ganjingzi District, Dalian, 116024, People's Republic of China.
| | - Menglin Lu
- Institute of Systems Engineering, Dalian University of Technology, No. 2 Linggong Road, Ganjingzi District, Dalian, 116024, People's Republic of China
| | - Jingfeng Chen
- Institute of Systems Engineering, Dalian University of Technology, No. 2 Linggong Road, Ganjingzi District, Dalian, 116024, People's Republic of China.,Health Management Center, The First Affiliated Hospital of Zhengzhou University, No. 1 Longhu central ring road, Zhengzhou, 450052, People's Republic of China
| |
Collapse
|
34
|
Ullah R, Khan S, Ali H, Chaudhary II, Bilal M, Ahmad I. A comparative study of machine learning classifiers for risk prediction of asthma disease. Photodiagnosis Photodyn Ther 2019; 28:292-296. [PMID: 31614223 DOI: 10.1016/j.pdpdt.2019.10.011] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 10/02/2019] [Accepted: 10/07/2019] [Indexed: 12/21/2022]
Abstract
Asthma is a chronic disease characterized by wheezing, chest tightening and difficulty in breathing due to inflammation of lung airways. Early risk prediction of asthma is crucial for proper and effective management. This study presents the use of machine learning approach for risk prediction of asthma by evaluating Raman spectral variations between asthmatic as well as healthy sera samples. Specifically, Raman spectra from 150 asthma and 52 healthy control blood sera samples were acquired. Spectral analyses illustrated significant spectral variations (p < 0.0001) in the asthmatic samples when compared with healthy sera. The existing spectral differences were further exploited by using artificial neural network (ANN) along with support vector machine (SVM) and random forest (RF) algorithms towards machine-assisted classification of the two groups. Quantitative comparison of the evaluation metrics of the classification algorithms showed superior performance of SVM model. Our results indicate that Raman spectroscopy in tandem with SVM can be used in the diagnosis and machine-assisted classification of asthma patients with promising accuracy.
Collapse
Affiliation(s)
- Rahat Ullah
- Agri. & biophotonics Division, National Institute of Lasers & Optronics, Islamabad, Pakistan.
| | - Saranjam Khan
- Department of Physics, Islamia College Peshawar, Pakistan
| | - Hina Ali
- Agri. & biophotonics Division, National Institute of Lasers & Optronics, Islamabad, Pakistan
| | - Iqra Ishtiaq Chaudhary
- Department of Bioinformatics and Biotechnology, International Islamic University, Islamabad, Pakistan
| | - Muhammad Bilal
- Agri. & biophotonics Division, National Institute of Lasers & Optronics, Islamabad, Pakistan
| | - Iftikhar Ahmad
- Institute of Radiotherapy and Nuclear Medicine (IRNUM), Peshawar, Pakistan.
| |
Collapse
|
35
|
Lee S, Choe EK, Park B. Exploration of Machine Learning for Hyperuricemia Prediction Models Based on Basic Health Checkup Tests. J Clin Med 2019; 8:E172. [PMID: 30717373 PMCID: PMC6406925 DOI: 10.3390/jcm8020172] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Revised: 01/31/2019] [Accepted: 01/31/2019] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Machine learning (ML) is a promising methodology for classification and prediction applications in healthcare. However, this method has not been practically established for clinical data. Hyperuricemia is a biomarker of various chronic diseases. We aimed to predict uric acid status from basic healthcare checkup test results using several ML algorithms and to evaluate the performance. METHODS We designed a prediction model for hyperuricemia using a comprehensive health checkup database designed by the classification of ML algorithms, such as discrimination analysis, K-nearest neighbor, naïve Bayes (NBC), support vector machine, decision tree, and random forest classification (RFC). The performance of each algorithm was evaluated and compared with the performance of a conventional logistic regression (CLR) algorithm by receiver operating characteristic curve analysis. RESULTS Of the 38,001 participants, 7705 were hyperuricemic. For the maximum sensitivity criterion, NBC showed the highest sensitivity (0.73), and RFC showed the second highest (0.66); for the maximum balanced classification rate (BCR) criterion, RFC showed the highest BCR (0.68), and NBC showed the second highest (0.66) among the various ML algorithms for predicting uric acid status. In a comparison to the performance of NBC (area under the curve (AUC) = 0.669, 95% confidence intervals (CI) = 0.669⁻0.675) and RFC (AUC = 0.775, 95% CI 0.770⁻0.780) with a CLR algorithm (AUC = 0.568, 95% CI = 0.563⁻0.571), NBC and RFC showed significantly better performance (p < 0.001). CONCLUSIONS The ML model was superior to the CLR model for the prediction of hyperuricemia. Future studies are needed to determine the best-performing ML algorithms based on data set characteristics. We believe that this study will be informative for studies using ML tools in clinical research.
Collapse
Affiliation(s)
- Sangwoo Lee
- Network Division, Samsung Electronics, Suwon 16677, Korea.
| | - Eun Kyung Choe
- Department of Surgery, Seoul National University Hospital Healthcare System Gangnam Center, Seoul 06236, Korea.
- Department of Surgery, Seoul National University College of Medicine, Seoul 03080, Korea.
| | - Boram Park
- Department of Biomedical Science, Seoul National University Graduate School, Seoul 03081, Korea.
| |
Collapse
|
36
|
Sakr S, Elshawi R, Ahmed A, Qureshi WT, Brawner C, Keteyian S, Blaha MJ, Al-Mallah MH. Using machine learning on cardiorespiratory fitness data for predicting hypertension: The Henry Ford ExercIse Testing (FIT) Project. PLoS One 2018; 13:e0195344. [PMID: 29668729 PMCID: PMC5905952 DOI: 10.1371/journal.pone.0195344] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 03/20/2018] [Indexed: 12/17/2022] Open
Abstract
This study evaluates and compares the performance of different machine learning techniques on predicting the individuals at risk of developing hypertension, and who are likely to benefit most from interventions, using the cardiorespiratory fitness data. The dataset of this study contains information of 23,095 patients who underwent clinician- referred exercise treadmill stress testing at Henry Ford Health Systems between 1991 and 2009 and had a complete 10-year follow-up. The variables of the dataset include information on vital signs, diagnosis and clinical laboratory measurements. Six machine learning techniques were investigated: LogitBoost (LB), Bayesian Network classifier (BN), Locally Weighted Naive Bayes (LWB), Artificial Neural Network (ANN), Support Vector Machine (SVM) and Random Tree Forest (RTF). Using different validation methods, the RTF model has shown the best performance (AUC = 0.93) and outperformed all other machine learning techniques examined in this study. The results have also shown that it is critical to carefully explore and evaluate the performance of the machine learning models using various model evaluation methods as the prediction accuracy can significantly differ.
Collapse
Affiliation(s)
- Sherif Sakr
- King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudia Arabia
- University of Taru, Taru, Estonia
| | - Radwa Elshawi
- Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
- University of Taru, Taru, Estonia
| | - Amjad Ahmed
- King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudia Arabia
| | - Waqas T. Qureshi
- Wake Forest School of Medicine, Medical Center Boulevard, Winston-Salem, NC, United States of America
| | - Clinton Brawner
- Heart and Vascular Institute, Henry Ford Hospital System, Detroit, MI, United States of America
| | - Steven Keteyian
- Heart and Vascular Institute, Henry Ford Hospital System, Detroit, MI, United States of America
| | - Michael J. Blaha
- Johns Hopkins Medicine, Baltimore, Maryland, United States of America
| | - Mouaz H. Al-Mallah
- King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudia Arabia
- Heart and Vascular Institute, Henry Ford Hospital System, Detroit, MI, United States of America
| |
Collapse
|