1. Liu X, Liu X, Jin C, Luo Y, Yang L, Ning X, Zhuo C, Xiao F. Prediction models for diagnosis and prognosis of the colonisation or infection of multidrug-resistant organisms in adults: A systematic review, critical appraisal, and meta-analysis. Clin Microbiol Infect 2024:S1198-743X(24)00316-1. [PMID: 38992430] [DOI: 10.1016/j.cmi.2024.07.005]
Abstract
BACKGROUND Prediction models help target patients at risk of multidrug-resistant organism (MDRO) colonisation or infection and could serve as tools informing clinical practice to prevent MDRO transmission and inappropriate empiric antibiotic therapy. However, little evidence identifies which of the available models carry a low risk of bias and are suitable for clinical application. OBJECTIVES To identify, describe, appraise, and summarise the performance of all prognostic and diagnostic models developed or validated for predicting MDRO colonisation or infection. DATA SOURCES Six electronic literature databases and clinical registration databases were searched up to April 2022. STUDY ELIGIBILITY CRITERIA Development and validation studies of any multivariable prognostic or diagnostic model predicting MDRO colonisation or infection in adults. ASSESSMENT OF RISK OF BIAS The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess risk of bias. Evidence certainty was assessed using the GRADE approach. METHODS OF DATA SYNTHESIS Meta-analyses were conducted to summarise the discrimination and calibration of models whose external validations were conducted in at least two non-overlapping datasets. RESULTS We included 162 models (108 studies) developed for diagnosing (n=135) and predicting (n=27) MDRO colonisation or infection. Models exhibited a high risk of bias, especially in statistical analysis. High-frequency predictors were age, recent invasive procedures, antibiotic usage, and prior hospitalisation. Fewer than 25% of the models underwent external validation, with only seven validated by independent teams. Meta-analyses for one diagnostic and two prognostic models produced only very low to low certainty of evidence. CONCLUSIONS This review comprehensively described the models for identifying patients at risk of MDRO colonisation or infection.
We cannot recommend which models are ready for application, owing to high risk of bias, limited validation, and low certainty of evidence from meta-analyses; there is a clear need to improve the conduct and reporting of model development and external validation studies to facilitate clinical application.
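The meta-analytic step this review describes, pooling the discrimination of externally validated models across non-overlapping datasets, is conventionally done with a random-effects model. Below is a minimal sketch of DerSimonian-Laird pooling; the effect values are hypothetical and the review's own effect measures and data are not reproduced here.

```python
import math

def dersimonian_laird(effects, variances):
    """Pool per-study effects (e.g. logit-transformed AUCs) with the
    DerSimonian-Laird random-effects model."""
    w = [1.0 / v for v in variances]
    fixed = sum(wi * ei for wi, ei in zip(w, effects)) / sum(w)
    q = sum(wi * (ei - fixed) ** 2 for wi, ei in zip(w, effects))  # Cochran's Q
    df = len(effects) - 1
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c)  # between-study (heterogeneity) variance
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * ei for wi, ei in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, se, tau2

# three hypothetical external validations of one model
pooled, se, tau2 = dersimonian_laird([0.2, 0.8, 0.5], [0.05, 0.05, 0.05])
```

When studies disagree, tau2 grows and the pooled estimate's standard error widens accordingly, which is one reason pooled evidence here ended up with low certainty.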
Affiliation(s)
- Xu Liu
- Department of Infectious Diseases, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China
- Xi Liu
- Department of Infectious Diseases, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China
- Chenyue Jin
- Department of Infectious Diseases, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China
- Yuting Luo
- Department of Infectious Diseases, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China; Department of Infectious Diseases, Liuzhou People's Hospital, Liuzhou 545000, Guangxi, China
- Lianping Yang
- School of Public Health, Sun Yat-sen University, Guangzhou 510080, Guangdong, China
- Xinjiao Ning
- Department of Infectious Diseases, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China
- Chao Zhuo
- State Key Laboratory of Respiratory Disease, First Affiliated Hospital of Guangzhou Medical University, Guangzhou 510163, Guangdong, China
- Fei Xiao
- Department of Infectious Diseases, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, Guangdong, China; Guangdong Institute of Science and Technology, The First People's Hospital of Kashi, Kashi 844000, Xinjiang, China; State Key Laboratory of Anti-Infective Drug Development, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, Guangdong, China
2. Lyu C, Joehanes R, Huan T, Levy D, Li Y, Wang M, Liu X, Liu C, Ma J. Enhancing selection of alcohol consumption-associated genes by random forest. Br J Nutr 2024; 131:2058-2067. [PMID: 38606596] [PMCID: PMC11216877] [DOI: 10.1017/s0007114524000795]
Abstract
Machine learning methods have been used to identify omics markers for a variety of phenotypes. We aimed to examine whether a supervised machine learning algorithm can improve the identification of alcohol-associated transcriptomic markers. In this study, we analysed array-based, whole-blood-derived expression data for 17 873 gene transcripts in 5508 Framingham Heart Study participants. Using the Boruta algorithm, a supervised random forest (RF)-based feature selection method, we selected twenty-five alcohol-associated transcripts. In a testing set (30 % of the study participants), the AUCs (area under the receiver operating characteristic curve) of these twenty-five transcripts were 0·73, 0·69 and 0·66 for non-drinkers v. moderate drinkers, non-drinkers v. heavy drinkers and moderate drinkers v. heavy drinkers, respectively. The AUCs of the transcripts selected by the Boruta method were comparable to those identified using conventional linear regression models; for example, the AUCs of 1958 transcripts identified by conventional linear regression models (false discovery rate < 0·2) were 0·74, 0·66 and 0·65, respectively. With Bonferroni correction for the twenty-five Boruta-selected transcripts and three CVD risk factors (i.e. at P < 6·7e-4), thirteen transcripts were associated with obesity, three with type 2 diabetes and one with hypertension. For example, alcohol consumption was inversely associated with the expression of DOCK4, IL4R, and SORT1; DOCK4 and SORT1 were positively associated with obesity, while IL4R was inversely associated with hypertension. In conclusion, using a supervised machine learning method, the RF-based Boruta algorithm, we identified novel alcohol-associated gene transcripts.
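Boruta's core idea, comparing each real feature's random-forest importance against permuted "shadow" copies of the features, can be sketched as follows. This is a simplified illustration on synthetic data, not the authors' pipeline or the BorutaPy package.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def shadow_feature_select(X, y, n_rounds=10):
    """Boruta-style selection: keep a feature if its random-forest importance
    beats the best column-shuffled 'shadow' feature in a majority of rounds."""
    rng = np.random.default_rng(0)
    n, p = X.shape
    hits = np.zeros(p, dtype=int)
    for r in range(n_rounds):
        shadows = rng.permuted(X, axis=0)  # shuffling destroys any real signal
        rf = RandomForestClassifier(n_estimators=100, random_state=r)
        rf.fit(np.hstack([X, shadows]), y)
        imp = rf.feature_importances_
        hits += (imp[:p] > imp[p:].max()).astype(int)  # beat the best shadow?
    return hits > n_rounds / 2

# toy data: only the first two of six features carry signal
rng = np.random.default_rng(1)
X = rng.normal(size=(300, 6))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
mask = shadow_feature_select(X, y)
```

Because shadow features share each real feature's marginal distribution but not its association with the outcome, they set a data-driven importance threshold instead of an arbitrary cutoff.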
Affiliation(s)
- Chenglin Lyu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Department of Anatomy and Neurobiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA
- Roby Joehanes
- Framingham Heart Study and Population Sciences Branch, NHLBI, Framingham, MA
- Tianxiao Huan
- Framingham Heart Study and Population Sciences Branch, NHLBI, Framingham, MA
- Daniel Levy
- Framingham Heart Study and Population Sciences Branch, NHLBI, Framingham, MA
- Yi Li
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Mengyao Wang
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Xue Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Chunyu Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Jiantao Ma
- Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA
3. Kigo J, Kamau S, Mawji A, Mwaniki P, Dunsmuir D, Pillay Y, Zhang C, Pallot K, Ogero M, Kimutai D, Ouma M, Mohamed I, Chege M, Thuranira L, Kissoon N, Ansermino JM, Akech S. External validation of a paediatric Smart triage model for use in resource limited facilities. PLOS Digit Health 2024; 3:e0000293. [PMID: 38905166] [PMCID: PMC11192416] [DOI: 10.1371/journal.pdig.0000293]
Abstract
Models for digital triage of sick children at emergency departments of hospitals in resource-poor settings have been developed. However, prior to their adoption, external validation should be performed to ensure their generalizability. We externally validated a previously published nine-predictor paediatric triage model (Smart Triage), developed in Uganda, using data from two hospitals in Kenya. Both discrimination and calibration were assessed, and recalibration was performed by optimizing the intercept for classifying patients into emergency, priority, or non-urgent categories based on low-risk and high-risk thresholds. A total of 2539 patients were eligible at Hospital 1, 2464 at Hospital 2, and 5003 for both hospitals combined; admission rates were 8.9%, 4.5%, and 6.8%, respectively. The model showed good discrimination, with areas under the receiver operating characteristic curve (AUC) of 0.826, 0.784, and 0.821, respectively. At a low-risk threshold of 8%, the pre-calibrated model achieved sensitivities of 93% (95% confidence interval [CI]: 89%-96%), 81% (CI: 74%-88%), and 89% (CI: 85%-92%), respectively; at a high-risk threshold of 40%, it achieved specificities of 86% (CI: 84%-87%), 96% (CI: 95%-97%), and 91% (CI: 90%-92%), respectively. Recalibration improved the graphical fit of the calibration plot, but new risk thresholds were required to optimize sensitivity and specificity. The Smart Triage model showed good discrimination on external validation but required recalibration. There was no change in the order of prioritization of patients within the respective triage categories following recalibration. Recalibration required new site-specific risk thresholds, which may not be needed if prioritization based on rank is all that is required. The Smart Triage model shows promise for wider application in triage of sick children in different settings.
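Recalibration by "optimizing the intercept", as described above, shifts every prediction by the same constant on the logit scale so that the mean predicted risk matches the observed admission rate; patient ranking, and therefore AUC, is untouched. A minimal sketch with simulated data (not the Smart Triage model or the Kenyan cohorts):

```python
import numpy as np

def recalibrate_intercept(p, y, tol=1e-8):
    """Recalibration-in-the-large: Newton steps on the intercept-only
    likelihood until mean predicted risk equals the observed event rate.
    Ranking (hence discrimination/AUC) is unchanged; calibration improves."""
    logit = np.log(p / (1 - p))
    delta = 0.0
    for _ in range(100):
        q = 1 / (1 + np.exp(-(logit + delta)))
        step = np.sum(y - q) / np.sum(q * (1 - q))  # gradient / Hessian
        delta += step
        if abs(step) < tol:
            break
    return delta, 1 / (1 + np.exp(-(logit + delta)))

# simulate a model that overestimates risk by ~1 on the logit scale
rng = np.random.default_rng(0)
true_logit = rng.normal(-1, 1, size=2000)
y = (rng.random(2000) < 1 / (1 + np.exp(-true_logit))).astype(float)
p_over = 1 / (1 + np.exp(-(true_logit + 1.0)))  # miscalibrated predictions
delta, p_cal = recalibrate_intercept(p_over, y)
```

Because the shift is monotone, fixed risk thresholds (e.g. 8% and 40%) correspond to different patients after recalibration, which is why the abstract notes that new site-specific thresholds were needed.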
Affiliation(s)
- Joyce Kigo
- Health Service Unit, Kenya Medical Research Institute (KEMRI)-Wellcome Trust Research Programme, Nairobi, Kenya
- Stephen Kamau
- Health Service Unit, Kenya Medical Research Institute (KEMRI)-Wellcome Trust Research Programme, Nairobi, Kenya
- Alishah Mawji
- Centre for International Child Health, BC Children’s Hospital Research Institute, Vancouver, British Columbia, Canada
- Paul Mwaniki
- Health Service Unit, Kenya Medical Research Institute (KEMRI)-Wellcome Trust Research Programme, Nairobi, Kenya
- Dustin Dunsmuir
- Centre for International Child Health, BC Children’s Hospital Research Institute, Vancouver, British Columbia, Canada
- Department of Anesthesiology, Pharmacology & Therapeutics, University of British Columbia, Vancouver, British Columbia, Canada
- Yashodani Pillay
- Department of Anesthesiology, Pharmacology & Therapeutics, University of British Columbia, Vancouver, British Columbia, Canada
- Cherri Zhang
- Department of Anesthesiology, Pharmacology & Therapeutics, University of British Columbia, Vancouver, British Columbia, Canada
- Katija Pallot
- Department of Anesthesiology, Pharmacology & Therapeutics, University of British Columbia, Vancouver, British Columbia, Canada
- Morris Ogero
- Health Service Unit, Kenya Medical Research Institute (KEMRI)-Wellcome Trust Research Programme, Nairobi, Kenya
- David Kimutai
- Department of Pediatrics, Mbagathi County Hospital, Nairobi, Kenya
- Mary Ouma
- Department of Pediatrics, Mbagathi County Hospital, Nairobi, Kenya
- Ismael Mohamed
- Department of Pediatrics, Mbagathi County Hospital, Nairobi, Kenya
- Mary Chege
- Department of Pediatrics, Kiambu County Referral Hospital, Kiambu, Kenya
- Lydia Thuranira
- Department of Pediatrics, Kiambu County Referral Hospital, Kiambu, Kenya
- Niranjan Kissoon
- Department of Pediatrics, University of British Columbia, Vancouver, British Columbia, Canada
- J. Mark Ansermino
- Centre for International Child Health, BC Children’s Hospital Research Institute, Vancouver, British Columbia, Canada
- Department of Anesthesiology, Pharmacology & Therapeutics, University of British Columbia, Vancouver, British Columbia, Canada
- Samuel Akech
- Health Service Unit, Kenya Medical Research Institute (KEMRI)-Wellcome Trust Research Programme, Nairobi, Kenya
4. Khattak A, Chan PW, Chen F, Peng H. Interpretable ensemble imbalance learning strategies for the risk assessment of severe low-level wind shear based on LiDAR and PIREPs. Risk Anal 2024; 44:1084-1102. [PMID: 37700727] [DOI: 10.1111/risa.14215]
Abstract
The occurrence of severe low-level wind shear (S-LLWS) events in the vicinity of airport runways poses a significant threat to flight safety and is a growing problem in civil aviation. Identifying the risk factors that contribute to S-LLWS occurrences can help improve aviation safety. Despite its significant influence on aviation safety, S-LLWS occurs relatively infrequently compared with non-S-LLWS incidents. In this study, we develop an S-LLWS risk prediction model using ensemble imbalance learning (EIL) strategies, namely BalanceCascade, EasyEnsemble, and RUSBoost. The data for this study were obtained from pilot reports (PIREPs) and LiDAR at Hong Kong International Airport. The analysis revealed that the BalanceCascade strategy outperforms EasyEnsemble and RUSBoost in terms of prediction performance. The SHapley Additive exPlanations (SHAP) interpretation tool was then used in conjunction with the BalanceCascade model to assess the risk contributed by various factors. The four most influential risk factors, according to SHAP, were hourly temperature, runway 25LD, runway 25LA, and RWY (encounter location of the LLWS). S-LLWS was most likely to occur at runways 25LD and 25LA at low to moderate temperatures. Similarly, a high proportion of S-LLWS events occurred near the runway threshold, and a relatively small proportion occurred away from it. EIL strategies in conjunction with the SHAP interpretation tool can accurately predict S-LLWS without the need for data augmentation in the data pre-processing phase.
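Of the three EIL strategies named above, EasyEnsemble is the simplest to sketch: train one boosted learner per balanced random undersample of the majority class, then average the scores. A toy illustration on synthetic rare-event data (not the authors' LiDAR/PIREP features, and using AdaBoost in place of their exact learners):

```python
import numpy as np
from sklearn.ensemble import AdaBoostClassifier
from sklearn.metrics import roc_auc_score

def easy_ensemble_scores(X, y, X_new, n_subsets=10, seed=0):
    """EasyEnsemble-style EIL: one boosted classifier per balanced random
    undersample of the majority class; average the predicted scores."""
    rng = np.random.default_rng(seed)
    pos, neg = np.where(y == 1)[0], np.where(y == 0)[0]
    scores = np.zeros(len(X_new))
    for i in range(n_subsets):
        sub = rng.choice(neg, size=len(pos), replace=False)  # balance classes
        idx = np.concatenate([pos, sub])
        clf = AdaBoostClassifier(n_estimators=50, random_state=i).fit(X[idx], y[idx])
        scores += clf.predict_proba(X_new)[:, 1]
    return scores / n_subsets

# rare-event toy data (~5% positives), mimicking the S-LLWS class imbalance
rng = np.random.default_rng(2)
X = np.vstack([rng.normal(0, 1, (950, 4)), rng.normal(1.5, 1, (50, 4))])
y = np.array([0] * 950 + [1] * 50)
X_test = np.vstack([rng.normal(0, 1, (100, 4)), rng.normal(1.5, 1, (100, 4))])
y_test = np.array([0] * 100 + [1] * 100)
auc = roc_auc_score(y_test, easy_ensemble_scores(X, y, X_test))
```

Unlike SMOTE-style augmentation, this approach never fabricates minority samples, which matches the abstract's point that no data augmentation was needed in pre-processing.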
Affiliation(s)
- Afaq Khattak
- Key Laboratory of Infrastructure Durability and Operation Safety in Airfield of Civil Aviation Administration of China, College of Transportation Engineering, Tongji University, Jiading, Shanghai, China
- Pak-Wai Chan
- Hong Kong Observatory, Kowloon, Hong Kong, China
- Feng Chen
- Key Laboratory of Infrastructure Durability and Operation Safety in Airfield of Civil Aviation Administration of China, College of Transportation Engineering, Tongji University, Jiading, Shanghai, China
- Haorong Peng
- Key Laboratory of Infrastructure Durability and Operation Safety in Airfield of Civil Aviation Administration of China, College of Transportation Engineering, Tongji University, Jiading, Shanghai, China
5. Turchin A, Morrison FJ, Shubina M, Lipkovich I, Shinde S, Ahmad NN, Kan H. EXIST: EXamining rIsk of excesS adiposiTy-Machine learning to predict obesity-related complications. Obes Sci Pract 2024; 10:e707. [PMID: 38264008] [PMCID: PMC10804333] [DOI: 10.1002/osp4.707]
Abstract
Background Obesity is associated with an increased risk of multiple conditions, ranging from heart disease to cancer. However, few predictive models for these outcomes have been developed specifically for people with overweight or obesity. Objective To develop predictive models for obesity-related complications in patients with overweight and obesity. Methods Electronic health record data of adults with body mass index 25-80 kg/m2 treated in primary care practices between 2000 and 2019 were used to develop and evaluate predictive models for nine long-term clinical outcomes using (a) Lasso-Cox models and (b) random survival forests (RSF), a machine learning method. Models were trained on a training dataset and evaluated on a test dataset over 100 replicates. Parsimonious models of <10 variables were also developed using Lasso-Cox. Results Over a median follow-up of 5.6 years, study outcome incidence in the cohort of 433,272 patients ranged from 1.8% for knee replacement to 11.7% for atherosclerotic cardiovascular disease. The Harrell C-index, averaged over replicates, ranged from 0.702 for liver outcomes to 0.896 for death for RSF, and from 0.694 for liver outcomes to 0.891 for death for Lasso-Cox. The Harrell C-index for parsimonious models ranged from 0.675 for liver outcomes to 0.850 for knee replacement. Conclusions Predictive modeling can identify patients at high risk of obesity-related complications. Interpretable Cox models achieve results close to those of machine learning methods and could be helpful for population health management and clinical treatment decisions.
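The Harrell C-index reported above measures discrimination for censored time-to-event outcomes: among comparable patient pairs, the fraction in which the model assigns the higher risk to the patient who fails first. A minimal O(n²) sketch, for illustration only; the study used Lasso-Cox and RSF implementations, not this code:

```python
def harrell_c(time, event, risk):
    """Harrell's C-index for censored survival data: among usable pairs
    (the earlier time is an observed event), the fraction where the higher
    predicted risk failed earlier; risk ties count 1/2."""
    conc, usable = 0.0, 0
    n = len(time)
    for i in range(n):
        for j in range(n):
            if event[i] == 1 and time[i] < time[j]:
                usable += 1
                if risk[i] > risk[j]:
                    conc += 1.0
                elif risk[i] == risk[j]:
                    conc += 0.5
    return conc / usable

# the patient who fails earliest carries the highest predicted risk -> C = 1.0
c_perfect = harrell_c([1, 2, 3, 4], [1, 1, 1, 0], [4, 3, 2, 1])
```

Pairs whose earlier time is censored (event = 0) are skipped because we cannot tell which patient actually failed first, which is what distinguishes the C-index from a plain AUC.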
Affiliation(s)
- Alexander Turchin
- Brigham and Women's Hospital, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
- Hong Kan
- Eli Lilly and Company, Indianapolis, Indiana, USA
6. Chiasakul T, Lam BD, McNichol M, Robertson W, Rosovsky RP, Lake L, Vlachos IS, Adamski A, Reyes N, Abe K, Zwicker JI, Patell R. Artificial intelligence in the prediction of venous thromboembolism: A systematic review and pooled analysis. Eur J Haematol 2023; 111:951-962. [PMID: 37794526] [PMCID: PMC10900245] [DOI: 10.1111/ejh.14110]
Abstract
BACKGROUND Accurate diagnostic and prognostic prediction of venous thromboembolism (VTE) is crucial for VTE management. Artificial intelligence (AI) enables autonomous identification of the most predictive patterns from large, complex data. Although evidence regarding its performance in VTE prediction is emerging, a comprehensive analysis of that performance is lacking. AIMS To systematically review the performance of AI in the diagnosis and prediction of VTE and compare it to clinical risk assessment models (RAMs) or logistic regression models. METHODS A systematic literature search was performed using PubMed, MEDLINE, EMBASE, and Web of Science from inception to April 20, 2021. Search terms included "artificial intelligence" and "venous thromboembolism." Eligibility criteria were original studies evaluating AI in the prediction of VTE in adults and reporting one of the following outcomes: sensitivity, specificity, positive predictive value, negative predictive value, or area under the receiver operating characteristic curve (AUC). Risk of bias was assessed using the PROBAST tool. An unpaired t-test was performed to compare the mean AUC of AI versus conventional methods (RAMs or logistic regression models). RESULTS A total of 20 studies were included. The number of participants ranged from 31 to 111,888. The AI-based models included artificial neural networks (six studies), support vector machines (four studies), Bayesian methods (one study), super learner ensembles (one study), genetic programming (one study), unspecified machine learning models (two studies), and multiple machine learning models (five studies). Twelve studies (60%) had both training and testing cohorts. Among the 14 studies (70%) that reported AUCs, the mean AUC for AI versus conventional methods was 0.79 (95% CI: 0.74-0.85) versus 0.61 (95% CI: 0.54-0.68), respectively (p < .001).
However, this good-to-excellent discriminative performance is unlikely to be replicated in clinical practice, because most studies had a high risk of bias in missing-data handling and outcome determination. CONCLUSION The use of AI appears to improve the accuracy of diagnostic and prognostic prediction of VTE over conventional risk models; however, a high risk of bias was observed across studies. Future studies should focus on transparent reporting, external validation, and clinical application of these models.
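The pooled comparison above rests on an unpaired (two-sample) t-test of study-level AUCs. A minimal sketch with hypothetical AUC values; the review's real study-level numbers are not reproduced here:

```python
from scipy.stats import ttest_ind

# hypothetical study-level AUCs (illustrative values only, not the review's data)
auc_ai = [0.82, 0.75, 0.79, 0.85, 0.77, 0.80, 0.74]
auc_conventional = [0.60, 0.65, 0.58, 0.63, 0.61, 0.55, 0.66]

# unpaired two-sample t-test on the group means
t_stat, p_value = ttest_ind(auc_ai, auc_conventional)
```

Note that treating each study's AUC as one observation ignores within-study variance and sample size, one reason this kind of pooled analysis is weaker than a formal meta-analysis.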
Affiliation(s)
- Thita Chiasakul
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hematology, Faculty of Medicine, Department of Medicine, Center of Excellence in Translational Hematology, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Bangkok, Thailand
- Barbara D Lam
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Megan McNichol
- Division of Knowledge Services, Department of Information Services, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA
- William Robertson
- National Blood Clot Alliance, Philadelphia, Pennsylvania, USA
- Department of Emergency Healthcare, College of Health Professions, Weber State University, Ogden, Utah, USA
- Rachel P Rosovsky
- Division of Hematology/Oncology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Leslie Lake
- National Blood Clot Alliance, Philadelphia, Pennsylvania, USA
- Ioannis S Vlachos
- Department of Pathology, Cancer Research Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Alys Adamski
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
- Nimia Reyes
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
- Karon Abe
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
- Jeffrey I Zwicker
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Department of Medicine, Hematology Service, Memorial Sloan Kettering Cancer Center, New York City, New York, USA
- Rushad Patell
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
7. Liosis KC, Marouf AA, Rokne JG, Ghosh S, Bismar TA, Alhajj R. Genomic Biomarker Discovery in Disease Progression and Therapy Response in Bladder Cancer Utilizing Machine Learning. Cancers (Basel) 2023; 15:4801. [PMID: 37835496] [PMCID: PMC10571566] [DOI: 10.3390/cancers15194801]
Abstract
Cancer in all its forms of expression is a major cause of death. Identifying the genomic basis of cancer requires the discovery of biomarkers. In this paper, genomic data of bladder cancer are examined for the purpose of biomarker discovery. Genomic biomarkers are indicators derived from the study of the genome, either at a very low level, based on the genome sequence itself, or more abstractly, such as the level of gene expression measured across different disease groups. The latter approach is pivotal for this work, since the available datasets consist of RNA sequencing data transformed to gene expression levels, together with a multitude of clinical indicators. On this basis, various methods are utilized, including statistical modeling via logistic regression with regularization (elastic-net), clustering, survival analysis through Kaplan-Meier curves, and heatmaps, in the experiments leading to biomarker discovery. The experiments led to the discovery of two gene signatures that predict therapy response and disease progression in bladder cancer patients with considerable accuracy, and that correlate well with clinical indicators such as therapy response and T-stage at surgery in a time-to-event manner.
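The elastic-net step described above yields a sparse gene "signature" by mixing L1 and L2 penalties in a logistic model. A minimal sketch on synthetic expression data; the gene counts, penalty strength, and resulting signature are illustrative, not the paper's results:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# toy "expression matrix": 200 samples x 50 genes, genes 0-2 truly predictive
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))
y = (X[:, 0] - X[:, 1] + 0.5 * X[:, 2] + rng.normal(0, 0.5, 200) > 0).astype(int)

# elastic-net mixes an L1 penalty (sparsity -> a short gene signature)
# with an L2 penalty (stability among correlated genes)
enet = LogisticRegression(penalty="elasticnet", solver="saga",
                          l1_ratio=0.5, C=0.3, max_iter=5000).fit(X, y)
signature = np.flatnonzero(enet.coef_[0])  # indices of retained "signature" genes
```

In practice `l1_ratio` and `C` would be chosen by cross-validation; the L1 component is what drives most noise genes to exactly zero and keeps the signature short.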
Affiliation(s)
- Konstantinos Christos Liosis
- Department of Computer Science, University of Calgary, Calgary, AB T2N 1N4, Canada
- Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
- Ahmed Al Marouf
- Department of Computer Science, University of Calgary, Calgary, AB T2N 1N4, Canada
- Jon G Rokne
- Department of Computer Science, University of Calgary, Calgary, AB T2N 1N4, Canada
- Sunita Ghosh
- Department of Medical Oncology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, AB T6G 2R7, Canada
- Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, AB T6G 2J5, Canada
- Tarek A Bismar
- Department of Pathology and Laboratory Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada
- Departments of Oncology, Biochemistry and Molecular Biology, Cumming School of Medicine, Calgary, AB T2N 4N1, Canada
- Tom Baker Cancer Center, Arnie Charbonneau Cancer Institute, Calgary, AB T2N 4N1, Canada
- Prostate Cancer Center, Calgary, AB T2V 1P9, Canada
- Reda Alhajj
- Department of Computer Science, University of Calgary, Calgary, AB T2N 1N4, Canada
- Department of Computer Engineering, Istanbul Medipol University, Istanbul 34810, Turkey
- Department of Health Informatics, University of Southern Denmark, 5230 Odense, Denmark
8. Wang X, Ren H, Ren J, Song W, Qiao Y, Ren Z, Zhao Y, Linghu L, Cui Y, Zhao Z, Chen L, Qiu L. Machine learning-enabled risk prediction of chronic obstructive pulmonary disease with unbalanced data. Comput Methods Programs Biomed 2023; 230:107340. [PMID: 36640604] [DOI: 10.1016/j.cmpb.2023.107340]
Abstract
BACKGROUND AND OBJECTIVE Because the early symptoms of chronic obstructive pulmonary disease (COPD) are not obvious, patients are not easily identified, and the window for prevention and treatment is missed. In the present study, machine learning (ML) methods were employed to construct a risk prediction model for COPD to improve its prediction efficiency. METHODS We collected data from a sample of 5807 cases with a complete COPD diagnosis from the 2019 COPD Surveillance Program in Shanxi Province and extracted 34 potentially relevant variables from the dataset. First, we used feature selection methods (generalized elastic net, lasso, and adaptive lasso) to select ten variables. We then built supervised classifiers for the class-imbalanced data by combining cost-sensitive learning and SMOTE resampling with seven ML methods (logistic regression, SVM, random forest, XGBoost, LightGBM, NGBoost, and stacking), and assessed their performance. RESULTS Frequent cough at or before age 14 and nine other variables were significant predictors of COPD. The stacking heterogeneous ensemble model showed relatively good performance on the unbalanced datasets. Logistic regression with class weighting achieved the best classification performance on the balanced data when the composite indicators (AUC, F1-score, and G-mean) were used as criteria for model comparison. The F1-score/G-mean values for the top three ML models were 0.290/0.660 for logistic regression with class weighting, 0.288/0.649 for stacking with the synthetic minority oversampling technique (SMOTE), and 0.285/0.648 for LightGBM with SMOTE.
CONCLUSIONS Combining feature selection methods, imbalanced-data processing methods, and machine learning methods with data from disease surveillance questionnaires and physical measurements, this study concludes that machine learning models based on survey questionnaires can provide automated identification of patients at risk of COPD and a simple, scientific aid for its early identification.
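Class weighting (cost-sensitive learning), one of the imbalance strategies above, reweights the minority class inside the loss function instead of resampling the data, and the G-mean rewards balanced sensitivity and specificity. A toy sketch comparing plain versus class-weighted logistic regression on synthetic data; this is not the Shanxi surveillance dataset or the study's models:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix

def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity and specificity."""
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return np.sqrt((tp / (tp + fn)) * (tn / (tn + fp)))

# imbalanced toy data (~9% positives)
rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 5))
y = (X[:, 0] + X[:, 1] + rng.normal(0, 1, 2000) > 2.3).astype(int)

plain = LogisticRegression().fit(X, y)
weighted = LogisticRegression(class_weight="balanced").fit(X, y)  # cost-sensitive
gm_plain = g_mean(y, plain.predict(X))
gm_weighted = g_mean(y, weighted.predict(X))
```

With a rare positive class, the unweighted model favours specificity at the expense of sensitivity, so its G-mean suffers; `class_weight="balanced"` upweights minority-class errors and typically recovers a more balanced operating point.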
Affiliation(s)
- Xuchun Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Hao Ren
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Jiahui Ren
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Wenzhu Song
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Yuchao Qiao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Zeping Ren
- Shanxi Centre for Disease Control and Prevention, Taiyuan, Shanxi 030012, China
- Ying Zhao
- Shanxi Centre for Disease Control and Prevention, Taiyuan, Shanxi 030012, China
- Liqin Linghu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China; Shanxi Centre for Disease Control and Prevention, Taiyuan, Shanxi 030012, China
- Yu Cui
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Zhiyang Zhao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
- Limin Chen
- The Fifth Hospital (Shanxi People's Hospital) of Shanxi Medical University, No. 29, Shuangtaji Street, Taiyuan, Shanxi 030012, China
- Lixia Qiu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, 56 South XinJian Road, Taiyuan, Shanxi 030001, China
9. Lu Y, Chen Q, Zhang H, Huang M, Yao Y, Ming Y, Yan M, Yu Y, Yu L. Machine Learning Models of Postoperative Atrial Fibrillation Prediction After Cardiac Surgery. J Cardiothorac Vasc Anesth 2023; 37:360-366. [PMID: 36535840] [DOI: 10.1053/j.jvca.2022.11.025]
Abstract
OBJECTIVES This study aimed to use machine learning algorithms to build an efficient model for forecasting atrial fibrillation after cardiac surgery, and to compare the predictive performance of machine learning with traditional logistic regression. DESIGN A retrospective study. SETTING Second Affiliated Hospital of Zhejiang University School of Medicine. PARTICIPANTS The study comprised 1,400 patients who underwent valve and/or coronary artery bypass grafting surgery with cardiopulmonary bypass from September 1, 2013 to December 31, 2018. INTERVENTIONS None. MEASUREMENTS AND MAIN RESULTS Two machine learning approaches (gradient-boosting decision tree and support-vector machine) and logistic regression were used to build predictive models, and their performance was compared by the area under the curve (AUC). Clinical practicability was assessed using decision curve analysis. Postoperative atrial fibrillation occurred in 519 patients (37.1%). The AUCs of the support-vector machine, logistic regression, and gradient-boosting decision tree models were 0.777 (95% CI: 0.772-0.781), 0.767 (95% CI: 0.762-0.772), and 0.765 (95% CI: 0.761-0.770), respectively. Decision curve analysis showed that these models achieved appropriate net benefit. CONCLUSION In the authors' study, the support-vector machine model was the best predictor; it may be an effective tool for predicting atrial fibrillation after cardiac surgery.
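The AUCs compared in this abstract are rank statistics and can be computed without any modelling library. A minimal sketch of the Mann-Whitney form of the AUC, using made-up scores and labels rather than the study's data:

```python
def roc_auc(labels, scores):
    """Rank-based AUC: the probability that a randomly chosen positive
    case is scored above a randomly chosen negative case (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Illustrative toy data (not from the study): 1 = postoperative AF occurred
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1]
print(round(roc_auc(labels, scores), 3))  # -> 0.917
```

This rank formulation is what makes small AUC differences such as 0.777 versus 0.765 interpretable across models scored on the same patients.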
Affiliation(s)
- Yufan Lu: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China; Department of Anesthesiology, Taizhou Central Hospital (Taizhou University Hospital), Zhejiang, China
- Qingjuan Chen: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Hu Zhang: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Meijiao Huang: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Yu Yao: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Yue Ming: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Min Yan: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Yunxian Yu: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
- Lina Yu: Department of Anesthesiology, Second Affiliated Hospital of Zhejiang University School of Medicine, Zhejiang University, Zhejiang, China
10
Zheng S, Li Y, Luo C, Chen F, Ling G, Zheng B. Machine Learning for Predicting the Development of Postoperative Acute Kidney Injury After Coronary Artery Bypass Grafting Without Extracorporeal Circulation. Cardiovascular Innovations and Applications 2023. [DOI: 10.15212/cvia.2023.0006]
Abstract
Background: Cardiac surgery-associated acute kidney injury (CSA-AKI) is a major complication that increases morbidity and mortality after cardiac surgery. Most established predictive models are limited in their ability to capture nonlinear relationships and do not adequately consider intraoperative and early postoperative variables. Coronary artery bypass grafting without extracorporeal circulation (off-pump CABG) remains the procedure of choice for most coronary surgeries, yet refined CSA-AKI predictive models for off-pump CABG are notably lacking. Therefore, this study used an artificial intelligence-based machine learning approach to predict CSA-AKI from comprehensive perioperative data.
Methods: In total, 293 variables were analysed in the clinical data of patients undergoing off-pump CABG in the Department of Cardiac Surgery at the First Affiliated Hospital of Guangxi Medical University between 2012 and 2021. According to the KDIGO criteria, postoperative AKI was defined by an elevation of at least 50% within 7 days, or 0.3 mg/dL within 48 hours, with respect to the reference serum creatinine level. Five machine learning algorithms—a simple decision tree, random forest, support vector machine, extreme gradient boosting and gradient boosting decision tree (GBDT)—were used to construct the CSA-AKI predictive model. The performance of these models was evaluated with the area under the receiver operating characteristic curve (AUC). Shapley additive explanation (SHAP) values were used to explain the predictive model.
Results: The three most influential features in the importance matrix plot were 1-day postoperative serum potassium concentration, 1-day postoperative serum magnesium ion concentration, and 1-day postoperative serum creatine phosphokinase concentration.
Conclusion: GBDT exhibited the largest AUC (0.87) and can be used to predict the risk of AKI development after surgery, thus enabling clinicians to optimise treatment strategies and minimise postoperative complications.
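The KDIGO-style outcome definition in the Methods reduces to two serum creatinine checks. A small sketch of that rule, using illustrative values (not study data) and covering only the two quoted creatinine criteria, not the full KDIGO staging or urine-output criteria:

```python
def kdigo_aki(baseline_scr, measurements):
    """Flag postoperative AKI per the rule quoted in the study's Methods:
    serum creatinine rising by >=0.3 mg/dL over baseline within 48 hours,
    or reaching >=1.5x baseline (a >=50% elevation) within 7 days.
    measurements: list of (hours_after_surgery, scr_mg_dl) tuples."""
    for hours, scr in measurements:
        if hours <= 48 and scr - baseline_scr >= 0.3:
            return True
        if hours <= 7 * 24 and scr >= 1.5 * baseline_scr:
            return True
    return False

# Illustrative values: baseline 1.0 mg/dL
print(kdigo_aki(1.0, [(24, 1.2), (72, 1.6)]))   # True: 1.6 is >=1.5x baseline at 72 h
print(kdigo_aki(1.0, [(24, 1.1), (120, 1.3)]))  # False: neither criterion met
```

Encoding the label definition explicitly like this is the step that turns raw perioperative records into the binary outcome the five classifiers were trained on.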
Affiliation(s)
- Sai Zheng: The First Affiliated Hospital of Guangxi Medical University, Cardiac Surgery, Nanning, Guangxi, China
- Yugui Li: The First Affiliated Hospital of Guangxi Medical University, Cardiac Surgery, Nanning, Guangxi, China
- Cheng Luo: The First Affiliated Hospital of Guangxi Medical University, Cardiac Surgery, Nanning, Guangxi, China
- Fang Chen: The First Affiliated Hospital of Guangxi Medical University, Cardiac Surgery, Nanning, Guangxi, China
- Guoxing Ling: The First Affiliated Hospital of Guangxi Medical University, Cardiac Surgery, Nanning, Guangxi, China
- Baoshi Zheng: The First Affiliated Hospital of Guangxi Medical University, Cardiac Surgery, Nanning, Guangxi, China
11
Scott-Fordsmand JJ, Amorim MJB. Using Machine Learning to make nanomaterials sustainable. Science of the Total Environment 2023; 859:160303. [PMID: 36410486] [DOI: 10.1016/j.scitotenv.2022.160303]
Abstract
Sustainable development is a key challenge for contemporary human societies; failure to achieve sustainability could threaten human survival. In this review article, we illustrate how Machine Learning (ML) could support more sustainable development, covering the basics of data gathering through each step of the Environmental Risk Assessment (ERA). The literature provides several examples showing how ML can be employed in most steps of a typical ERA. A key observation is that there is currently no clear guidance for using such autonomous technologies in ERAs, nor on which standards/checks are required. Steering thus seems to be the most important task for supporting the use of ML in the ERA of nano- and smart-materials. Resources should be devoted to developing a strategy for implementing ML in ERA, with a strong emphasis on data foundations, methodologies, and the related sensitivities/uncertainties. We should recognise historical errors and biases (e.g., in data) to avoid embedding them during ML programming.
Affiliation(s)
- Mónica J B Amorim: Department of Biology & CESAM, University of Aveiro, 3810-193 Aveiro, Portugal
12
Pommerich UM, Stubbs PW, Eggertsen PP, Fabricius J, Nielsen JF. Regression-based prognostic models for functional independence after postacute brain injury rehabilitation are not transportable: a systematic review. J Clin Epidemiol 2023; 156:53-65. [PMID: 36764467] [DOI: 10.1016/j.jclinepi.2023.02.009]
Abstract
BACKGROUND AND OBJECTIVES To identify and summarize validated multivariable prognostic models for the Functional Independence Measure® (FIM®) at discharge from post-acute inpatient rehabilitation in adults with acquired brain injury (ABI). METHODS This review was conducted based on the recommendations of the Cochrane Prognosis Methods Group and adheres to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Three databases were systematically searched in May 2021, with the search updated in April 2022. Main inclusion criteria were: a) adult patients with ABI, b) a validated multivariable prognostic model, c) time of prognostication within 1 week of admission to post-acute rehabilitation, and d) outcome measured as the FIM® at discharge from post-acute rehabilitation. RESULTS The search yielded 3,169 unique articles. Three articles fulfilled the inclusion criteria, accounting for n = 6 internally and n = 2 externally validated prognostic models. Discrimination was estimated as an area under the curve between 0.76 and 0.89. Calibration was insufficiently assessed. The included models were judged to be at high risk of bias. CONCLUSION Current prognostic models for the FIM® in post-acute rehabilitation for patients with ABI lack the methodological rigor to support clinical use outside the development setting. Future studies addressing functional independence should ensure appropriate model validation and conform to uniform reporting standards for prognosis research.
Affiliation(s)
- Uwe M Pommerich: Hammel Neurorehabilitation Centre and University Research Clinic, Department of Clinical Medicine, Aarhus University, Hammel, Denmark
- Peter W Stubbs: Discipline of Physiotherapy, Graduate School of Health, University of Technology Sydney, Ultimo 2007, Australia
- Peter Preben Eggertsen: Hammel Neurorehabilitation Centre and University Research Clinic, Department of Clinical Medicine, Aarhus University, Hammel, Denmark
- Jesper Fabricius: Hammel Neurorehabilitation Centre and University Research Clinic, Department of Clinical Medicine, Aarhus University, Hammel, Denmark
- Jørgen Feldbæk Nielsen: Hammel Neurorehabilitation Centre and University Research Clinic, Department of Clinical Medicine, Aarhus University, Hammel, Denmark
13
Chowdhury MZI, Leung AA, Walker RL, Sikdar KC, O'Beirne M, Quan H, Turin TC. A comparison of machine learning algorithms and traditional regression-based statistical modeling for predicting hypertension incidence in a Canadian population. Sci Rep 2023; 13:13. [PMID: 36593280] [PMCID: PMC9807553] [DOI: 10.1038/s41598-022-27264-x]
Abstract
Risk prediction models are frequently used to identify individuals at risk of developing hypertension. This study evaluates different machine learning algorithms and compares their predictive performance with the conventional Cox proportional hazards (PH) model for predicting hypertension incidence using survival data. The study analyzed 18,322 participants and 24 candidate features from the large Alberta's Tomorrow Project (ATP) to develop different prediction models. To select the top features, we applied five feature selection methods: two filter-based (a univariate Cox p-value and the C-index), two embedded (random survival forest and the least absolute shrinkage and selection operator, Lasso), and one constraint-based (the statistically equivalent signature, SES). Five machine learning algorithms were developed to predict hypertension incidence: penalized regression (Ridge, Lasso, and Elastic Net [EN]), random survival forest (RSF), and gradient boosting (GB), along with the conventional Cox PH model. The predictive performance of the models was assessed using the C-index. The machine learning algorithms performed similarly to the conventional Cox PH model: average C-indexes were 0.78, 0.78, 0.78, 0.76, 0.76, and 0.77 for Ridge, Lasso, EN, RSF, GB, and Cox PH, respectively. Important features associated with each model were also presented. Our findings demonstrate little difference in predictive performance between machine learning algorithms and the conventional Cox PH regression model for predicting hypertension incidence. In a moderate dataset with a reasonable number of features, conventional regression-based models perform similarly to machine learning algorithms, with good predictive accuracy.
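The C-index used to compare these survival models is Harrell's concordance: among usable pairs of subjects, the fraction where the model assigns higher risk to the subject who had the event earlier. A pure-Python sketch with invented follow-up data, not the ATP cohort:

```python
def harrell_c(times, events, risks):
    """Harrell's concordance index for right-censored survival data.
    A pair (i, j) is comparable when subject i's observed event time is
    earlier than subject j's follow-up time; it is concordant when the
    earlier-event subject also has the higher predicted risk."""
    concordant = comparable = 0.0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if times[i] < times[j] and events[i] == 1:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1
                elif risks[i] == risks[j]:
                    concordant += 0.5
    return concordant / comparable

# Illustrative data: follow-up in years, 1 = developed hypertension,
# risks = model's predicted risk scores
times  = [2.0, 4.0, 5.0, 6.0]
events = [1,   1,   0,   1]
risks  = [0.9, 0.3, 0.2, 0.4]
print(round(harrell_c(times, events, risks), 2))  # -> 0.8
```

The reported C-indexes of 0.76-0.78 mean roughly 77% of such comparable pairs were ordered correctly, regardless of which algorithm produced the risk scores.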
Affiliation(s)
- Mohammad Ziaul Islam Chowdhury: Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada; Department of Family Medicine, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; present address: Department of Psychiatry, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada
- Alexander A. Leung: Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada; Department of Medicine, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada
- Robin L. Walker: Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada; Primary Health Care Integration Network, Primary Health Care, Alberta Health Services, Calgary, AB, Canada
- Khokan C. Sikdar: Health Status Assessment, Surveillance and Reporting, Public Health Surveillance and Infrastructure, Provincial Population and Public Health, Alberta Health Services, 10101 Southport Rd. SW, Calgary, AB T2W 3N2, Canada
- Maeve O'Beirne: Department of Family Medicine, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
- Hude Quan: Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada
- Tanvir C. Turin: Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6, Canada; Department of Family Medicine, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
14
Ranapurwala SI, Miller VE, Carey TS, Gaynes BN, Keil AP, Fitch CV, Swilley-Martinez ME, Kavee AL, Cooper T, Dorris S, Goldston DB, Peiper LJ, Pence BW. Innovations in suicide prevention research (INSPIRE): a protocol for a population-based case-control study. Inj Prev 2022; 28:injuryprev-2022-044609. [PMID: 35701110] [PMCID: PMC10213808] [DOI: 10.1136/injuryprev-2022-044609]
Abstract
BACKGROUND Suicide deaths have been increasing for the past 20 years in the USA, resulting in 45,979 deaths in 2020, a 29% increase since 1999. Lack of data linkage between entities with the potential to implement large suicide prevention initiatives (health insurers, health institutions, and corrections) is a barrier to developing an integrated framework for suicide prevention. OBJECTIVES To link death records with several large administrative datasets in order to (1) estimate associations between risk factors and suicide outcomes, (2) develop predictive algorithms, and (3) establish a long-term data linkage workflow to ensure ongoing suicide surveillance. METHODS We will combine six data sources from North Carolina, the 10th most populous state in the USA, from 2006 onward, including death certificate records, the violent deaths reporting system, large private health insurance claims data, Medicaid claims data, University of North Carolina electronic health records, and data on justice-involved individuals released from incarceration. We will determine the incidence of death from suicide, suicide attempts, and ideation in the four subpopulations to establish benchmarks. We will use a nested case-control design with incidence density-matched population-based controls to (1) identify short-term and long-term risk factors associated with suicide attempts and mortality and (2) develop machine learning-based predictive algorithms to identify individuals at risk of suicide death. DISCUSSION We will address gaps in prior studies by establishing an in-depth linked suicide surveillance system integrating multiple large, comprehensive databases that permits establishment of benchmarks, identification of predictors, evaluation of prevention efforts, and establishment of long-term surveillance workflow protocols.
Affiliation(s)
- Shabbar I Ranapurwala: Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA; Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
- Vanessa E Miller: Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
- Timothy S Carey: Cecil G Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA; Department of Medicine, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, North Carolina, USA
- Bradley N Gaynes: Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
- Alexander P Keil: Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
- Catherine Vinita Fitch: Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA; Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
- Monica E Swilley-Martinez: Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA; Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
- Andrew L Kavee: Cecil G Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
- Toska Cooper: Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
- Samantha Dorris: Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
- David B Goldston: Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, North Carolina, USA
- Lewis J Peiper: Division of Adult Correction - Prisons, North Carolina Department of Public Safety, Raleigh, North Carolina, USA
- Brian W Pence: Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA; Injury Prevention Research Center, University of North Carolina, Chapel Hill, North Carolina, USA
15
Greenberg JK, Otun A, Ghogawala Z, Yen PY, Molina CA, Limbrick DD, Foraker RE, Kelly MP, Ray WZ. Translating Data Analytics Into Improved Spine Surgery Outcomes: A Roadmap for Biomedical Informatics Research in 2021. Global Spine J 2022; 12:952-963. [PMID: 33973491] [PMCID: PMC9344511] [DOI: 10.1177/21925682211008424]
Abstract
STUDY DESIGN Narrative review. OBJECTIVES There is growing interest in the use of biomedical informatics and data analytics tools in spine surgery. Yet despite the rapid growth in research on these topics, few analytic tools have been implemented in routine spine practice. The purpose of this review is to provide a health information technology (HIT) roadmap to help translate data assets and analytics tools into measurable advances in spine surgical care. METHODS We conducted a narrative review of PubMed and Google Scholar to identify publications discussing data assets, analytical approaches, and implementation strategies relevant to spine surgery practice. RESULTS A variety of data assets are available for spine research, ranging from commonly used datasets, such as administrative billing data, to emerging resources, such as mobile health and biobanks. Both regression and machine learning techniques are valuable for analyzing these assets, and researchers should recognize the particular strengths and weaknesses of each approach. Few studies have focused on the implementation of HIT, and a variety of methods exist to help translate analytic tools into clinically useful interventions. Finally, a number of HIT-related challenges must be recognized and addressed, including stakeholder acceptance, regulatory oversight, and ethical considerations. CONCLUSIONS Biomedical informatics has the potential to support the development of new HIT that can improve spine surgery quality and outcomes. By understanding the development life-cycle that includes identifying an appropriate data asset, selecting an analytic approach, and leveraging an effective implementation strategy, spine researchers can translate this potential into measurable advances in patient care.
Affiliation(s)
- Jacob K. Greenberg: Department of Neurological Surgery, Washington University School of Medicine, St. Louis, MO, USA. Correspondence: Department of Neurosurgery, Washington University School of Medicine, 660 S. Euclid Ave., Box 8057, St. Louis, MO 63110, USA
- Ayodamola Otun: Department of Neurological Surgery, Washington University School of Medicine, St. Louis, MO, USA
- Zoher Ghogawala: Department of Neurosurgery, Lahey Hospital and Medical Center, Burlington, MA, USA
- Po-Yin Yen: Institute for Informatics, Washington University School of Medicine, St. Louis, MO, USA
- Camilo A. Molina: Department of Neurological Surgery, Washington University School of Medicine, St. Louis, MO, USA
- David D. Limbrick: Department of Neurological Surgery, Washington University School of Medicine, St. Louis, MO, USA
- Randi E Foraker: Institute for Informatics, Washington University School of Medicine, St. Louis, MO, USA
- Michael P. Kelly: Department of Orthopaedic Surgery, Washington University School of Medicine, St. Louis, MO, USA
- Wilson Z. Ray: Department of Neurological Surgery, Washington University School of Medicine, St. Louis, MO, USA
16
Pedersen CF, Andersen MØ, Carreon LY, Eiskjær S. Applied Machine Learning for Spine Surgeons: Predicting Outcome for Patients Undergoing Treatment for Lumbar Disc Herniation Using PRO Data. Global Spine J 2022; 12:866-876. [PMID: 33203255] [PMCID: PMC9344505] [DOI: 10.1177/2192568220967643]
Abstract
STUDY DESIGN Retrospective/prospective study. OBJECTIVE Models based on preoperative factors can predict patients' outcomes at 1-year follow-up. This study measures the performance of several machine learning (ML) models and compares the results with conventional methods. METHODS Inclusion criteria were patients who had lumbar disc herniation (LDH) surgery, identified in the Danish national registry for spine surgery. Initial training of the models included 16 independent variables, including demographics and presurgical patient-reported measures. Patients were grouped by whether they reached the minimal clinically important difference for EuroQol, the Oswestry Disability Index, Visual Analog Scale (VAS) Leg, and VAS Back, and by their ability to return to work at 1-year follow-up. Data were randomly split into training, validation, and test sets (50%/35%/15%). Deep learning, decision tree, random forest, boosted trees, and support vector machine models were trained; for comparison, multivariate adaptive regression splines (MARS) and logistic regression models were used. Model fit was evaluated by inspecting area under the curve (AUC) values and performance during validation. RESULTS Seven models were developed. Classification errors were within ±1% to 4% SD across validation folds. ML did not yield superior performance compared with the conventional models. MARS and deep learning performed consistently well. Discrepancy was greatest among the VAS Leg models. CONCLUSIONS Five predictive ML models and 2 conventional models were developed to predict improvement for LDH patients at the 1-year follow-up. We demonstrate that it is possible to build an ensemble of models with little effort as a starting point for further model optimization and selection.
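Dichotomising outcomes by whether each patient's 1-year change reaches the minimal clinically important difference (MCID), as this study does, can be sketched as follows. The scores and the MCID threshold below are illustrative parameters, not the registry's published values:

```python
def mcid_labels(baseline, followup, mcid, higher_is_better=True):
    """Build the binary training outcome: 1 if a patient's change from
    baseline to follow-up meets or exceeds the MCID, else 0. For scales
    where lower is better (e.g. pain VAS), improvement is baseline minus
    follow-up."""
    labels = []
    for b, f in zip(baseline, followup):
        change = (f - b) if higher_is_better else (b - f)
        labels.append(1 if change >= mcid else 0)
    return labels

# Illustrative VAS Leg scores (0-100, lower is better) with an assumed MCID of 20
baseline = [70, 55, 80]
year1    = [30, 50, 65]
print(mcid_labels(baseline, year1, mcid=20, higher_is_better=False))  # -> [1, 0, 0]
```

Because every model in the ensemble is trained against the same MCID-derived labels, their AUCs are directly comparable per outcome scale.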
Affiliation(s)
- Casper Friis Pedersen: Lillebaelt Hospital, Middelfart, Denmark; University of Southern Denmark, Odense, Denmark
- Leah Yacat Carreon: Lillebaelt Hospital, Middelfart, Denmark; University of Southern Denmark, Odense, Denmark
17
Basmadjian RB, Kong S, Boyne DJ, Jarada TN, Xu Y, Cheung WY, Lupichuk S, Quan ML, Brenner DR. Developing a Prediction Model for Pathologic Complete Response Following Neoadjuvant Chemotherapy in Breast Cancer: A Comparison of Model Building Approaches. JCO Clin Cancer Inform 2022; 6:e2100055. [PMID: 35148170] [PMCID: PMC8846388] [DOI: 10.1200/cci.21.00055]
Abstract
Identifying the optimal characteristics of patients with breast cancer for recommending neoadjuvant chemotherapy is an active area of clinical research. We developed and compared several approaches to building prediction models for pathologic complete response (pCR) among patients with breast cancer in Alberta.
Affiliation(s)
- Robert B Basmadjian: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada
- Shiying Kong: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada; Department of Surgery, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada
- Devon J Boyne: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada
- Tamer N Jarada: Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada
- Yuan Xu: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada; Department of Surgery, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada
- Winson Y Cheung: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada
- Sasha Lupichuk: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada
- May Lynn Quan: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada; Department of Surgery, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada
- Darren R Brenner: Department of Community Health Sciences, Foothills Medical Centre, University of Calgary, Calgary, Alberta, Canada; Department of Oncology, University of Calgary, Tom Baker Cancer Centre, Calgary, Alberta, Canada
18
Ellis DE, Hubbard RA, Willis AW, Zuppa AF, Zaoutis TE, Hennessy S. Comparing LASSO and random forest models for predicting neurological dysfunction among fluoroquinolone users. Pharmacoepidemiol Drug Saf 2021; 31:393-403. [PMID: 34881470] [DOI: 10.1002/pds.5391]
Abstract
BACKGROUND Fluoroquinolones are associated with central (CNS) and peripheral (PNS) nervous system symptoms, and predicting the risk of these outcomes may have important clinical implications. Both LASSO and random forest are appealing modeling methods, yet it is not clear which performs better for clinical risk prediction. PURPOSE To compare models developed using LASSO versus random forest for predicting neurological dysfunction among fluoroquinolone users. METHODS We developed and validated risk prediction models using claims data from a commercially insured population. The study cohort included adults dispensed an oral fluoroquinolone, and the outcomes were CNS and PNS dysfunction. Model predictors included demographic variables, comorbidities and medications known to be associated with neurological symptoms, and several healthcare utilization predictors. We assessed the accuracy and calibration of these models using measures including AUC, calibration curves, and Brier scores. RESULTS The underlying cohort contained 16,533 (1.18%) individuals with CNS dysfunction and 46,995 (3.34%) individuals with PNS dysfunction during 120 days of follow-up. For CNS dysfunction, LASSO had an AUC of 0.81 (95% CI: 0.80, 0.82), while random forest had an AUC of 0.80 (95% CI: 0.80, 0.81). For PNS dysfunction, LASSO had an AUC of 0.75 (95% CI: 0.74, 0.76) versus 0.73 (95% CI: 0.73, 0.74) for random forest. Both LASSO models had better calibration, with Brier scores of 0.17 (LASSO) versus 0.20 (random forest) for CNS dysfunction and 0.20 (LASSO) versus 0.25 (random forest) for PNS dysfunction. CONCLUSIONS LASSO outperformed random forest in predicting CNS and PNS dysfunction among fluoroquinolone users, and should be considered when the cohort is modest in size, the number of model predictors is modest, and the predictors are primarily binary.
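The Brier score used here to compare calibration is simply the mean squared error between predicted probabilities and 0/1 outcomes. A minimal sketch with invented predictions (not the study's models), contrasting a sharper model with one hedged toward 0.5:

```python
def brier_score(labels, probs):
    """Mean squared difference between each predicted probability and the
    observed 0/1 outcome; lower values indicate better-calibrated,
    sharper predictions."""
    return sum((p - y) ** 2 for y, p in zip(labels, probs)) / len(labels)

# Illustrative outcomes and two hypothetical models' predicted probabilities
labels  = [0, 0, 1, 1]
model_a = [0.1, 0.2, 0.8, 0.7]  # confident and mostly right
model_b = [0.4, 0.4, 0.6, 0.5]  # hedged toward 0.5
print(brier_score(labels, model_a))  # -> 0.045
print(brier_score(labels, model_a) < brier_score(labels, model_b))  # -> True
```

A difference like 0.17 versus 0.20, as reported above, therefore reflects both calibration and sharpness, which is why the authors report it alongside AUC rather than instead of it.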
Affiliation(s)
- Darcy E Ellis
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Rebecca A Hubbard
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Allison W Willis
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA; Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Athena F Zuppa
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA; Department of Anesthesiology and Critical Care, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Theoklis E Zaoutis
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA; Division of Infectious Diseases, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Sean Hennessy
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
19
A Machine Learning Classifier Improves Mortality Prediction Compared With Pediatric Logistic Organ Dysfunction-2 Score: Model Development and Validation. Crit Care Explor 2021; 3:e0426. PMID: 34036277; PMCID: PMC8133049; DOI: 10.1097/cce.0000000000000426.
Abstract
Objectives: To determine whether machine learning algorithms can better predict PICU mortality than the Pediatric Logistic Organ Dysfunction-2 score. Design: Retrospective study. Setting: Quaternary care medical-surgical PICU. Patients: All patients admitted to the PICU from 2013 to 2019. Interventions: None. Measurements and Main Results: We investigated the performance of various machine learning algorithms, using the same variables used to calculate the Pediatric Logistic Organ Dysfunction-2 score, to predict PICU mortality. We used 10,194 patient records from 2013 to 2017 for training and 4,043 patient records from 2018 to 2019 as a holdout validation cohort. Mortality rate was 3.0% in the training cohort and 3.4% in the validation cohort. The best performing algorithm was a random forest model (area under the receiver operating characteristic curve, 0.867 [95% CI, 0.863–0.895]; area under the precision-recall curve, 0.327 [95% CI, 0.246–0.414]; F1, 0.396 [95% CI, 0.321–0.468]), which significantly outperformed the Pediatric Logistic Organ Dysfunction-2 score (area under the receiver operating characteristic curve, 0.761 [95% CI, 0.713–0.810]; area under the precision-recall curve, 0.239 [95% CI, 0.165–0.316]; F1, 0.284 [95% CI, 0.209–0.360]), although this difference was reduced after retraining the Pediatric Logistic Organ Dysfunction-2 logistic regression model at the study institution. The random forest model also showed better calibration than the Pediatric Logistic Organ Dysfunction-2 score, and its calibration remained superior to the retrained Pediatric Logistic Organ Dysfunction-2 model. Conclusions: A machine learning model achieved better performance than a logistic regression-based score for predicting ICU mortality.
Better estimation of mortality risk can improve our ability to adjust for severity of illness in future studies, although external validation is required before this method can be widely deployed.
20
Cho SM, Austin PC, Ross HJ, Abdel-Qadir H, Chicco D, Tomlinson G, Taheri C, Foroutan F, Lawler PR, Billia F, Gramolini A, Epelman S, Wang B, Lee DS. Machine Learning Compared With Conventional Statistical Models for Predicting Myocardial Infarction Readmission and Mortality: A Systematic Review. Can J Cardiol 2021; 37:1207-1214. PMID: 33677098; DOI: 10.1016/j.cjca.2021.02.020.
Abstract
BACKGROUND Machine learning (ML) methods are increasingly used in addition to conventional statistical modelling (CSM) for predicting readmission and mortality in patients with myocardial infarction (MI). However, the two approaches have not been systematically compared across studies of prognosis in patients with MI. METHODS Following PRISMA guidelines, we systematically reviewed the literature via Medline, EPub, Cochrane Central, Embase, Inspec, ACM Digital Library, and Web of Science. Eligible studies included primary research articles published from January 2000 to March 2020, comparing ML and CSM for prognostication after MI. RESULTS Of 7,348 articles, 112 underwent full-text review, with the final set composed of 24 articles representing 374,365 patients. ML methods included artificial neural networks (n = 12 studies), random forests (n = 11), decision trees (n = 8), support vector machines (n = 8), and Bayesian techniques (n = 7). CSM included logistic regression (n = 19 studies), existing CSM-derived risk scores (n = 12), and Cox regression (n = 2). Thirteen of 19 studies examining mortality reported higher C-indexes with the use of ML compared with CSM. One study examined readmissions at 2 different time points, with C-indexes that were higher for ML than CSM. Across all studies, a total of 29 comparisons were performed, but the majority (n = 26, 90%) found small (< 0.05) absolute differences in the C-index between ML and CSM. With the use of a modified CHARMS checklist, sources of bias were identifiable in the majority of studies, and only 2 were externally validated. CONCLUSION Although ML algorithms tended to have higher C-indexes than CSM for predicting death or readmission after MI, these studies exhibited threats to internal validity and were often unvalidated. Further comparisons are needed, with adherence to clinical quality standards for prognosis research. (Trial registration: PROSPERO CRD42019134896).
Affiliation(s)
- Sung Min Cho
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Peter C Austin
- Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada; Institute for Health Policy, Management and Evaluation, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Heather J Ross
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Husam Abdel-Qadir
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada; Institute for Health Policy, Management and Evaluation, Toronto, Ontario, Canada; Women's College Hospital, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- George Tomlinson
- Institute for Health Policy, Management and Evaluation, Toronto, Ontario, Canada; Biostatistics Research Unit, University Health Network, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Cameron Taheri
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Farid Foroutan
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada
- Patrick R Lawler
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; Toronto General Hospital Research Institute, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Filio Billia
- Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; Toronto General Hospital Research Institute, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Anthony Gramolini
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Slava Epelman
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; Toronto General Hospital Research Institute, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Bo Wang
- Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
- Douglas S Lee
- Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada; Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada; Institute for Health Policy, Management and Evaluation, Toronto, Ontario, Canada; Toronto General Hospital Research Institute, Toronto, Ontario, Canada; University of Toronto, Toronto, Ontario, Canada
21
Lenert MC, Matheny ME, Walsh CG. Prognostic models will be victims of their own success, unless…. J Am Med Inform Assoc 2021; 26:1645-1650. PMID: 31504588; DOI: 10.1093/jamia/ocz145.
Abstract
Predictive analytics have begun to change the workflows of healthcare by giving insight into our future health. Deploying prognostic models into clinical workflows should change behavior and motivate interventions that affect outcomes. As users respond to model predictions, downstream characteristics of the data, including the distribution of the outcome, may change. The ever-changing nature of healthcare necessitates maintenance of prognostic models to ensure their longevity. The more effective a model and intervention(s) are at improving outcomes, the faster a model will appear to degrade. Improving outcomes can disrupt the association between the model's predictors and the outcome. Model refitting may not always be the most effective response to these challenges. These problems will need to be mitigated by systematically incorporating interventions into prognostic models and by maintaining robust performance surveillance of models in clinical use. Holistically modeling the outcome and intervention(s) can lead to resilience to future compromises in performance.
Affiliation(s)
- Matthew C Lenert
- Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA
- Michael E Matheny
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA; Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA; Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA; Geriatric Research Education and Clinical Care, Tennessee Valley Health System, Department of Veterans Affairs, Nashville, Tennessee, USA
- Colin G Walsh
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA; Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA; Department of Psychiatry, Vanderbilt University Medical Center, Nashville, Tennessee, USA
22
Majnarić LT, Babič F, O’Sullivan S, Holzinger A. AI and Big Data in Healthcare: Towards a More Comprehensive Research Framework for Multimorbidity. J Clin Med 2021; 10:766. PMID: 33672914; PMCID: PMC7918668; DOI: 10.3390/jcm10040766.
Abstract
Multimorbidity refers to the coexistence of two or more chronic diseases in one person. Patients with multimorbidity therefore have multiple and special care needs. In practice, however, it is difficult to meet these needs because the organizational processes of current healthcare systems tend to be tailored to a single disease. To improve clinical decision making and patient care in multimorbidity, a radical change in the problem-solving approach to medical research and treatment is needed. In addition to the traditional reductionist approach, we propose interactive research supported by artificial intelligence (AI) and advanced big data analytics. Such a research approach, when applied to data routinely collected in healthcare settings, provides an integrated platform for research tasks related to multimorbidity. This may include, for example, prediction, correlation, and classification problems based on multiple interacting factors. However, to realize this paradigm shift in multimorbidity research, the optimization, standardization, and, most importantly, the integration of electronic health data into a common national and international research infrastructure are needed. Ultimately, efficient AI approaches, particularly deep learning, must be integrated and implemented in clinical routine directly within the workflows of medical professionals.
Affiliation(s)
- Ljiljana Trtica Majnarić
- Department of Internal Medicine, Family Medicine and the History of Medicine, Faculty of Medicine, University Josip Juraj Strossmayer, 31000 Osijek, Croatia; Department of Public Health, Faculty of Dental Medicine, University Josip Juraj Strossmayer, 31000 Osijek, Croatia
- František Babič
- Department of Cybernetics and Artificial Intelligence, Faculty of Electrical Engineering and Informatics, Technical University of Košice, 066 01 Košice, Slovakia (corresponding author; Tel.: +421-55-602-4220)
- Shane O’Sullivan
- Department of Pathology, Faculdade de Medicina, Universidade de São Paulo, 05508-220 São Paulo, Brazil
- Andreas Holzinger
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, 8036 Graz, Austria
23
Second opinion needed: communicating uncertainty in medical machine learning. NPJ Digit Med 2021; 4:4. PMID: 33402680; PMCID: PMC7785732; DOI: 10.1038/s41746-020-00367-3.
Abstract
There is great excitement that medical artificial intelligence (AI) based on machine learning (ML) can be used to improve decision making at the patient level in a variety of healthcare settings. However, the quantification and communication of uncertainty for individual predictions is often neglected even though uncertainty estimates could lead to more principled decision-making and enable machine learning models to automatically or semi-automatically abstain on samples for which there is high uncertainty. In this article, we provide an overview of different approaches to uncertainty quantification and abstention for machine learning and highlight how these techniques could improve the safety and reliability of current ML systems being used in healthcare settings. Effective quantification and communication of uncertainty could help to engender trust with healthcare workers, while providing safeguards against known failure modes of current machine learning approaches. As machine learning becomes further integrated into healthcare environments, the ability to say "I'm not sure" or "I don't know" when uncertain is a necessary capability to enable safe clinical deployment.
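The abstention idea above ("I'm not sure") can be sketched in a few lines: a classifier declines to output a label when its predicted probability falls in an uncertainty band around 0.5. The toy data and the 0.2-wide band are arbitrary illustrative choices, not a clinically validated rule.

```python
# Minimal sketch of selective prediction with abstention: the model abstains
# on cases whose predicted probability is close to 0.5. Illustrative only.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=1)
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)

p = clf.predict_proba(X_te)[:, 1]
uncertain = np.abs(p - 0.5) < 0.2        # the "I'm not sure" band
labels = (p >= 0.5).astype(int)

overall_acc = (labels == y_te).mean()
confident_acc = (labels[~uncertain] == y_te[~uncertain]).mean()
print(f"abstained on {uncertain.mean():.1%} of cases; "
      f"accuracy {overall_acc:.3f} overall, {confident_acc:.3f} when confident")
```

The expected pattern is that accuracy on the retained (confident) cases exceeds overall accuracy, at the cost of deferring a fraction of cases to a human, which is exactly the trade-off the abstract advocates quantifying.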
24
Josephson CB, Wiebe S. Precision Medicine: Academic dreaming or clinical reality? Epilepsia 2020; 62(Suppl 2):S78-S89. PMID: 33205406; DOI: 10.1111/epi.16739.
Abstract
Precision medicine can be distilled into a concept of accounting for an individual's unique collection of clinical, physiologic, genetic, and sociodemographic characteristics to provide patient-level predictions of disease course and response to therapy. Abundant evidence now allows us to determine how an average person with epilepsy will respond to specific medical and surgical treatments. This is useful, but not readily applicable to an individual patient. It has brought into sharp focus the desire for a more individualized approach through which we counsel people based on individual characteristics, as opposed to population-level data. We are now accruing data at unprecedented rates, allowing us to convert this ideal into reality. In addition, we have access to growing volumes of administrative and electronic health record data, biometric, imaging, genetic, microbiome, and other "omics" data, thus paving the way toward phenome-wide association studies and "the epidemiology of one." Despite this, there are many challenges ahead. Collating, integrating, and storing sensitive multimodal data for advanced analytics remains difficult as patient consent and data security issues increase in complexity. Agreement on many aspects of epilepsy remains imperfect, rendering models sensitive to misclassification due to a lack of "ground truth." Even with existing data, advanced analytics models are prone to overfitting and often fail to generalize externally. Finally, uptake by clinicians is often hindered by opaque, "black box" algorithms. Systematic approaches to data collection and model generation, and an emphasis on education to promote uptake and knowledge translation, are required to propel epilepsy-based precision medicine from the realm of the theoretical into routine clinical practice.
Affiliation(s)
- Colin B Josephson
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada; Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada; Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada; O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada; Centre for Health Informatics, University of Calgary, Calgary, AB, Canada
- Samuel Wiebe
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada; Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada; Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada; O'Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada; Clinical Research Unit, University of Calgary, Calgary, AB, Canada
25
Lee SK, Ahn J, Shin JH, Lee JY. Application of Machine Learning Methods in Nursing Home Research. Int J Environ Res Public Health 2020; 17:6234. PMID: 32867250; PMCID: PMC7503291; DOI: 10.3390/ijerph17176234.
Abstract
Background: A machine learning (ML) system is able to construct algorithms that continue improving predictions and generate automated knowledge through data-driven predictors or decisions. Objective: The purpose of this study was to compare six ML methods (random forest (RF), logistic regression, linear support vector machine (SVM), polynomial SVM, radial SVM, and sigmoid SVM) for predicting falls in nursing homes (NHs). Methods: We applied the six ML algorithms to the preprocessed dataset to develop prediction models (N = 60). We used an accuracy measure to evaluate the prediction models. Results: RF was the most accurate model (0.883), followed by logistic regression, linear SVM, and polynomial SVM (0.867). Conclusions: RF was a powerful algorithm for discerning predictors of falls in NHs. For effective fall management, researchers should consider organizational characteristics as well as personal factors. Recommendations for Future Research: To confirm the superiority of ML in NH research, future studies should examine additional potential factors using newly introduced ML methods.
Affiliation(s)
- Soo-Kyoung Lee
- College of Nursing, Keimyung University, 1095 Dalgubeol-daero, Dalseo-gu, Daegu 42601, Korea
- Jinhyun Ahn
- Department of Management Information Systems, Jeju National University, Jeju-do 63243, Korea
- Juh Hyun Shin
- College of Nursing, Ewha Womans University, Seoul 03760, Korea
- Ji Yeon Lee
- College of Nursing, Ewha Womans University, Seoul 03760, Korea
26
Solomon SC, Saxena RC, Neradilek MB, Hau V, Fong CT, Lang JD, Posner KL, Nair BG. Forecasting a Crisis. Anesth Analg 2020; 130:1201-1210. DOI: 10.1213/ane.0000000000004636.
27
Foreman B. Integrating and Using Big Data in Neurocritical Care. Neurotherapeutics 2020; 17:593-605 (Neurocritical Care: Bench to Bedside series, eds. Claude Hemphill, Michael James). PMID: 32152955; PMCID: PMC7283405; DOI: 10.1007/s13311-020-00846-1.
Abstract
The critical care environment generates huge volumes of data, and clinicians are tasked with quickly processing these data and responding to them urgently. The neurocritical care environment increasingly involves EEG, multimodal intracranial monitoring, and complex imaging, which preclude comprehensive human synthesis and require new concepts to integrate data into clinical care. By definition, Big Data is data that cannot be handled using traditional infrastructures and is characterized by the volume, variety, velocity, and variability of the data being produced. Big Data in the neurocritical care unit requires rethinking of data storage infrastructures and the development of tools and analytics to drive advancements in the field. Preprocessing, feature extraction, statistical inference, and analytic tools are required in order to achieve the primary goals of Big Data for clinical use: description, prediction, and prescription. Barriers to its use at the bedside include a lack of infrastructure development within the healthcare industry, a lack of standardization of data inputs, and, ultimately, existential and scientific concerns about the outputs that result from the use of tools such as artificial intelligence. However, as implied by the fundamental theorem of biomedical informatics, physicians remain central to the development and utility of Big Data to improve patient care.
Affiliation(s)
- Brandon Foreman
- Department of Neurology & Rehabilitation Medicine, University of Cincinnati Medical Center, 231 Albert Sabin Way, Cincinnati, OH, 45267-0517, USA; Collaborative for Research on Acute Neurological Injuries (CRANI), University of Cincinnati, Cincinnati, OH, USA
28
Riley RD, Ensor J, Snell KIE, Harrell FE, Martin GP, Reitsma JB, Moons KGM, Collins G, van Smeden M. Calculating the sample size required for developing a clinical prediction model. BMJ 2020; 368:m441. PMID: 32188600; DOI: 10.1136/bmj.m441.
Affiliation(s)
- Richard D Riley
- Centre for Prognosis Research, School of Primary, Community and Social Care, Keele University, Staffordshire ST5 5BG, UK
- Joie Ensor
- Centre for Prognosis Research, School of Primary, Community and Social Care, Keele University, Staffordshire ST5 5BG, UK
- Kym I E Snell
- Centre for Prognosis Research, School of Primary, Community and Social Care, Keele University, Staffordshire ST5 5BG, UK
- Frank E Harrell
- Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN, USA
- Glen P Martin
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
- Johannes B Reitsma
- Julius Center for Health Sciences, University Medical Center Utrecht, Utrecht, Netherlands
- Karel G M Moons
- Julius Center for Health Sciences, University Medical Center Utrecht, Utrecht, Netherlands
- Gary Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
- Maarten van Smeden
- Julius Center for Health Sciences, University Medical Center Utrecht, Utrecht, Netherlands; Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK; Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands
29
Luque A, De Las Heras A, Ávila-Gutiérrez MJ, Zamora-Polo F. ADAPTS: An Intelligent Sustainable Conceptual Framework for Engineering Projects. Sensors 2020; 20:1553. PMID: 32168788; PMCID: PMC7147156; DOI: 10.3390/s20061553.
Abstract
This paper presents a conceptual framework for optimizing environmental sustainability in engineering projects, covering both products and industrial facilities or processes. The main objective is to help researchers approach the optimization of engineering projects under sustainability criteria using current machine learning techniques. To develop the framework, a bibliographic search was carried out on the Web of Science, and the selected documents were analysed through a hermeneutic procedure. A pyramid-shaped graphic representation clearly defines the framework's variables and their relationships. The framework consists of five dimensions, giving the acronym ADAPTS: at the base are (1) the Application for which it is intended, (2) the available DAta, (3) the APproach under which it is operated, and (4) the machine learning Tool used; at the top of the pyramid is (5) the necessary Sensing. A case study is presented to show the framework's applicability. This work is part of a broader line of research on optimization under sustainability criteria.
30
Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol 2020; 122:56-69. PMID: 32169597; DOI: 10.1016/j.jclinepi.2020.03.002.
Abstract
OBJECTIVE To evaluate the performance of machine learning (ML) algorithms and to compare them with logistic regression for predicting the risk of cardiovascular diseases (CVDs), chronic kidney disease (CKD), diabetes (DM), and hypertension (HTN) in a prospective cohort study using simple clinical predictors. STUDY DESIGN AND SETTING We conducted analyses in a population-based cohort study in Asian adults (n = 6,762). Five ML models were considered (single-hidden-layer neural network, support vector machine, random forest, gradient boosting machine, and k-nearest neighbor) and compared with standard logistic regression. RESULTS The incidences at 6 years of CVD, CKD, DM, and HTN were 4.0%, 7.0%, 9.2%, and 34.6%, respectively. Logistic regression reached the highest area under the receiver operating characteristic curve for CKD (0.905 [0.88, 0.93]) and DM (0.768 [0.73, 0.81]) predictions. For CVD and HTN, the best models were the neural network (0.753 [0.70, 0.81]) and the support vector machine (0.780 [0.747, 0.812]), respectively. However, the differences from logistic regression were small (less than 1%) and nonsignificant. Logistic regression, gradient boosting machine, and neural network were systematically ranked among the best models. CONCLUSION Logistic regression performs as well as ML models in predicting the risk of major chronic diseases with low incidence and simple clinical predictors.
31
FibroScan-AST (FAST) score for the non-invasive identification of patients with non-alcoholic steatohepatitis with significant activity and fibrosis: a prospective derivation and global validation study. Lancet Gastroenterol Hepatol 2020; 5:362-373. PMID: 32027858; PMCID: PMC7066580; DOI: 10.1016/s2468-1253(19)30383-8.
Abstract
BACKGROUND The burden of non-alcoholic fatty liver disease (NAFLD) is increasing globally, and a major priority is to identify patients with non-alcoholic steatohepatitis (NASH) who are at greater risk of progression to cirrhosis, and who will be candidates for clinical trials and emerging new pharmacotherapies. We aimed to develop a score to identify patients with NASH, elevated NAFLD activity score (NAS≥4), and advanced fibrosis (stage 2 or higher [F≥2]). METHODS This prospective study included a derivation cohort before validation in multiple international cohorts. The derivation cohort was a cross-sectional, multicentre study of patients aged 18 years or older, scheduled to have a liver biopsy for suspicion of NAFLD at seven tertiary care liver centres in England. This was a prespecified secondary outcome of a study for which the primary endpoints have already been reported. Liver stiffness measurement (LSM) by vibration-controlled transient elastography and controlled attenuation parameter (CAP) measured by FibroScan device were combined with aspartate aminotransferase (AST), alanine aminotransferase (ALT), or AST:ALT ratio. To identify those patients with NASH, an elevated NAS, and significant fibrosis, the best fitting multivariable logistic regression model was identified and internally validated using boot-strapping. Score calibration and discrimination performance were determined in both the derivation dataset in England, and seven independent international (France, USA, China, Malaysia, Turkey) histologically confirmed cohorts of patients with NAFLD (external validation cohorts). This study is registered with ClinicalTrials.gov, number NCT01985009. FINDINGS Between March 20, 2014, and Jan 17, 2017, 350 patients with suspected NAFLD attending liver clinics in England were prospectively enrolled in the derivation cohort. The most predictive model combined LSM, CAP, and AST, and was designated FAST (FibroScan-AST). 
Performance was satisfactory in the derivation dataset (C-statistic 0·80, 95% CI 0·76-0·85), and the score was well calibrated. In the external validation cohorts, calibration of the score was satisfactory and discrimination was good across all cohorts (C-statistic range 0·74-0·95; pooled C-statistic 0·85, 95% CI 0·83-0·87; n=1026). In the derivation cohort, the cutoff for a sensitivity of 0·90 or greater was 0·35 and the cutoff for a specificity of 0·90 or greater was 0·67, yielding a positive predictive value (PPV) of 0·83 (84/101) and a negative predictive value (NPV) of 0·85 (93/110). In the external validation cohorts, PPV ranged from 0·33 to 0·81 and NPV from 0·73 to 1·0. INTERPRETATION The FAST score provides an efficient way to non-invasively identify patients at risk of progressive NASH for clinical trials or for treatments as they become available, thereby reducing unnecessary liver biopsy in patients unlikely to have significant disease. FUNDING Echosens and UK National Institute for Health Research.
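The two cutoffs reported in the abstract imply a three-zone decision rule for an already computed FAST score. A minimal stdlib Python sketch; the function name and zone labels are illustrative, not from the paper:

```python
def fast_triage(fast_score, rule_out=0.35, rule_in=0.67):
    """Three-zone interpretation of a FAST score using the derivation-cohort
    cutoffs from the abstract: <=0.35 targets sensitivity >=0.90 (rule out),
    >=0.67 targets specificity >=0.90 (rule in)."""
    if not 0.0 <= fast_score <= 1.0:
        raise ValueError("FAST scores lie between 0 and 1")
    if fast_score <= rule_out:
        return "rule-out"       # progressive NASH unlikely
    if fast_score >= rule_in:
        return "rule-in"        # consider biopsy or trial work-up
    return "indeterminate"      # grey zone: further assessment needed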
32
Davis SE, Greevy RA, Fonnesbeck C, Lasko TA, Walsh CG, Matheny ME. A nonparametric updating method to correct clinical prediction model drift. J Am Med Inform Assoc 2019; 26:1448-1457. [PMID: 31397478 PMCID: PMC6857513 DOI: 10.1093/jamia/ocz127] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4]
Abstract
OBJECTIVE Clinical prediction models require updating as performance deteriorates over time. We developed a testing procedure to select updating methods that minimizes overfitting, incorporates uncertainty associated with updating sample sizes, and is applicable to both parametric and nonparametric models. MATERIALS AND METHODS We describe a procedure to select an updating method for dichotomous outcome models by balancing simplicity against accuracy. We illustrate the test's properties on simulated scenarios of population shift and on 2 models based on Department of Veterans Affairs inpatient admissions. RESULTS In simulations, the test generally recommended no update under no population shift, no update or modest recalibration under case-mix shifts, intercept correction under changing outcome rates, and refitting under shifted predictor-outcome associations. The recommended updates provided calibration superior or similar to that achieved with more complex updating. In the case study, however, small update sets led the test to recommend simpler updates than may have been ideal based on subsequent performance. DISCUSSION The test's recommendations highlighted the benefits of simple updating, as opposed to systematic refitting, in response to performance drift. As anticipated, the complexity of the recommended updating methods reflected sample size and the magnitude of performance drift. The case study highlights the conservative nature of the test. CONCLUSIONS This new test supports data-driven updating of models developed with both biostatistical and machine learning approaches, promoting the transportability and maintenance of a wide array of clinical prediction models and, in turn, of the many applications that rely on modern prediction tools.
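Among the updating options the test chooses between, intercept correction is the simplest step beyond no update: shift every linear predictor by a constant so the mean recalibrated risk matches the observed event rate. A stdlib-only sketch; the bisection search is an illustrative implementation choice, not the paper's code:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def intercept_update(linear_predictors, outcomes):
    """Additive intercept correction: find delta such that the mean of
    sigmoid(lp + delta) over the update set equals the observed event rate.
    Solved here by bisection on the monotone mean-risk function."""
    target = sum(outcomes) / len(outcomes)
    lo, hi = -10.0, 10.0
    for _ in range(200):
        mid = (lo + hi) / 2.0
        mean_risk = sum(sigmoid(lp + mid)
                        for lp in linear_predictors) / len(linear_predictors)
        if mean_risk < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0
```

Because only one parameter is estimated, this update is far less prone to overfitting on small update sets than full refitting, which is exactly the trade-off the test formalizes.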
Affiliation(s)
- Sharon E Davis: Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Robert A Greevy: Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Christopher Fonnesbeck: Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Thomas A Lasko: Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Colin G Walsh: Departments of Biomedical Informatics, Medicine, and Psychiatry, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Michael E Matheny: Departments of Biomedical Informatics, Biostatistics, and Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA; Geriatrics Research, Education, and Clinical Care, Nashville VA Medical Center, VA Tennessee Valley Healthcare System, Nashville, Tennessee, USA
33
Burlacu A, Iftene A, Busoiu E, Cogean D, Covic A. Challenging the supremacy of evidence-based medicine through artificial intelligence: the time has come for a change of paradigms. Nephrol Dial Transplant 2019; 35:191-194. [PMID: 31697377 DOI: 10.1093/ndt/gfz203] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6]
Affiliation(s)
- Alexandru Burlacu: Department of Interventional Cardiology, Cardiovascular Diseases Institute, 'Grigore T. Popa' University of Medicine, Iasi, Romania
- Adrian Iftene: Faculty of Computer Science, 'Alexandru Ioan Cuza' University of Iasi, Iasi, Romania
- Eugen Busoiu: Artificial Intelligence Community, Iasi, Romania
- Dragos Cogean: Software Development, Gemini CAD Systems, Iasi, Romania
- Adrian Covic: Nephrology Clinic, Dialysis and Renal Transplant Center, 'C.I. Parhon' University Hospital, 'Grigore T. Popa' University of Medicine, Iasi, Romania; The Academy of Romanian Scientists (AOSR)
34
Comparison of Machine Learning Techniques for Prediction of Hospitalization in Heart Failure Patients. J Clin Med 2019; 8:jcm8091298. [PMID: 31450546 PMCID: PMC6780582 DOI: 10.3390/jcm8091298] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0]
Abstract
The present study aims to compare the performance of eight Machine Learning Techniques (MLTs) in the prediction of hospitalization among patients with heart failure, using data from the Gestione Integrata dello Scompenso Cardiaco (GISC) study. The GISC project is an ongoing study in the region of Puglia, Southern Italy. Patients with a diagnosis of heart failure are enrolled in a long-term assistance program that includes an online platform for data sharing between general practitioners and cardiologists working in hospitals and community health districts. Logistic regression, generalized linear model net (GLMN), classification and regression tree, random forest, AdaBoost, LogitBoost, support vector machine, and neural networks were applied to evaluate the feasibility of such techniques in predicting hospitalization of 380 patients enrolled in the GISC study, using data on the demographic characteristics, medical history, and clinical characteristics of each patient. The MLTs were compared both with and without missing-data imputation. Overall, models trained without missing-data imputation showed higher predictive performance. The GLMN predicted hospitalization better than the other MLTs, with an average accuracy, positive predictive value, and negative predictive value of 81.2%, 87.5%, and 75%, respectively. These findings suggest that MLTs may represent a promising opportunity to predict hospital admission of heart failure patients by exploiting health care information generated by the contact of such patients with the health care system.
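The three summary measures reported for the GLMN are simple functions of a 2x2 confusion matrix. A stdlib sketch; the example counts in the usage note are illustrative, chosen only because they reproduce values close to the rounded figures above, and are not taken from the GISC data:

```python
def classification_summary(tp, fp, tn, fn):
    """Accuracy, positive predictive value (PPV), and negative predictive
    value (NPV) from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "ppv": tp / (tp + fp),  # correctness among predicted hospitalizations
        "npv": tn / (tn + fn),  # correctness among predicted non-events
    }
```

For instance, counts of tp=7, fp=1, tn=6, fn=2 give accuracy 0.8125, PPV 0.875, and NPV 0.75.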
35
Mijderwijk HJ, Steyerberg EW, Steiger HJ, Fischer I, Kamp MA. Fundamentals of Clinical Prediction Modeling for the Neurosurgeon. Neurosurgery 2019; 85:302-311. [DOI: 10.1093/neuros/nyz282] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8]
Abstract
Clinical prediction models in neurosurgery are increasingly reported. These models aim to provide an evidence-based approach to estimating the probability of a neurosurgical outcome by combining 2 or more prognostic variables. Model development and model reporting are often suboptimal, so a basic understanding of the methodology of clinical prediction modeling is needed when interpreting these models. We address the basic statistical background, 7 modeling steps, and the requirements of these models, so that they may fulfill their potential for major impact on our daily clinical practice and on future scientific work.
Affiliation(s)
- Hendrik-Jan Mijderwijk: Department of Neurosurgery, Heinrich-Heine University Medical Center, Düsseldorf, Germany
- Ewout W Steyerberg: Department of Public Health, Erasmus MC, Rotterdam, The Netherlands; Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
- Hans-Jakob Steiger: Department of Neurosurgery, Heinrich-Heine University Medical Center, Düsseldorf, Germany
- Igor Fischer: Division of Informatics and Data Science, Department of Neurosurgery, Heinrich-Heine University, Düsseldorf, Germany
- Marcel A Kamp: Department of Neurosurgery, Heinrich-Heine University Medical Center, Düsseldorf, Germany
36
van Bussel EF, Richard E, Busschers WB, Steyerberg EW, van Gool WA, Moll van Charante EP, Hoevenaar-Blom MP. A cardiovascular risk prediction model for older people: Development and validation in a primary care population. J Clin Hypertens (Greenwich) 2019; 21:1145-1152. [PMID: 31294917 PMCID: PMC6772108 DOI: 10.1111/jch.13617] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0]
Abstract
Cardiovascular risk prediction is mainly based on traditional risk factors that have been validated in middle-aged populations. However, associations between these risk factors and cardiovascular disease (CVD) attenuate with increasing age. Therefore, for older people the authors developed and internally validated risk prediction models for fatal and non-fatal CVD, (re)evaluated the predictive value of traditional and new factors, and assessed the impact of competing risks of non-cardiovascular death. Post hoc analyses were performed of 1811 persons aged 70-78 years and free from CVD at baseline from the preDIVA study (Prevention of Dementia by Intensive Vascular care, 2006-2015), a primary care-based trial that included persons free from dementia and from conditions likely to hinder successful long-term follow-up. In 2017-2018, Cox regression analyses were performed for a model including seven traditional risk factors only, and for a model assessing the incremental predictive ability of the traditional and eleven new factors. Analyses were repeated accounting for the competing risk of death, using Fine-Gray models. During an average of 6.2 years of follow-up, 277 CVD events occurred. Age, sex, smoking, and type 2 diabetes mellitus were traditional predictors of CVD, whereas total cholesterol, HDL-cholesterol, and systolic blood pressure (SBP) were not. Of the eleven new factors, polypharmacy and apathy symptoms were predictors. Discrimination was moderate (concordance statistic 0.65). Accounting for competing risks resulted in slightly smaller predicted absolute risks. In conclusion, SBP, HDL-cholesterol, and total cholesterol no longer predicted CVD in these older adults, whereas polypharmacy and apathy symptoms are two new relevant predictors. Building on the risk factors selected in this study may improve CVD prediction in older adults and facilitate targeting preventive interventions to those at high risk.
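The competing-risk adjustment used here (Fine-Gray) addresses the fact that naive survival estimates overstate absolute CVD risk when participants can die of non-cardiovascular causes first. A stdlib sketch of the related Aalen-Johansen cumulative incidence estimator, which makes the same point nonparametrically; this is illustrative code, not the study's analysis:

```python
def cumulative_incidence(times, causes, target_cause, horizon):
    """Aalen-Johansen cumulative incidence of `target_cause` by `horizon`.
    causes[i] is 0 for censoring, otherwise the event type for subject i;
    event types other than target_cause are treated as competing events."""
    n = len(times)
    order = sorted(range(n), key=lambda i: times[i])
    surv = 1.0      # overall event-free survival just before the current time
    cif = 0.0
    at_risk = n
    i = 0
    while i < n and times[order[i]] <= horizon:
        t = times[order[i]]
        d_target = d_other = censored = 0
        while i < n and times[order[i]] == t:   # group ties at time t
            cause = causes[order[i]]
            if cause == target_cause:
                d_target += 1
            elif cause == 0:
                censored += 1
            else:
                d_other += 1
            i += 1
        if at_risk > 0:
            cif += surv * d_target / at_risk
            surv *= 1.0 - (d_target + d_other) / at_risk
        at_risk -= d_target + d_other + censored
    return cif
```

Because competing events deplete the at-risk survival `surv`, this estimate is never larger than the naive one-minus-Kaplan-Meier figure, mirroring the "slightly smaller predicted absolute risks" reported above.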
Affiliation(s)
- Emma F van Bussel: Department of General Practice, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Edo Richard: Department of Neurology, Donders Centre for Brain, Behaviour and Cognition, Radboud University Medical Center, Nijmegen, The Netherlands; Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Wim B Busschers: Department of General Practice, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Ewout W Steyerberg: Department of Biomedical Data Sciences, LUMC, Leiden, The Netherlands; Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
- Willem A van Gool: Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Eric P Moll van Charante: Department of General Practice, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Marieke P Hoevenaar-Blom: Department of Neurology, Donders Centre for Brain, Behaviour and Cognition, Radboud University Medical Center, Nijmegen, The Netherlands; Department of Neurology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
37
Nigatu YT, Wang J. External validation of the International Risk Prediction Algorithm for the onset of generalized anxiety and/or panic syndromes (The Predict A) in the US general population. J Anxiety Disord 2019; 64:40-44. [PMID: 30974236 DOI: 10.1016/j.janxdis.2019.03.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8]
Abstract
INTRODUCTION Multivariable risk prediction algorithms are useful for clinical decision-making and health planning. While prediction algorithms for new onset of anxiety disorders have been developed in Europe and elsewhere, the performance of these algorithms in the Americas is not known. The objective of this study was to validate the PredictA algorithm for new onset of anxiety and/or panic disorders in the US general population. METHODS A longitudinal study was conducted using approximately 2-year follow-up data from 24 626 individuals who participated in Waves 1 and 2 of the US National Epidemiologic Survey on Alcohol and Related Conditions (NESARC) and who did not have generalized anxiety disorder (GAD) or panic disorder in the past year at Wave 1. The PredictA algorithm was applied directly to the selected participants. RESULTS Among the participants, 5.4% developed GAD and/or panic disorder over two years. The PredictA algorithm had modest discriminative power (C-statistic = 0.62, 95% CI: 0.61-0.64) but poor calibration (p < 0.001) with the NESARC data. The observed and mean predicted risks of GAD and/or panic disorders in the NESARC were 5.3% and 3.6%, respectively. In particular, the observed and predicted risks in the highest decile of risk score were 13.3% and 10.4%, respectively. CONCLUSION The PredictA algorithm has acceptable discrimination, but its calibration with the NESARC data was poor; it is likely to underestimate the risk of GAD/panic disorders in the US population. Therefore, using PredictA to predict individual risk of GAD and/or panic disorders in the US general population is not encouraged.
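The two performance dimensions assessed in this validation, discrimination (the C-statistic) and calibration-in-the-large (observed vs mean predicted risk), can be computed directly from predicted risks and observed outcomes. A stdlib sketch of both, illustrative rather than the authors' code:

```python
def c_statistic(risks, outcomes):
    """Concordance statistic (equal to the AUC for binary outcomes): the
    probability that a randomly chosen event received a higher predicted
    risk than a randomly chosen non-event, counting ties as half."""
    events = [r for r, y in zip(risks, outcomes) if y == 1]
    nonevents = [r for r, y in zip(risks, outcomes) if y == 0]
    concordant = sum(1.0 if e > ne else 0.5 if e == ne else 0.0
                     for e in events for ne in nonevents)
    return concordant / (len(events) * len(nonevents))

def calibration_in_the_large(risks, outcomes):
    """Observed event rate vs mean predicted risk. A mean predicted risk
    well below the observed rate indicates underestimation, as reported
    for PredictA in NESARC (3.6% predicted vs 5.3% observed)."""
    observed = sum(outcomes) / len(outcomes)
    expected = sum(risks) / len(risks)
    return observed, expected
```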
Affiliation(s)
- Yeshambel T Nigatu: Institute of Mental Health Research, University of Ottawa, Ottawa, Canada; The Ottawa Hospital Research Institute, Ottawa, Canada
- JianLi Wang: Institute of Mental Health Research, University of Ottawa, Ottawa, Canada; School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Canada
38
Exploration, Inference, and Prediction in Neuroscience and Biomedicine. Trends Neurosci 2019; 42:251-262. [PMID: 30808574 DOI: 10.1016/j.tins.2019.02.001] [Citation(s) in RCA: 110] [Impact Index Per Article: 22.0]
Abstract
Recent decades have seen dramatic progress in brain research. These advances were often buttressed by probing single variables to make circumscribed discoveries, typically through null hypothesis significance testing. New ways for generating massive data fueled tension between the traditional methodology that is used to infer statistically relevant effects in carefully chosen variables, and pattern-learning algorithms that are used to identify predictive signatures by searching through abundant information. In this article we detail the antagonistic philosophies behind two quantitative approaches: certifying robust effects in understandable variables, and evaluating how accurately a built model can forecast future outcomes. We discourage choosing analytical tools via categories such as 'statistics' or 'machine learning'. Instead, to establish reproducible knowledge about the brain, we advocate prioritizing tools in view of the core motivation of each quantitative analysis: aiming towards mechanistic insight or optimizing predictive accuracy.
39
Vijayakumar R, Cheung MWL. Replicability of Machine Learning Models in the Social Sciences. Zeitschrift für Psychologie - Journal of Psychology 2018. [DOI: 10.1027/2151-2604/a000344] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8]
Abstract
Machine learning tools are increasingly used in the social sciences and policy fields because of their gains in predictive accuracy. However, little research has examined how well machine learning models replicate across samples. We compare machine learning methods with regression on the replicability of variable selection, along with predictive accuracy, using an empirical dataset as well as simulated data with additive, interaction, and non-linear squared terms added as predictors. The methods analyzed include support vector machines (SVM), random forests (RF), multivariate adaptive regression splines (MARS), and the regularized regression variants least absolute shrinkage and selection operator (LASSO) and elastic net. In simulations with additive and linear interaction terms, machine learning methods performed similarly to regression in replicating predictors, and performed mostly equal to or below regression on measures of predictive accuracy. In simulations with squared terms, SVM, RF, and MARS improved predictive accuracy and replicated predictors better than regression. Thus, in simulated datasets, the gap between machine learning methods and regression on predictive measures foreshadowed the gap in variable selection. In replications on the empirical dataset, however, improved prediction by machine learning methods was not accompanied by a visible improvement in the replicability of variable selection. This disparity is explained by the overall explanatory power of the models: when predictors have small effects and noise predominates, improved global measures of prediction in a sample may not lead to robust selection of predictors. Thus, in the presence of weak predictors and noise, regression remains a useful tool for model building and replication.
Affiliation(s)
- Mike W.-L. Cheung: Department of Psychology, National University of Singapore, Singapore
40
Supervised Machine-learning Predictive Analytics for Prediction of Postinduction Hypotension. Anesthesiology 2018; 129:675-688. [DOI: 10.1097/aln.0000000000002374] [Citation(s) in RCA: 90] [Impact Index Per Article: 15.0]
Abstract
Background
Hypotension is a risk factor for adverse perioperative outcomes. Machine-learning methods allow large amounts of data to be used in developing robust predictive analytics. The authors hypothesized that machine-learning methods can predict the risk of postinduction hypotension.
Methods
Data were extracted from the electronic health record of a single quaternary care center from November 2015 to May 2016 for patients over age 12 who underwent general anesthesia, without procedure exclusions. Multiple supervised machine-learning classification techniques were attempted, with postinduction hypotension (mean arterial pressure less than 55 mmHg within 10 min of induction by any measurement) as the primary outcome, and preoperative medications, medical comorbidities, induction medications, and intraoperative vital signs as features. Discrimination was assessed using cross-validated area under the receiver operating characteristic curve. The best-performing model was tuned and its final performance assessed using split-set validation.
Results
Out of 13,323 cases, 1,185 (8.9%) experienced postinduction hypotension. The area under the receiver operating characteristic curve was 0.71 (95% CI, 0.70 to 0.72) for logistic regression, 0.63 (95% CI, 0.58 to 0.60) for support vector machines, 0.69 (95% CI, 0.67 to 0.69) for naive Bayes, 0.64 (95% CI, 0.63 to 0.65) for k-nearest neighbor, 0.72 (95% CI, 0.71 to 0.73) for linear discriminant analysis, 0.74 (95% CI, 0.73 to 0.75) for random forest, 0.71 (95% CI, 0.69 to 0.71) for neural networks, and 0.76 (95% CI, 0.75 to 0.77) for the gradient boosting machine. The test-set area for the gradient boosting machine was 0.74 (95% CI, 0.72 to 0.77).
Conclusions
The success of this technique in predicting postinduction hypotension demonstrates feasibility of machine-learning models for predictive analytics in the field of anesthesiology, with performance dependent on model selection and appropriate tuning.
41
Boulesteix A, Janitza S, Hornung R, Probst P, Busen H, Hapfelmeier A. Making complex prediction rules applicable for readers: Current practice in random forest literature and recommendations. Biom J 2018; 61:1314-1328. [DOI: 10.1002/bimj.201700243] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7]
Affiliation(s)
- Anne-Laure Boulesteix: Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
- Silke Janitza: Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
- Roman Hornung: Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
- Philipp Probst: Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
- Hannah Busen: Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
- Alexander Hapfelmeier: Institute for Medical Informatics, Statistics and Epidemiology, TUM Munich, Munich, Germany
42
A review of statistical and machine learning methods for modeling cancer risk using structured clinical data. Artif Intell Med 2018; 90:1-14. [DOI: 10.1016/j.artmed.2018.06.002] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3]
43
Davis SE, Lasko TA, Chen G, Siew ED, Matheny ME. Calibration drift in regression and machine learning models for acute kidney injury. J Am Med Inform Assoc 2018; 24:1052-1061. [PMID: 28379439 DOI: 10.1093/jamia/ocx030] [Citation(s) in RCA: 148] [Impact Index Per Article: 24.7]
Abstract
Objective Predictive analytics create opportunities to incorporate personalized risk estimates into clinical decision support. Models must be well calibrated to support decision-making, yet calibration deteriorates over time. This study explored the influence of modeling methods on performance drift and connected observed drift with data shifts in the patient population. Materials and Methods Using 2003 admissions to Department of Veterans Affairs hospitals nationwide, we developed 7 parallel models for hospital-acquired acute kidney injury using common regression and machine learning methods, validating each over 9 subsequent years. Results Discrimination was maintained for all models. Calibration declined as all models increasingly overpredicted risk. However, the random forest and neural network models maintained calibration across ranges of probability, capturing more admissions than did the regression models. The magnitude of overprediction increased over time for the regression models while remaining stable and small for the machine learning models. Changes in the rate of acute kidney injury were strongly linked to increasing overprediction, while changes in predictor-outcome associations corresponded with diverging patterns of calibration drift across methods. Conclusions Efficient and effective updating protocols will be essential for maintaining accuracy of, user confidence in, and safety of personalized risk predictions to support decision-making. Model updating protocols should be tailored to account for variations in calibration drift across methods and respond to periods of rapid performance drift rather than be limited to regularly scheduled annual or biannual intervals.
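One way to watch for the calibration drift described here is to track the observed-to-expected (O/E) event ratio over successive validation periods: an O/E ratio drifting below 1 signals the growing overprediction the authors report for the regression models. A stdlib sketch of such monitoring, illustrative rather than the study's implementation:

```python
from collections import defaultdict

def observed_expected_by_period(records):
    """records: iterable of (period, predicted_risk, outcome) triples.
    Returns {period: O/E}, where O is the count of observed events and
    E is the sum of predicted risks in that period. O/E = 1 means
    calibration-in-the-large holds; O/E < 1 means overprediction."""
    observed = defaultdict(float)
    expected = defaultdict(float)
    for period, risk, outcome in records:
        observed[period] += outcome
        expected[period] += risk
    return {p: observed[p] / expected[p] for p in sorted(observed)}
```

Monitoring this ratio per period, rather than on a fixed annual schedule, matches the paper's recommendation to trigger updates during periods of rapid performance drift.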
Affiliation(s)
- Sharon E Davis: Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
- Thomas A Lasko: Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
- Guanhua Chen: Department of Biostatistics, Vanderbilt University School of Medicine
- Edward D Siew: Geriatric Research Education and Clinical Care Service, VA Tennessee Valley Healthcare System, Nashville, TN, USA; Division of Nephrology, Vanderbilt University School of Medicine, Vanderbilt Center for Kidney Disease and Integrated Program for AKI, Nashville, TN, USA
- Michael E Matheny: Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA; Department of Biostatistics, Vanderbilt University School of Medicine; Geriatric Research Education and Clinical Care Service, VA Tennessee Valley Healthcare System, Nashville, TN, USA; Division of General Internal Medicine, Vanderbilt University School of Medicine
44
Goldstein BA, Navar AM, Carter RE. Moving beyond regression techniques in cardiovascular risk prediction: applying machine learning to address analytic challenges. Eur Heart J 2017; 38:1805-1814. [PMID: 27436868 PMCID: PMC5837244 DOI: 10.1093/eurheartj/ehw302] [Citation(s) in RCA: 160] [Impact Index Per Article: 22.9]
Abstract
Risk prediction plays an important role in clinical cardiology research. Traditionally, most risk models have been based on regression models. While useful and robust, these statistical methods are limited to using a small number of predictors, which operate in the same way on everyone and uniformly throughout their range. The purpose of this review is to illustrate the use of machine-learning methods for the development of risk prediction models. Typically presented as black-box approaches, most machine-learning methods are aimed at solving particular challenges in data analysis that are not well addressed by typical regression approaches. To illustrate these challenges, as well as how different methods can address them, we consider predicting mortality after diagnosis of acute myocardial infarction. We use data derived from our institution's electronic health record and abstract data on 13 regularly measured laboratory markers. We walk through different challenges that arise in modelling these data and then introduce different machine-learning approaches. Finally, we discuss general issues in the application of machine-learning methods, including tuning parameters, loss functions, variable importance, and missing data. Overall, this review serves as an introduction to the diffuse field of machine learning for those working on risk modelling.
Affiliation(s)
- Benjamin A. Goldstein: Department of Biostatistics and Bioinformatics, Duke University, 2424 Erwin Road, Suite 1104, Durham, NC 27705, USA; Center for Predictive Medicine, Duke Clinical Research Institute, Durham, NC, USA
- Ann Marie Navar: Center for Predictive Medicine, Duke Clinical Research Institute, Durham, NC, USA
- Rickey E. Carter: Department of Health Sciences Research, Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN, USA
45
Studerus E, Ramyead A, Riecher-Rössler A. Prediction of transition to psychosis in patients with a clinical high risk for psychosis: a systematic review of methodology and reporting. Psychol Med 2017; 47:1163-1178. [PMID: 28091343 DOI: 10.1017/s0033291716003494] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3]
Abstract
BACKGROUND To enhance indicated prevention in patients with a clinical high risk (CHR) for psychosis, recent research efforts have been increasingly directed towards estimating the risk of developing psychosis on an individual level using multivariable clinical prediction models. The aim of this study was to systematically review the methodological quality and reporting of studies developing or validating such models. METHOD A systematic literature search was carried out (up to 14 March 2016) to find all studies that developed or validated a clinical prediction model predicting the transition to psychosis in CHR patients. Data were extracted using a comprehensive item list based on current methodological recommendations. RESULTS A total of 91 studies met the inclusion criteria. None of the retrieved studies performed a true external validation of an existing model. Only three studies (3.5%) had an events-per-variable ratio of at least 10, the recommended minimum to avoid overfitting. Internal validation was performed in only 14 studies (15%), and seven of these used biased internal validation strategies. Other frequently observed modeling approaches not recommended by methodologists included univariable screening of candidate predictors, stepwise variable selection, categorization of continuous variables, and poor handling and reporting of missing data. CONCLUSIONS Our systematic review revealed that poor methods and reporting are widespread in psychosis-prediction research. Since most studies relied on small sample sizes, did not perform internal or external cross-validation, and used poor model development strategies, most published models are probably overfitted and their reported predictive accuracy is likely to be overoptimistic.
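The events-per-variable (EPV) check this review applies is simple to state in code: divide the number of outcome events (here, transitions to psychosis) by the number of candidate predictors, and treat ratios below the cited minimum of 10 as a red flag for overfitting. A trivial stdlib sketch; the function names are illustrative:

```python
def events_per_variable(n_events, n_candidate_predictors):
    """EPV ratio used as a rough guard against overfitting; values below
    ~10 (the minimum cited in this review) suggest the model has too many
    candidate predictors for its number of outcome events."""
    if n_candidate_predictors <= 0:
        raise ValueError("need at least one candidate predictor")
    return n_events / n_candidate_predictors

def risks_overfitting(n_events, n_candidate_predictors, minimum_epv=10):
    """True when the EPV ratio falls below the chosen minimum."""
    return events_per_variable(n_events, n_candidate_predictors) < minimum_epv
```

Note that the count in the denominator is *candidate* predictors considered during model building, not just those retained in the final model; counting only retained predictors understates the overfitting risk.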
Affiliation(s)
- E Studerus
- University of Basel Psychiatric Hospital, Center for Gender Research and Early Detection, Basel, Switzerland
- A Ramyead
- Department of Psychiatry, Weill Institute for Neurosciences, University of California (UCSF), San Francisco, CA, USA
- A Riecher-Rössler
- University of Basel Psychiatric Hospital, Center for Gender Research and Early Detection, Basel, Switzerland
|
46
|
Wynants L, Collins GS, Van Calster B. Key steps and common pitfalls in developing and validating risk models. BJOG 2016; 124:423-432. [DOI: 10.1111/1471-0528.14170] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/24/2016] [Indexed: 01/09/2023]
Affiliation(s)
- L Wynants
- Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven; iMinds Medical IT Department, Leuven, Belgium
- GS Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
- B Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
|
47
|
Van Hoorde K, Van Huffel S, Timmerman D, Bourne T, Van Calster B. A spline-based tool to assess and visualize the calibration of multiclass risk predictions. J Biomed Inform 2015; 54:283-93. [DOI: 10.1016/j.jbi.2014.12.016] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Revised: 12/18/2014] [Accepted: 12/30/2014] [Indexed: 10/24/2022]
|
48
|
Schumacher M. Probability estimation and machine learning - Editorial. Biom J 2014; 56:531-3. [PMID: 24986806 DOI: 10.1002/bimj.201400075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Revised: 04/25/2014] [Accepted: 04/28/2014] [Indexed: 11/09/2022]
Affiliation(s)
- Martin Schumacher
- Institute of Medical Biometry and Statistics, Medical Center, University of Freiburg, Freiburg, Germany
|
49
|
Ziegler A. Rejoinder. Biom J 2014; 56:607-13. [DOI: 10.1002/bimj.201400010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2014] [Revised: 03/01/2014] [Accepted: 03/02/2014] [Indexed: 12/26/2022]
Affiliation(s)
- Andreas Ziegler
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, 23562 Lübeck, Germany
- Zentrum für Klinische Studien, Universität zu Lübeck, Ratzeburger Allee 160, 23562 Lübeck, Germany
|