151
|
Gehringer CK, Martin GP, Hyrich KL, Verstappen SMM, Sexton J, Kristianslund EK, Provan SA, Kvien TK, Sergeant JC. Developing and externally validating multinomial prediction models for methotrexate treatment outcomes in patients with rheumatoid arthritis: results from an international collaboration. J Clin Epidemiol 2024; 166:111239. [PMID: 38072179 DOI: 10.1016/j.jclinepi.2023.111239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 11/23/2023] [Accepted: 12/05/2023] [Indexed: 01/01/2024]
Abstract
OBJECTIVES In rheumatology, there is a clinical need to identify patients at high risk (>50%) of not responding to the first-line therapy methotrexate (MTX) due to lack of disease control or discontinuation due to adverse events (AEs). Despite this need, previous prediction models in this context are at high risk of bias and ignore AEs. Our objectives were to (i) develop a multinomial model for outcomes of low disease activity and discontinuing due to AEs 6 months after starting MTX, (ii) update prognosis 3-month following treatment initiation, and (iii) externally validate these models. STUDY DESIGN AND SETTING A multinomial model for low disease activity (submodel 1) and discontinuing due to AEs (submodel 2) was developed using data from the UK Rheumatoid Arthritis Medication Study, updated using landmarking analysis, internally validated using bootstrapping, and externally validated in the Norwegian Disease-Modifying Antirheumatic Drug register. Performance was assessed using calibration (calibration-slope and calibration-in-the-large), and discrimination (concordance-statistic and polytomous discriminatory index). RESULTS The internally validated model showed good calibration in the development setting with a calibration-slope of 1.01 (0.87, 1.14) (submodel 1) and 0.83 (0.30, 1.34) (submodel 2), and moderate discrimination with a c-statistic of 0.72 (0.69, 0.74) and 0.53 (0.48, 0.59), respectively. Predictive performance decreased after external validation (calibration-slope 0.78 (0.64, 0.93) (submodel 1) and 0.86 (0.34, 1.38) (submodel 2)), which may be due to differences in disease-specific characteristics and outcome prevalence. CONCLUSION We addressed previously identified methodological limitations of prediction models for outcomes of MTX therapy. The multinomial approach predicted outcomes of disease activity more accurately than AEs, which should be addressed in future work to aid implementation into clinical practice.
Collapse
Affiliation(s)
- Celina K Gehringer
- Division of Musculoskeletal and Dermatological Sciences, Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, University of Manchester, Manchester, UK; Centre for Biostatistics, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK.
| | - Glen P Martin
- Division of Informatics, Imaging and Data Sciences, Centre for Health Informatics, University of Manchester, Manchester, UK
| | - Kimme L Hyrich
- Division of Musculoskeletal and Dermatological Sciences, Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, University of Manchester, Manchester, UK; NIHR Manchester Biomedical Research Centre, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
| | - Suzanne M M Verstappen
- Division of Musculoskeletal and Dermatological Sciences, Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, University of Manchester, Manchester, UK; NIHR Manchester Biomedical Research Centre, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
| | - Joseph Sexton
- Center for Treatment of Rheumatic and Musculoskeletal Diseases (REMEDY), Diakonhjemmet Hospital, Oslo, Norway
| | - Eirik K Kristianslund
- Center for Treatment of Rheumatic and Musculoskeletal Diseases (REMEDY), Diakonhjemmet Hospital, Oslo, Norway
| | - Sella A Provan
- Center for Treatment of Rheumatic and Musculoskeletal Diseases (REMEDY), Diakonhjemmet Hospital, Oslo, Norway
| | - Tore K Kvien
- Center for Treatment of Rheumatic and Musculoskeletal Diseases (REMEDY), Diakonhjemmet Hospital, Oslo, Norway; Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Jamie C Sergeant
- Division of Musculoskeletal and Dermatological Sciences, Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, University of Manchester, Manchester, UK; Centre for Biostatistics, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
| |
Collapse
|
152
|
Garrido MM, Legler A, Strombotne KL, Frakt AB. Differences in adverse outcomes across race and ethnicity among Veterans with similar predicted risks of an overdose or suicide-related event. PAIN MEDICINE (MALDEN, MASS.) 2024; 25:125-130. [PMID: 37738604 DOI: 10.1093/pm/pnad129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 08/16/2023] [Accepted: 09/18/2023] [Indexed: 09/24/2023]
Abstract
OBJECTIVE To evaluate the degree to which differences in incidence of mortality and serious adverse events exist across patient race and ethnicity among Veterans Health Administration (VHA) patients receiving outpatient opioid prescriptions and who have similar predicted risks of adverse outcomes. Patients were assigned scores via the VHA Stratification Tool for Opioid Risk Mitigation (STORM), a model used to predict the risk of experiencing overdose- or suicide-related health care events or death. Individuals with the highest STORM risk scores are targeted for case review. DESIGN Retrospective cohort study of high-risk veterans who received an outpatient prescription opioid between 4/2018-3/2019. SETTING All VHA medical centers. PARTICIPANTS In total, 84 473 patients whose estimated risk scores were between 0.0420 and 0.0609, the risk scores associated with the top 5%-10% of risk in the STORM development sample. METHODS We examined the expected probability of mortality and serious adverse events (SAEs; overdose or suicide-related events) given a patient's risk score and race. RESULTS Given a similar risk score, Black patients were less likely than White patients to have a recorded SAE within 6 months of risk score calculation. Black, Hispanic, and Asian patients were less likely than White patients with similar risk scores to die within 6 months of risk score calculation. Some of the mortality differences were driven by age differences in the composition of racial and ethnic groups in our sample. CONCLUSIONS Our results suggest that relying on the STORM model to identify patients who may benefit from an interdisciplinary case review may identify patients with clinically meaningful differences in outcome risk across race and ethnicity.
Collapse
Affiliation(s)
- Melissa M Garrido
- Partnered Evidence-based Policy Resource Center, VA Boston Healthcare System, Boston, MA 02130, United States
- Department of Health Law, Policy and Management, Boston University School of Public Health, Boston, MA 02118, United States
| | - Aaron Legler
- Partnered Evidence-based Policy Resource Center, VA Boston Healthcare System, Boston, MA 02130, United States
| | - Kiersten L Strombotne
- Partnered Evidence-based Policy Resource Center, VA Boston Healthcare System, Boston, MA 02130, United States
- Department of Health Law, Policy and Management, Boston University School of Public Health, Boston, MA 02118, United States
| | - Austin B Frakt
- Partnered Evidence-based Policy Resource Center, VA Boston Healthcare System, Boston, MA 02130, United States
- Department of Health Law, Policy and Management, Boston University School of Public Health, Boston, MA 02118, United States
- Department of Health Policy and Management, Harvard T.H. Chan School of Public Health, Cambridge, MA 02115, United States
| |
Collapse
|
153
|
Zakai NA, Wilkinson K, Sparks AD, Packer RT, Koh I, Roetker NS, Repp AB, Thomas R, Holmes CE, Cushman M, Plante TB, Al-Samkari H, Pishko AM, Wood WA, Masias C, Gangaraju R, Li A, Garcia D, Wiggins KL, Schaefer JK, Hooper C, Smith NL, McClure LA. Development and validation of a risk model for hospital-acquired venous thrombosis: the Medical Inpatients Thrombosis and Hemostasis study. J Thromb Haemost 2024; 22:503-515. [PMID: 37918635 PMCID: PMC10872863 DOI: 10.1016/j.jtha.2023.10.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 10/04/2023] [Accepted: 10/20/2023] [Indexed: 11/04/2023]
Abstract
BACKGROUND Regulatory organizations recommend assessing hospital-acquired (HA) venous thromboembolism (VTE) risk for medical inpatients. OBJECTIVES To develop and validate a risk assessment model (RAM) for HA-VTE in medical inpatients using objective and assessable risk factors knowable at admission. METHODS The development cohort included people admitted to medical services at the University of Vermont Medical Center (Burlington, Vermont) between 2010 and 2019, and the validation cohorts included people admitted to Hennepin County Medical Center (Minneapolis, Minnesota), University of Michigan Medical Center (Ann Arbor, Michigan), and Harris Health Systems (Houston, Texas). Individuals with VTE at admission, aged <18 years, and admitted for <1 midnight were excluded. We used a Bayesian penalized regression technique to select candidate HA-VTE risk factors for final inclusion in the RAM. RESULTS The development cohort included 60 633 admissions and 227 HA-VTE, and the validation cohorts included 111 269 admissions and 651 HA-VTE. Seven HA-VTE risk factors with t statistics ≥1.5 were included in the RAM: history of VTE, low hemoglobin level, elevated creatinine level, active cancer, hyponatremia, increased red cell distribution width, and malnutrition. The areas under the receiver operating characteristic curve and calibration slope were 0.72 and 1.10, respectively. The areas under the receiver operating characteristic curve and calibration slope were 0.70 and 0.93 at Hennepin County Medical Center, 0.70 and 0.87 at the University of Michigan Medical Center, and 0.71 and 1.00 at Harris Health Systems, respectively. The RAM performed well stratified by age, sex, and race. CONCLUSION We developed and validated a RAM for HA-VTE in medical inpatients. By quantifying risk, clinicians can determine the potential benefits of measures to reduce HA-VTE.
Collapse
Affiliation(s)
- Neil A Zakai
- Department of Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Pathology & Laboratory Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Medicine, University of Vermont Medical Center, Burlington, Vermont, USA.
| | - Katherine Wilkinson
- Department of Pathology & Laboratory Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA
| | - Andrew D Sparks
- Department of Medical Biostatistics, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA
| | - Ryan T Packer
- Department of Pathology & Laboratory Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA
| | - Insu Koh
- Department of Pathology & Laboratory Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; SyllogisTeks, Chesterfield, Missouri, USA
| | - Nicholas S Roetker
- Chronic Disease Research Group, Hennepin Healthcare Research Institute, Minneapolis, Minnesota, USA
| | - Allen B Repp
- Department of Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Medicine, University of Vermont Medical Center, Burlington, Vermont, USA
| | - Ryan Thomas
- Department of Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Medicine, University of Vermont Medical Center, Burlington, Vermont, USA
| | - Chris E Holmes
- Department of Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Medicine, University of Vermont Medical Center, Burlington, Vermont, USA
| | - Mary Cushman
- Department of Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Pathology & Laboratory Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Medicine, University of Vermont Medical Center, Burlington, Vermont, USA
| | - Timothy B Plante
- Department of Medicine, Larner College of Medicine at the University of Vermont, Burlington, Vermont, USA; Department of Medicine, University of Vermont Medical Center, Burlington, Vermont, USA
| | - Hanny Al-Samkari
- Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Allyson M Pishko
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - William A Wood
- Department of Medicine, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Camila Masias
- Miami Cancer Institute, Baptist Health South Florida, Coral Gables, Florida, USA
| | - Radhika Gangaraju
- Institute for Cancer Outcomes and Survivorship, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Ang Li
- Section of Hematology-Oncology, Baylor College of Medicine, Houston, Texas, USA
| | - David Garcia
- Division of Hematology, University of Washington School of Medicine, Seattle, Washington, USA
| | - Kerri L Wiggins
- Department of Medicine, University of Washington, Seattle, Washington, USA
| | - Jordan K Schaefer
- Division of Hematology/Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA
| | - Craig Hooper
- Division of Blood Disorders, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Nicholas L Smith
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, Washington, USA; Department of Epidemiology, University of Washington, Seattle, Washington, USA; Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, Washington, USA
| | - Leslie A McClure
- Department of Epidemiology and Biostatistics, Drexel University Dornsife School of Public Health, Philadelphia, Pennsylvania, USA
| |
Collapse
|
154
|
Loos NL, Hoogendam L, Souer JS, van Uchelen JH, Slijper HP, Wouters RM, Selles RW. Algorithm Versus Expert: Machine Learning Versus Surgeon-Predicted Symptom Improvement After Carpal Tunnel Release. Neurosurgery 2024; 95:00006123-990000000-01037. [PMID: 38299861 PMCID: PMC11155572 DOI: 10.1227/neu.0000000000002848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 12/12/2023] [Indexed: 02/02/2024] Open
Abstract
BACKGROUND AND OBJECTIVES Surgeons rely on clinical experience when making predictions about treatment effects. Incorporating algorithm-based predictions of symptom improvement after carpal tunnel release (CTR) could support medical decision-making. However, these algorithm-based predictions need to outperform predictions made by surgeons to add value. We compared predictions of a validated prediction model for symptom improvement after CTR with predictions made by surgeons. METHODS This cohort study included 97 patients scheduled for CTR. Preoperatively, surgeons estimated each patient's probability of improvement 6 months after surgery, defined as reaching the minimally clinically important difference on the Boston Carpal Tunnel Syndrome Symptom Severity Score. We assessed model and surgeon performance using calibration (calibration belts), discrimination (area under the curve [AUC]), sensitivity, and specificity. In addition, we assessed the net benefit of decision-making based on the prediction model's estimates vs the surgeon's judgement. RESULTS The surgeon predictions had poor calibration and suboptimal discrimination (AUC 0.62, 95%-CI 0.49-0.74), while the prediction model showed good calibration and appropriate discrimination (AUC 0.77, 95%-CI 0.66-0.89, P = .05). The accuracy of surgeon predictions was 0.65 (95%-CI 0.37-0.78) vs 0.78 (95%-CI 0.67-0.89) for the prediction model ( P = .03). The sensitivity of surgeon predictions and the prediction model was 0.72 (95%-CI 0.15-0.96) and 0.85 (95%-CI 0.62-0.97), respectively ( P = .04). The specificity of the surgeon predictions was similar to the model's specificity ( P = .25). The net benefit analysis showed better decision-making based on the prediction model compared with the surgeons' decision-making (ie, more correctly predicted improvements and/or fewer incorrectly predicted improvements). CONCLUSION The prediction model outperformed surgeon predictions of improvement after CTR in terms of calibration, accuracy, and sensitivity. Furthermore, the net benefit analysis indicated that using the prediction model instead of relying solely on surgeon decision-making increases the number of patients who will improve after CTR, without increasing the number of unnecessary surgeries.
Collapse
Affiliation(s)
- Nina Louisa Loos
- Department of Rehabilitation Medicine, Erasmus MC, Rotterdam, The Netherlands
- Department of Plastic and Reconstructive Surgery and Hand Surgery, Erasmus MC, Rotterdam, The Netherlands
| | - Lisa Hoogendam
- Department of Rehabilitation Medicine, Erasmus MC, Rotterdam, The Netherlands
- Department of Plastic and Reconstructive Surgery and Hand Surgery, Erasmus MC, Rotterdam, The Netherlands
- Hand and Wrist Center, Xpert Clinics, Eindhoven, The Netherlands
| | | | | | | | - Robbert Maarten Wouters
- Department of Rehabilitation Medicine, Erasmus MC, Rotterdam, The Netherlands
- Department of Plastic and Reconstructive Surgery and Hand Surgery, Erasmus MC, Rotterdam, The Netherlands
| | - Ruud Willem Selles
- Department of Rehabilitation Medicine, Erasmus MC, Rotterdam, The Netherlands
- Department of Plastic and Reconstructive Surgery and Hand Surgery, Erasmus MC, Rotterdam, The Netherlands
| |
Collapse
|
155
|
Karabacak M, Jagtiani P, Margetis K. The Predictive Abilities of Machine Learning Algorithms in Patients with Thoracolumbar Spinal Cord Injuries. World Neurosurg 2024; 182:e67-e90. [PMID: 38030070 DOI: 10.1016/j.wneu.2023.11.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 11/08/2023] [Accepted: 11/09/2023] [Indexed: 12/01/2023]
Abstract
OBJECTIVES The goal of this study is to implement machine learning (ML) algorithms to predict mortality, non-home discharge, prolonged length of stay (LOS), prolonged length of intensive care unit stay (ICU-LOS), and major complications in patients diagnosed with thoracolumbar spinal cord injury, while creating a publicly accessible online tool. METHODS The American College of Surgeons Trauma Quality Program database was used to identify patients with thoracolumbar spinal cord injury. Feature selection was performed with the Least Absolute Shrinkage and Selection Operator algorithm. Five ML algorithms, including TabPFN, TabNet, XGBoost, LightGBM, and Random Forest, were used along with the Optuna optimization library for hyperparameter tuning. RESULTS A total of 147,819 patients were included in the analysis. For each outcome, we determined the best model for deployment in our web application based on the area under the receiver operating characteristic (AUROC) values. The top performing algorithms were as follows: LightGBM for mortality with an AUROC of 0.885, TabPFN for non-home discharge with an AUROC of 0.801, LightGBM for prolonged LOS with an AUROC of 0.673, Random Forest for prolonged ICU-LOS with an AUROC of 0.664, and LightGBM for major complications with an AUROC of 0.73. CONCLUSIONS ML models demonstrate good predictive ability for in-hospital mortality and non-home discharge, fair predictive ability for major complications and prolonged ICU-LOS, but poor predictive ability for prolonged LOS. We have developed a web application that allows these models to be accessed.
Collapse
Affiliation(s)
- Mert Karabacak
- Department of Neurosurgery, Mount Sinai Health System, New York, New York, USA
| | - Pemla Jagtiani
- School of Medicine, SUNY Downstate Health Sciences University, New York, New York, USA
| | | |
Collapse
|
156
|
van Trier TJ, Snaterse M, Boekholdt SM, Scholte op Reimer WJM, Hageman SHJ, Visseren FLJ, Dorresteijn JAN, Peters RJG, Jørstad HT. Validation of Systematic Coronary Risk Evaluation 2 (SCORE2) and SCORE2-Older Persons in the EPIC-Norfolk prospective population cohort. Eur J Prev Cardiol 2024; 31:182-189. [PMID: 37793098 PMCID: PMC10809184 DOI: 10.1093/eurjpc/zwad318] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 08/30/2023] [Accepted: 09/25/2023] [Indexed: 10/06/2023]
Abstract
AIMS The European Systematic Coronary Risk Evaluation 2 (SCORE2) and SCORE2-Older Persons (OP) models are recommended to identify individuals at high 10-year risk for cardiovascular disease (CVD). Independent validation and assessment of clinical utility is needed. This study aims to assess discrimination, calibration, and clinical utility of low-risk SCORE2 and SCORE2-OP. METHODS AND RESULTS Validation in individuals aged 40-69 years (SCORE2) and 70-79 years (SCORE2-OP) without baseline CVD or diabetes from the European Prospective Investigation of Cancer (EPIC) Norfolk prospective population study. We compared 10-year CVD risk estimates with observed outcomes (cardiovascular mortality, non-fatal myocardial infarction, and stroke). For SCORE2, 19 560 individuals (57% women) had 10-year CVD risk estimates of 3.7% [95% confidence interval (CI) 3.6-3.7] vs. observed 3.8% (95% CI 3.6-4.1) [observed (O)/expected (E) ratio 1.0 (95% CI 1.0-1.1)]. The area under the curve (AUC) was 0.75 (95% CI 0.74-0.77), with underestimation of risk in men [O/E 1.4 (95% CI 1.3-1.6)] and overestimation in women [O/E 0.7 (95% CI 0.6-0.8)]. Decision curve analysis (DCA) showed clinical benefit. Systematic Coronary Risk Evaluation 2-Older Persons in 3113 individuals (58% women) predicted 10-year CVD events in 10.2% (95% CI 10.1-10.3) vs. observed 15.3% (95% CI 14.0-16.5) [O/E ratio 1.6 (95% CI 1.5-1.7)]. The AUC was 0.63 (95% CI 0.60-0.65) with underestimation of risk across sex and risk ranges. Decision curve analysis showed limited clinical benefit. CONCLUSION In a UK population cohort, the SCORE2 low-risk model showed fair discrimination and calibration, with clinical benefit for preventive treatment initiation decisions. In contrast, in individuals aged 70-79 years, SCORE2-OP demonstrated poor discrimination, underestimated risk in both sexes, and limited clinical utility.
Collapse
Affiliation(s)
- Tinka J van Trier
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam Cardiovascular Sciences, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Marjolein Snaterse
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam Cardiovascular Sciences, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - S Matthijs Boekholdt
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam Cardiovascular Sciences, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Wilma J M Scholte op Reimer
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam Cardiovascular Sciences, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- HU University of Applied Sciences Utrecht, Research Group Chronic Diseases, Padualaan 99, 3584 CH Utrecht, The Netherlands
| | - Steven H J Hageman
- Department of Vascular Medicine, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| | - Frank L J Visseren
- Department of Vascular Medicine, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| | - Jannick A N Dorresteijn
- Department of Vascular Medicine, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| | - Ron J G Peters
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam Cardiovascular Sciences, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Harald T Jørstad
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam Cardiovascular Sciences, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| |
Collapse
|
157
|
Riley RD, Snell KIE, Archer L, Ensor J, Debray TPA, van Calster B, van Smeden M, Collins GS. Evaluation of clinical prediction models (part 3): calculating the sample size required for an external validation study. BMJ 2024; 384:e074821. [PMID: 38253388 DOI: 10.1136/bmj-2023-074821] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Affiliation(s)
- Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Kym I E Snell
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Lucinda Archer
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Joie Ensor
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Thomas P A Debray
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Ben van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, Netherlands
| | - Maarten van Smeden
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| |
Collapse
|
158
|
Skarping I, Ellbrant J, Dihge L, Ohlsson M, Huss L, Bendahl PO, Rydén L. Retrospective validation study of an artificial neural network-based preoperative decision-support tool for noninvasive lymph node staging (NILS) in women with primary breast cancer (ISRCTN14341750). BMC Cancer 2024; 24:86. [PMID: 38229058 DOI: 10.1186/s12885-024-11854-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 01/07/2024] [Indexed: 01/18/2024] Open
Abstract
BACKGROUND Surgical sentinel lymph node biopsy (SLNB) is routinely used to reliably stage axillary lymph nodes in early breast cancer (BC). However, SLNB may be associated with postoperative arm morbidities. For most patients with BC undergoing SLNB, the findings are benign, and the procedure is currently questioned. A decision-support tool for the prediction of benign sentinel lymph nodes based on preoperatively available data has been developed using artificial neural network modelling. METHODS This was a retrospective geographical and temporal validation study of the noninvasive lymph node staging (NILS) model, based on preoperatively available data from 586 women consecutively diagnosed with primary BC at two sites. Ten preoperative clinicopathological characteristics from each patient were entered into the web-based calculator, and the probability of benign lymph nodes was predicted. The performance of the NILS model was assessed in terms of discrimination with the area under the receiver operating characteristic curve (AUC) and calibration, that is, comparison of the observed and predicted event rates of benign axillary nodal status (N0) using calibration slope and intercept. The primary endpoint was axillary nodal status (discrimination, benign [N0] vs. metastatic axillary nodal status [N+]) determined by the NILS model compared to nodal status by definitive pathology. RESULTS The mean age of the women in the cohort was 65 years, and most of them (93%) had luminal cancers. Approximately three-fourths of the patients had no metastases in SLNB (N0 74% and 73%, respectively). The AUC for the predicted probabilities for the whole cohort was 0.6741 (95% confidence interval: 0.6255-0.7227). More than one in four patients (n = 151, 26%) were identified as candidates for SLNB omission when applying the predefined cut-off for lymph node-negative status from the development cohort. The NILS model showed the best calibration in patients with a predicted high probability of healthy axilla. CONCLUSION The performance of the NILS model was satisfactory. In approximately every fourth patient, SLNB could potentially be omitted. Considering the shift from postoperatively to preoperatively available predictors in this validation study, we have demonstrated the robustness of the NILS model. The clinical usability of the web interface will be evaluated before its clinical implementation. TRIAL REGISTRATION Registered in the ISRCTN registry with study ID ISRCTN14341750. Date of registration 23/11/2018.
Collapse
Affiliation(s)
- Ida Skarping
- Department of Clinical Sciences Lund, Division of Surgery, Lund University, Lund, Sweden.
- Department of Clinical Physiology and Nuclear Medicine, Skåne University Hospital, Lund, Sweden.
| | - Julia Ellbrant
- Department of Clinical Sciences Lund, Division of Surgery, Lund University, Lund, Sweden
- Department of Surgery, Skåne University Hospital, Malmö, Sweden
| | - Looket Dihge
- Department of Clinical Sciences Lund, Division of Surgery, Lund University, Lund, Sweden
- Department of Plastic and Reconstructive Surgery, Skåne University Hospital, Malmö, Sweden
| | - Mattias Ohlsson
- Department of Astronomy and Theoretical Physics, Division of Computational Biology and Biological Physics, Lund University, Lund, Sweden
| | - Linnea Huss
- Division of Surgery, Department of Clinical Sciences Helsingborg, Lund University, Lund, Sweden
- Department of Surgery, Helsingborg General Hospital, Helsingborg, Sweden
| | - Pär-Ola Bendahl
- Division of Oncology, Department of Clinical Sciences, Lund University, Lund, Sweden
| | - Lisa Rydén
- Department of Clinical Sciences Lund, Division of Surgery, Lund University, Lund, Sweden
- Department of Surgery and Gastroenterology, Skåne University Hospital, Malmö, Sweden
| |
Collapse
|
159
|
Riley RD, Archer L, Snell KIE, Ensor J, Dhiman P, Martin GP, Bonnett LJ, Collins GS. Evaluation of clinical prediction models (part 2): how to undertake an external validation study. BMJ 2024; 384:e074820. [PMID: 38224968 PMCID: PMC10788734 DOI: 10.1136/bmj-2023-074820] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/13/2023] [Indexed: 01/17/2024]
Affiliation(s)
- Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Lucinda Archer
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Kym I E Snell
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Joie Ensor
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Paula Dhiman
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Glen P Martin
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - Laura J Bonnett
- Department of Biostatistics, University of Liverpool, Liverpool, UK
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| |
Collapse
|
160
|
Cheng X, Chen Y, Xu J, Cai D, Liu Z, Zeng H, Yao J, Song B. Development and validation of a predictive model based on clinical and MpMRI findings to reduce additional systematic prostate biopsy. Insights Imaging 2024; 15:3. [PMID: 38185753 PMCID: PMC10772021 DOI: 10.1186/s13244-023-01544-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 10/21/2023] [Indexed: 01/09/2024] Open
Abstract
OBJECTIVES To develop and validate a predictive model based on clinical features and multiparametric magnetic resonance imaging (mpMRI) to reduce unnecessary systematic biopsies (SBs) in biopsy-naïve patients with suspected prostate cancer (PCa). METHODS A total of 274 patients who underwent combined cognitive MRI-targeted biopsy (MRTB) with SB were retrospectively enrolled and temporally split into development (n = 201) and validation (n = 73) cohorts. Multivariable logistic regression analyses were used to determine independent predictors of clinically significant PCa (csPCa) on cognitive MRTB, and the clinical, MRI, and combined models were established respectively. Area under the receiver operating characteristic curve (AUC), calibration plots, and decision curve analyses were assessed. RESULTS Prostate imaging data and reporting system (PI-RADS) score, index lesion (IL) on the peripheral zone, age, and prostate-specific antigen density (PSAD) were independent predictors and included in the combined model. The combined model achieved the best discrimination (AUC 0.88) as compared to both the MRI model incorporated by PI-RADS score, IL level, and zone (AUC 0.86) and the clinical model incorporated by age and PSAD (AUC 0.70). The combined model also showed good calibration and enabled great net benefit. Applying the combined model as a reference for performing MRTB alone with a cutoff of 60% would reduce 43.8% of additional SB, while missing 2.9% csPCa. CONCLUSIONS The combined model based on clinical and mpMRI findings improved csPCa prediction and might be useful in making a decision about which patient could safely avoid unnecessary SB in addition to MRTB in biopsy-naïve patients. CRITICAL RELEVANCE STATEMENT The combined model based on clinical and mpMRI findings improved csPCa prediction and might be useful in making a decision about which patient could safely avoid unnecessary SB in addition to MRTB in biopsy-naïve patients. KEY POINTS • Age, PSAD, PI-RADS score, and peripheral index lesion were independent predictors of csPCa. • Risk models were used to predict the probability of detecting csPCa on cognitive MRTB. • The combined model might reduce 43.8% of unnecessary SBs, while missing 2.9% csPCa.
Collapse
Affiliation(s)
- Xueqing Cheng
- Department of Radiology, West China Hospital of Sichuan University, No. 37 Guoxue Street, Chengdu, 610041, Sichuan, China
- Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Yuntian Chen
- Department of Radiology, West China Hospital of Sichuan University, No. 37 Guoxue Street, Chengdu, 610041, Sichuan, China
- Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Jinshun Xu
- Department of Ultrasound, Sichuan Cancer Hospital, Chengdu, Sichuan, China
| | - Diming Cai
- Department of Ultrasound, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Zhenhua Liu
- Department of Urology, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Hao Zeng
- Department of Urology, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Jin Yao
- Department of Radiology, West China Hospital of Sichuan University, No. 37 Guoxue Street, Chengdu, 610041, Sichuan, China.
| | - Bin Song
- Department of Radiology, West China Hospital of Sichuan University, No. 37 Guoxue Street, Chengdu, 610041, Sichuan, China.
- Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, Sichuan, China.
- Department of Radiology, Sanya People's Hospital, Sanya, Hainan, China.
| |
Collapse
|
161
|
Gwilym BL, Pallmann P, Waldron CA, Thomas-Jones E, Milosevic S, Brookes-Howell L, Harris D, Massey I, Burton J, Stewart P, Samuel K, Jones S, Cox D, Clothier A, Prout H, Edwards A, Twine CP, Bosanquet DC. Long-term risk prediction after major lower limb amputation: 1-year results of the PERCEIVE study. BJS Open 2024; 8:zrad135. [PMID: 38266124 PMCID: PMC10807997 DOI: 10.1093/bjsopen/zrad135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 10/22/2023] [Indexed: 01/26/2024] Open
Abstract
BACKGROUND Decision-making when considering major lower limb amputation is complex and requires individualized outcome estimation. It is unknown how accurate healthcare professionals or relevant outcome prediction tools are at predicting outcomes at 1-year after major lower limb amputation. METHODS An international, multicentre prospective observational study evaluating healthcare professional accuracy in predicting outcomes 1 year after major lower limb amputation and evaluation of relevant outcome prediction tools identified in a systematic search of the literature was undertaken. Observed outcomes at 1 year were compared with: healthcare professionals' preoperative predictions of death (surgeons and anaesthetists), major lower limb amputation revision (surgeons) and ambulation (surgeons, specialist physiotherapists and vascular nurse practitioners); and probabilities calculated from relevant outcome prediction tools. RESULTS A total of 537 patients and 2244 healthcare professional predictions of outcomes were included. Surgeons and anaesthetists had acceptable discrimination (C-statistic = 0.715), calibration and overall performance (Brier score = 0.200) when predicting 1-year death, but performed worse when predicting major lower limb amputation revision and ambulation (C-statistics = 0.627 and 0.662 respectively). Healthcare professionals overestimated the death and major lower limb amputation revision risks. Consultants outperformed trainees, especially when predicting ambulation. Allied healthcare professionals marginally outperformed surgeons in predicting ambulation. Two outcome prediction tools (C-statistics = 0.755 and 0.717, Brier scores = 0.158 and 0.178) outperformed healthcare professionals' discrimination, calibration and overall performance in predicting death. Two outcome prediction tools for ambulation (C-statistics = 0.688 and 0.667) marginally outperformed healthcare professionals. CONCLUSION There is uncertainty in predicting 1-year outcomes following major lower limb amputation. Different professional groups performed comparably in this study. Two outcome prediction tools for death and two for ambulation outperformed healthcare professionals and may support shared decision-making.
Collapse
Affiliation(s)
- Brenig Llwyd Gwilym
- School of Medicine, Cardiff University, Cardiff, UK
- Gwent Vascular Institute, Royal Gwent Hospital, Aneurin Bevan University Health Board, Newport, UK
| | | | | | | | | | | | - Debbie Harris
- Centre for Trials Research, Cardiff University, Cardiff, UK
| | - Ian Massey
- Artificial Limb and Appliance Centre, Rookwood Hospital, Cardiff and Vale University Health Board, Cardiff, UK
| | - Jo Burton
- Artificial Limb and Appliance Centre, Rookwood Hospital, Cardiff and Vale University Health Board, Cardiff, UK
| | - Phillippa Stewart
- Artificial Limb and Appliance Centre, Rookwood Hospital, Cardiff and Vale University Health Board, Cardiff, UK
| | - Katie Samuel
- Department of Anaesthesia, North Bristol NHS Trust, Bristol, UK
| | - Sian Jones
- C/O INVOLVE Health and Care Research Wales, Cardiff, UK
| | - David Cox
- C/O INVOLVE Health and Care Research Wales, Cardiff, UK
| | | | - Hayley Prout
- Centre for Trials Research, Cardiff University, Cardiff, UK
| | - Adrian Edwards
- Division of Population Medicine, Cardiff University, Cardiff, UK
| | - Christopher P Twine
- Bristol, Bath and Weston Vascular Network, North Bristol NHS Trust, Southmead Hospital, Bristol, UK
| | | |
Collapse
|
162
|
van Beek S, Nieboer D, Klimek M, Stolker RJ, Mijderwijk HJ. Development and external validation of a clinical prediction model for predicting quality of recovery up to 1 week after surgery. Sci Rep 2024; 14:387. [PMID: 38172591 PMCID: PMC10764891 DOI: 10.1038/s41598-023-50518-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 12/20/2023] [Indexed: 01/05/2024] Open
Abstract
The Quality of Recovery Score-40 (QoR-40) has been increasingly used for assessing recovery after patients undergoing surgery. However, a prediction model estimating quality of recovery is lacking. The aim of the present study was to develop and externally validate a clinical prediction model that predicts quality of recovery up to one week after surgery. The modelling procedure consisted of two models of increasing complexity (basic and full model). To assess the internal validity of the developed model, bootstrapping (1000 times) was applied. At external validation, the model performance was evaluated according to measures for overall model performance (explained variance (R2)) and calibration (calibration plot and slope). The full model consisted of age, sex, previous surgery, BMI, ASA classification, duration of surgery, HADS and preoperative QoR-40 score. At model development, the R2 of the full model was 0.24. At external validation the R2 dropped as expected. The calibration analysis showed that the QoR-40 predictions provided by the developed prediction models are reliable. The presented models can be used as a starting point for future updating in prediction studies. When the predictive performance is improved it could be implemented clinically in the future.
Collapse
Affiliation(s)
- Stefan van Beek
- Department of Anaesthesiology, Erasmus University Medical Centre, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands.
| | - Daan Nieboer
- Department of Public Health, Erasmus University Medical Centre, Rotterdam, The Netherlands
| | - Markus Klimek
- Department of Anaesthesiology, Erasmus University Medical Centre, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Robert Jan Stolker
- Department of Anaesthesiology, Erasmus University Medical Centre, Dr. Molewaterplein 40, 3015 GD, Rotterdam, The Netherlands
| | - Hendrik-Jan Mijderwijk
- Department of Neurosurgery, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
| |
Collapse
|
163
|
Xiao Y, Lin H, Li J, Wu J. Disulfidptosis-related prognostic signature correlates with immunotherapy response in colorectal cancer. Sci Rep 2024; 14:81. [PMID: 38168553 PMCID: PMC10762008 DOI: 10.1038/s41598-023-49954-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 12/13/2023] [Indexed: 01/05/2024] Open
Abstract
Disulfidptosis (DSP), a form of cell death caused by disulphide stress, plays an important role in tumour progression. However, the mechanisms by which DSP regulates the tumour microenvironment remain unclear. Thus, we analysed the transcriptome profiles and clinical data, which were obtained from the TCGA database, of 540 patients with colorectal cancer. Compared with the patients with low DSP expression, those with high DSP expression exhibited significantly better survival outcomes; lower stromal and ESTIMATE scores; significantly higher numbers of CD4+ T cells, M2 macrophages, dendritic cells, and neutrophils; higher expression of immune checkpoint-related genes; and lower Tregs and HLA-DQB2 levels. A prognostic signature established based on DSP-related genes demonstrated an increase in risk score with a higher clinical stage. Risk scores negatively correlated with dendritic cells, eosinophils, and CD4+ T cells and significantly positively correlated with Treg cells. Patients with higher risk scores experienced significantly worse survival outcomes and immunotherapy non-response. Our nomogram model, combining clinicopathological features and risk scores, exhibited robust prognostic and predictive power. In conclusion, DSP-related genes actively participated in regulating the tumour microenvironment. Thus, they can serve as biomarkers to provide targeted treatment for colorectal cancer.
Collapse
Affiliation(s)
- Yu Xiao
- Department of Radiation Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, China
| | - Hancui Lin
- Department of Radiation Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, China
| | - Jinluan Li
- Department of Radiation Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, China.
| | - Junxin Wu
- Department of Radiation Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, China.
| |
Collapse
|
164
|
Deisenhofer AK, Barkham M, Beierl ET, Schwartz B, Aafjes-van Doorn K, Beevers CG, Berwian IM, Blackwell SE, Bockting CL, Brakemeier EL, Brown G, Buckman JEJ, Castonguay LG, Cusack CE, Dalgleish T, de Jong K, Delgadillo J, DeRubeis RJ, Driessen E, Ehrenreich-May J, Fisher AJ, Fried EI, Fritz J, Furukawa TA, Gillan CM, Gómez Penedo JM, Hitchcock PF, Hofmann SG, Hollon SD, Jacobson NC, Karlin DR, Lee CT, Levinson CA, Lorenzo-Luaces L, McDanal R, Moggia D, Ng MY, Norris LA, Patel V, Piccirillo ML, Pilling S, Rubel JA, Salazar-de-Pablo G, Schleider JL, Schnurr PP, Schueller SM, Siegle GJ, Uher R, Watkins E, Webb CA, Wiltsey Stirman S, Wynants L, Youn SJ, Zilcha-Mano S, Lutz W, Cohen ZD. Implementing precision methods in personalizing psychological therapies: Barriers and possible ways forward. Behav Res Ther 2024; 172:104443. [PMID: 38086157 DOI: 10.1016/j.brat.2023.104443] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 11/21/2023] [Accepted: 11/27/2023] [Indexed: 12/26/2023]
Affiliation(s)
| | | | | | | | | | | | | | | | - Claudi L Bockting
- AmsterdamUMC, Department of Psychiatry, Research Program Amsterdam Public Health and Centre for Urban Mental Health, University of Amsterdam, the Netherlands
| | | | | | | | | | | | | | - Kim de Jong
- Leiden University, Institute of Psychology, USA
| | | | | | | | | | | | | | - Jessica Fritz
- University of Cambridge, UK; Philipps University of Marburg, Germany
| | | | - Claire M Gillan
- School of Psychology, Trinity College Institute for Neuroscience, And Global Brain Health Institute, Trinity College Dublin, USA
| | | | | | | | | | | | | | | | | | | | | | | | - Mei Yi Ng
- Florida International University, USA
| | | | | | | | | | | | | | - Jessica L Schleider
- Stony Brook University and Feinberg School of Medicine Northwestern University, USA
| | - Paula P Schnurr
- National Center for PTSD and Geisel School of Medicine at Dartmouth, USA
| | | | | | | | | | | | | | | | - Soo Jeong Youn
- Reliant Medical Group, OptumCare and Harvard Medical School, USA
| | | | | | - Zachary D Cohen
- University of California, Los Angeles and University of Arizona, USA.
| |
Collapse
|
165
|
Karabacak M, Jagtiani P, Di L, Shah AH, Komotar RJ, Margetis K. Advancing precision prognostication in neuro-oncology: Machine learning models for data-driven personalized survival predictions in IDH-wildtype glioblastoma. Neurooncol Adv 2024; 6:vdae096. [PMID: 38983675 PMCID: PMC11232516 DOI: 10.1093/noajnl/vdae096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/11/2024] Open
Abstract
Background Glioblastoma (GBM) remains associated with a dismal prognoses despite standard therapies. While population-level survival statistics are established, generating individualized prognosis remains challenging. We aim to develop machine learning (ML) models that generate personalized survival predictions for GBM patients to enhance prognostication. Methods Adult patients with histologically confirmed IDH-wildtype GBM from the National Cancer Database (NCDB) were analyzed. ML models were developed with TabPFN, TabNet, XGBoost, LightGBM, and Random Forest algorithms to predict mortality at 6, 12, 18, and 24 months postdiagnosis. SHapley Additive exPlanations (SHAP) were employed to enhance the interpretability of the models. Models were primarily evaluated using the area under the receiver operating characteristic (AUROC) values, and the top-performing models indicated by the highest AUROCs for each outcome were deployed in a web application that was created for individualized predictions. Results A total of 7537 patients were retrieved from the NCDB. Performance evaluation revealed the top-performing models for each outcome were built using the TabPFN algorithm. The TabPFN models yielded mean AUROCs of 0.836, 0.78, 0.732, and 0.724 in predicting 6, 12, 18, and 24 month mortality, respectively. Conclusions This study establishes ML models tailored to individual patients to enhance GBM prognostication. Future work should focus on external validation and dynamic updating as new data emerge.
Collapse
Affiliation(s)
- Mert Karabacak
- Department of Neurosurgery, Mount Sinai Health System, New York, New York, USA
| | - Pemla Jagtiani
- School of Medicine, SUNY Downstate Health Sciences University, New York, New York, USA
| | - Long Di
- Department of Neurological Surgery, University of Miami Miller School of Medicine, Miami, Florida, USA
| | - Ashish H Shah
- Department of Neurological Surgery, University of Miami Miller School of Medicine, Miami, Florida, USA
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, Florida, USA
| | - Ricardo J Komotar
- Department of Neurological Surgery, University of Miami Miller School of Medicine, Miami, Florida, USA
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, Florida, USA
| | | |
Collapse
|
166
|
Sabbagh A, Tilki D, Feng J, Huland H, Graefen M, Wiegel T, Böhmer D, Hong JC, Valdes G, Cowan JE, Cooperberg M, Feng FY, Mohammad T, Shelan M, D'Amico AV, Carroll PR, Mohamad O. Multi-institutional Development and External Validation of a Machine Learning Model for the Prediction of Distant Metastasis in Patients Treated by Salvage Radiotherapy for Biochemical Failure After Radical Prostatectomy. Eur Urol Focus 2024; 10:66-74. [PMID: 37507248 DOI: 10.1016/j.euf.2023.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 05/30/2023] [Accepted: 07/13/2023] [Indexed: 07/30/2023]
Abstract
BACKGROUND Up to 40% of patients with prostate cancer may develop biochemical recurrence after surgery, with salvage radiation therapy (SRT) being the only curative option. In 2016, Tendulkar et al. (Contemporary update of a multi-institutional predictive nomogram for salvage radiotherapy after radical prostatectomy. J Clin Oncol 2016;34:3648-54) published a nomogram to predict distant metastasis in a cohort of patients treated with SRT with pre-SRT prostate-specific antigen (PSA) of 0.5 ng/ml after radical prostatectomy. In modern practice, SRT is delivered at lower PSA values. OBJECTIVE To train and externally validate a machine learning model to predict the risk of distant metastasis at 5 yr in a contemporary cohort of patients receiving SRT. DESIGN, SETTING, AND PARTICIPANTS We trained a machine learning model on data from 2418 patients treated with SRT at one institution, with a median PSA value of 0.27 ng/ml. External validation was done in 475 patients treated at two different institutions. Patients with cM1, pN1, or pT4 disease were excluded, as were patients with PSA >2 ng/ml or PSA 0, and patients with radiation dose <60 or ≥80 Gy. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS Model performance was assessed using calibration and time-dependent area under the receiver operating curve (tAUC). RESULTS AND LIMITATIONS Our model had better calibration and showed improved discrimination (tAUC = 0.72) compared with the Tendulkar model (tAUC = 0.60, p < 0.001). The main limitations of this study are its retrospective design and lack of validation on patients who received hormone therapy. CONCLUSIONS The updated model can be used to provide more individualized risk assessments to patients treated with SRT at low PSA values, improving decision-making. PATIENT SUMMARY Up to 40% of patients with prostate cancer may develop biochemical recurrence after surgery, with salvage radiation therapy as the only potentially curative option. We trained and validated a machine learning model using clinical and surgical data to predict a patient's risk of distant metastasis at 5 yr after treatment. Our model outperformed the reference tool and can improve clinical decision-making by providing more personalized risk assessment.
Collapse
Affiliation(s)
- Ali Sabbagh
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
| | - Derya Tilki
- Department of Urology, University Hospital Hamburg-Eppendorf, Hamburg, Germany; Martini-Klinik Prostate Cancer Center, University Hospital-Hamburg-Eppendorf, Hamburg, Germany
| | - Jean Feng
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
| | - Hartwig Huland
- Martini-Klinik Prostate Cancer Center, University Hospital-Hamburg-Eppendorf, Hamburg, Germany
| | - Markus Graefen
- Martini-Klinik Prostate Cancer Center, University Hospital-Hamburg-Eppendorf, Hamburg, Germany
| | - Thomas Wiegel
- Department of Radio Oncology, University Hospital Ulm, Ulm, Germany
| | - Dirk Böhmer
- Department of Radiation Oncology, Charité University Hospital, Berlin, Germany
| | - Julian C Hong
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
| | - Gilmer Valdes
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
| | - Janet E Cowan
- Department of Urology, University of California San Francisco, San Francisco, CA, USA
| | - Matthew Cooperberg
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA; Department of Urology, University of California San Francisco, San Francisco, CA, USA
| | - Felix Y Feng
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA; Department of Urology, University of California San Francisco, San Francisco, CA, USA
| | | | - Mohamed Shelan
- Department of Radiation Oncology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Anthony V D'Amico
- Department of Radiation Oncology, Brigham and Women's Hospital and Dana Farber Cancer Institute, Boston, MA, USA
| | - Peter R Carroll
- Department of Urology, University of California San Francisco, San Francisco, CA, USA
| | - Osama Mohamad
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA; Department of Urology, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
167
|
Rowe IA. Prediction of outcomes in patients with acute variceal bleeding. Hepatology 2024; 79:15-17. [PMID: 37607729 DOI: 10.1097/hep.0000000000000571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 07/28/2023] [Indexed: 08/24/2023]
Affiliation(s)
- Ian A Rowe
- Leeds Institute for Medical Research, University of Leeds & Leeds Liver Unit, St James's University Hospital, Leeds, UK
| |
Collapse
|
168
|
Drebin HM, Hosein S, Kurtansky NR, Nadelmann E, Moy AP, Ariyan CE, Bello DM, Brady MS, Coit DG, Marchetti MA, Bartlett EK. Clinical Utility of Melanoma Sentinel Lymph Node Biopsy Nomograms. J Am Coll Surg 2024; 238:23-31. [PMID: 37870230 DOI: 10.1097/xcs.0000000000000886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2023]
Abstract
BACKGROUND For patients with melanoma, the decision to perform sentinel lymph node biopsy (SLNB) is based on the estimated risk of lymph node metastasis. We assessed 3 melanoma SLNB risk-prediction models' statistical performance and their ability to improve clinical decision making (clinical utility) on a cohort of melanoma SLNB cases. STUDY DESIGN Melanoma patients undergoing SLNB at a single center from 2003 to 2021 were identified. The predicted probabilities of sentinel lymph node positivity using the Melanoma Institute of Australia, Memorial Sloan Kettering Cancer Center (MSK), and Friedman nomograms were calculated. Receiver operating characteristic and calibration curves were generated. Clinical utility was assessed via decision curve analysis, calculating the net SLNBs that could have been avoided had a given model guided selection at different risk thresholds. RESULTS Of 2,464 melanoma cases that underwent SLNB, 567 (23.0%) had a positive sentinel lymph node. The areas under the receiver operating characteristic curves for the Melanoma Institute of Australia, MSK, and Friedman models were 0.726 (95% CI, 0.702 to 0.750), 0.720 (95% CI, 0.697 to 0.744), and 0.721 (95% CI, 0.699 to 0.744), respectively. For all models, calibration was best at predicted positivity rates below 30%. The MSK model underpredicted risk. At a 10% risk threshold, only the Friedman model would correctly avoid a net of 6.2 SLNBs per 100 patients. The other models did not reduce net avoidable SLNBs at risk thresholds of ≤10%. CONCLUSIONS The tested nomograms had comparable performance in our cohort. The only model that achieved clinical utility at risk thresholds of ≤10% was the Friedman model.
Collapse
Affiliation(s)
- Harrison M Drebin
- From the Gastric and Mixed Tumor Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY (Drebin, Ariyan, Bello, Brady, Coit, Bartlett)
| | - Sharif Hosein
- the Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY (Hosein, Kurtansky, Nadelmann, Marchetti)
| | - Nicholas R Kurtansky
- the Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY (Hosein, Kurtansky, Nadelmann, Marchetti)
| | - Emily Nadelmann
- the Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY (Hosein, Kurtansky, Nadelmann, Marchetti)
| | - Andrea P Moy
- the Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY (Moy)
| | - Charlotte E Ariyan
- From the Gastric and Mixed Tumor Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY (Drebin, Ariyan, Bello, Brady, Coit, Bartlett)
| | - Danielle M Bello
- From the Gastric and Mixed Tumor Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY (Drebin, Ariyan, Bello, Brady, Coit, Bartlett)
| | - Mary S Brady
- From the Gastric and Mixed Tumor Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY (Drebin, Ariyan, Bello, Brady, Coit, Bartlett)
| | - Daniel G Coit
- From the Gastric and Mixed Tumor Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY (Drebin, Ariyan, Bello, Brady, Coit, Bartlett)
| | - Michael A Marchetti
- the Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY (Hosein, Kurtansky, Nadelmann, Marchetti)
- Skagit Regional Health, Mt Vernon, WA (Marchetti)
| | - Edmund K Bartlett
- From the Gastric and Mixed Tumor Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY (Drebin, Ariyan, Bello, Brady, Coit, Bartlett)
| |
Collapse
|
169
|
Kyrimi E, Stoner RS, Perkins ZB, Pisirir E, Wohlgemut JM, Marsh W, Tai NRM. Updating and recalibrating causal probabilistic models on a new target population. J Biomed Inform 2024; 149:104572. [PMID: 38081566 DOI: 10.1016/j.jbi.2023.104572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 10/13/2023] [Accepted: 12/08/2023] [Indexed: 12/17/2023]
Abstract
OBJECTIVE Very often the performance of a Bayesian Network (BN) is affected when applied to a new target population. This is mainly because of differences in population characteristics. External validation of the model performance on different populations is a standard approach to test model's generalisability. However, a good predictive performance is not enough to show that the model represents the unique population characteristics and can be adopted in the new environment. METHODS In this paper, we present a methodology for updating and recalibrating developed BN models - both their structure and parameters - to better account for the characteristics of the target population. Attention has been given on incorporating expert knowledge and recalibrating latent variables, which are usually omitted from data-driven models. RESULTS The method is successfully applied to a clinical case study about the prediction of trauma-induced coagulopathy, where a BN has already been developed for civilian trauma patients and now it is recalibrated on combat casualties. CONCLUSION The methodology proposed in this study is important for developing credible models that can demonstrate a good predictive performance when applied to a target population. Another advantage of the proposed methodology is that it is not limited to data-driven techniques and shows how expert knowledge can also be used when updating and recalibrating the model.
Collapse
Affiliation(s)
- Evangelia Kyrimi
- Department of Electronic Engineering and Computer Science, Queen Mary University of London, United Kingdom. https://twitter.com/@LinaKyrimi
| | - Rebecca S Stoner
- Centre for Trauma Sciences, Blizard Institute, Queen Mary University of London, London, United Kingdom; Royal London Hospital, Barts Health NHS Trust, London, United Kingdom
| | - Zane B Perkins
- Centre for Trauma Sciences, Blizard Institute, Queen Mary University of London, London, United Kingdom; Royal London Hospital, Barts Health NHS Trust, London, United Kingdom
| | - Erhan Pisirir
- Department of Electronic Engineering and Computer Science, Queen Mary University of London, United Kingdom
| | - Jared M Wohlgemut
- Centre for Trauma Sciences, Blizard Institute, Queen Mary University of London, London, United Kingdom; Royal London Hospital, Barts Health NHS Trust, London, United Kingdom
| | - William Marsh
- Department of Electronic Engineering and Computer Science, Queen Mary University of London, United Kingdom
| | - Nigel R M Tai
- Centre for Trauma Sciences, Blizard Institute, Queen Mary University of London, London, United Kingdom; Royal London Hospital, Barts Health NHS Trust, London, United Kingdom; Royal Centre for Defence Medicine, Birmingham, United Kingdom
| |
Collapse
|
170
|
Cheng WHG, Dong W, Tse ETY, Chan L, Wong CKH, Chin WY, Bedford LE, Ko WK, Chao DVK, Tan KCB, Lam CLK. Recalibration of a Non-Laboratory-Based Risk Model to Estimate Pre-Diabetes/Diabetes Mellitus Risk in Primary Care in Hong Kong. J Prim Care Community Health 2024; 15:21501319241241188. [PMID: 38577788 PMCID: PMC10996357 DOI: 10.1177/21501319241241188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 03/01/2024] [Accepted: 03/06/2024] [Indexed: 04/06/2024] Open
Abstract
INTRODUCTION/OBJECTIVES A non-laboratory-based pre-diabetes/diabetes mellitus (pre-DM/DM) risk prediction model developed from the Hong Kong Chinese population showed good external discrimination in a primary care (PC) population, but the estimated risk level was significantly lower than the observed incidence, indicating poor calibration. This study explored whether recalibrating/updating methods could improve the model's accuracy in estimating individuals' risks in PC. METHODS We performed a secondary analysis on the model's predictors and blood test results of 919 Chinese adults with no prior DM diagnosis recruited from PC clinics from April 2021 to January 2022 in HK. The dataset was randomly split in half into a training set and a test set. The model was recalibrated/updated based on a seven-step methodology, including model recalibrating, revising and extending methods. The primary outcome was the calibration of the recalibrated/updated models, indicated by calibration plots. The models' discrimination, indicated by the area under the receiver operating characteristic curves (AUC-ROC), was also evaluated. RESULTS Recalibrating the model's regression constant, with no change to the predictors' coefficients, improved the model's accuracy (calibration plot intercept: -0.01, slope: 0.69). More extensive methods could not improve any further. All recalibrated/updated models had similar AUC-ROCs to the original model. CONCLUSION The simple recalibration method can adapt the HK Chinese pre-DM/DM model to PC populations with different pre-test probabilities. The recalibrated model can be used as a first-step screening tool and as a measure to monitor changes in pre-DM/DM risks over time or after interventions.
Collapse
Affiliation(s)
| | - Weinan Dong
- The University of Hong Kong, Hong Kong SAR, China
| | - Emily T. Y. Tse
- The University of Hong Kong, Hong Kong SAR, China
- The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
| | - Linda Chan
- The University of Hong Kong, Hong Kong SAR, China
- The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
| | - Carlos K. H. Wong
- The University of Hong Kong, Hong Kong SAR, China
- Hong Kong Science and Technology Park, Sha Tin, Hong Kong SAR, China
| | - Weng Y. Chin
- The University of Hong Kong, Hong Kong SAR, China
| | | | - Wai Kit Ko
- Hospital Authority, Hong Kong SAR, China
| | | | | | - Cindy L. K. Lam
- The University of Hong Kong, Hong Kong SAR, China
- The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
| |
Collapse
|
171
|
Diguisto C, Morgan AS, Foix L'Hélias L, Pierrat V, Ancel PY, Cohen JF, Goffinet F. Five-year outcomes for extremely preterm babies with active perinatal management: A clinical prediction model. BJOG 2024; 131:151-156. [PMID: 37592874 DOI: 10.1111/1471-0528.17633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 07/19/2023] [Accepted: 07/23/2023] [Indexed: 08/19/2023]
Abstract
OBJECTIVE To develop and validate a clinical prediction model for outcomes at 5 years of age for children born extremely preterm and receiving active perinatal management. DESIGN The EPIPAGE-2 national prospective cohort. SETTING France, 2011. POPULATION Live-born neonates between 24+0 and 26+6 weeks of gestation who received active perinatal management (i.e. birth in a tertiary-level hospital, with antenatal steroids and resuscitation at birth). METHODS A prediction model using logistic modelling, including gestational age, small-for gestational-age (SGA) status and sex, was developed. Model performance was assessed through calibration and discrimination, with bootstrap internal validation. MAIN OUTCOME MEASURES Survival without moderate or severe neurodevelopmental disability (NDD) at 5 years. RESULTS Among the 557 neonates included, 401 (72%) survived to 5 years, of which 59% survived without NDD (95% CI 54% to 63%). Predicted rates of survival without NDD ranged from 45% (95% CI 33% to 57%), to 56% (95% CI 49% to 64%) to 64% (95% CI 57% to 70%) for neonates born at 24, 25 and 26 weeks of gestation, respectively. Predicted rates of survival without NDD were 47% (95% CI 18% to 76%) and 62% (95% CI 49% to 76%) for SGA and non-SGA children, respectively. The model showed good calibration (calibration slope 0.85, 95% CI 0.54 to 1.16; calibration-in-the-large -0.0123, 95% CI -0.25 to 0.23) and modest discrimination (C-index 0.59, 95% CI 0.53 to 0.65). CONCLUSIONS A simple prediction model using three factors easily known antenatally may help doctors and families in their decision-making for extremely preterm neonates receiving active perinatal management.
Collapse
Affiliation(s)
- Caroline Diguisto
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Maternité Olympe de Gouges Centre, Hospitalier Regional Universitaire Tours, Université de Tours, Tours, France
| | - Andrei Scott Morgan
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Department of Neonatal Medicine, Hôpital Nord, Association Publique Hôpitaux de Marseille, Marseille, France
| | - Laurence Foix L'Hélias
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Department of Neonatal Paediatrics, Trousseau Hospital, APHP, Sorbonne University, Paris, France
| | - Veronique Pierrat
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Department of Neonatology, CHI Créteil, Créteil, France
| | - Pierre-Yves Ancel
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Clinical Investigation Centre P1419, Assistance Publique-Hôpitaux de Paris, Paris, France
| | - Jérémie F Cohen
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Department of General Paediatrics and Paediatric Infectious Diseases, Hôpital Necker-Enfants Malades, APHP, Université Paris Cité, Paris, France
| | - Francois Goffinet
- Université Paris Cité, Université Sorbonne Paris Nord, Inserm, INRAE, CRESS U1153, EPOPé, Paris, France
- Maternité Port Royal, Université Paris Cité, Cochin-Broca-Hôtel Dieu Hospitals, Assistance Publique-Hôpitaux de Paris, DHU Risk in Pregnancy, Paris, France
| |
Collapse
|
172
|
Huynh TTM, Dale E, Falk RS, Hellebust TP, Astrup GL, Malinen E, Edin NFJ, Bjordal K, Herlofson BB, Kiserud CE, Helland Å, Amdal CD. Radiation-induced long-term dysphagia in survivors of head and neck cancer and association with dose-volume parameters. Radiother Oncol 2024; 190:110044. [PMID: 38061420 DOI: 10.1016/j.radonc.2023.110044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 11/19/2023] [Accepted: 11/29/2023] [Indexed: 02/20/2024]
Abstract
BACKGROUND Although dysphagia is a common side effect after radiotherapy (RT) of head and neck cancer (HNC), data on long-term dysphagia is scarce. We aimed to 1) compare radiation dose parameters in HNC survivors with and without dysphagia, 2) investigate factors associated with long-term dysphagia and its possible impact on health-related quality of life (HRQoL), and 3) investigate how our data agree with existing NTCP models. METHODS This cross-sectional study conducted in 2018-2020, included HNC survivors treated in 2007-2013. Participants attended a one-day examination in hospital and filled in patient questionnaires. Dysphagia was measured with the EORTC QLQ-H&N35 swallowing scale. Toxicity was scored with CTCAE v.4. We contoured swallowing organs at risk (SWOAR) on RT plans, calculated dose-volume histograms (DVHs), performed logistic regression analyses and tested our data in established NTCP models. RESULTS Of the 239 participants, 75 (31%) reported dysphagia. Compared to survivors without dysphagia, this group had reduced HRQoL and the DVHs for infrahyoid SWOAR were significantly shifted to the right. Long-term dysphagia was associated with age (OR 1.07, 95% CI 1.03-1.10), female sex (OR 2.75, 95% CI 1.45-5.21), and mean dose to middle pharyngeal constrictor muscle (MD-MPCM) (OR 1.06, 95% CI 1.03-1.09). NTCP models overall underestimated the risk of long-term dysphagia. CONCLUSIONS Long-term dysphagia was associated with higher age, being female, and high MD-MPCM. Doses to distally located SWOAR seemed to be risk factors. Existing NTCP models do not sufficiently predict long-term dysphagia. Further efforts are needed to reduce the prevalence and consequences of this late effect.
Collapse
Affiliation(s)
- Thuy-Tien Maria Huynh
- Faculty of Medicine, University of Oslo, Oslo, Norway; Department of Oncology, Oslo University Hospital, Oslo, Norway.
| | - Einar Dale
- Department of Oncology, Oslo University Hospital, Oslo, Norway
| | | | - Taran Paulsen Hellebust
- Department of Physics, University of Oslo, Oslo, Norway; Department of Medical Physics, Oslo University Hospital, Oslo, Norway
| | | | - Eirik Malinen
- Department of Physics, University of Oslo, Oslo, Norway; Department of Medical Physics, Oslo University Hospital, Oslo, Norway
| | | | - Kristin Bjordal
- Faculty of Medicine, University of Oslo, Oslo, Norway; Research support services, Oslo University Hospital, Oslo, Norway
| | - Bente Brokstad Herlofson
- Faculty of Dentistry, University of Oslo, Oslo, Norway; Department of Otorhinolaryngology, Head and Neck Surgery, Oslo University Hospital, Oslo, Norway
| | | | - Åslaug Helland
- Faculty of Medicine, University of Oslo, Oslo, Norway; Department of Oncology, Oslo University Hospital, Oslo, Norway
| | - Cecilie Delphin Amdal
- Department of Oncology, Oslo University Hospital, Oslo, Norway; Research support services, Oslo University Hospital, Oslo, Norway
| |
Collapse
|
173
|
Roepke RML, Besen BAMP, Daltro-Oliveira R, Guazzelli RM, Bassi E, Salluh JIF, Damous SHB, Utiyama EM, Malbouisson LMS. Predictive Performance for Hospital Mortality of SAPS 3, SOFA, ISS, and New ISS in Critically Ill Trauma Patients: A Validation Cohort Study. J Intensive Care Med 2024; 39:44-51. [PMID: 37448331 DOI: 10.1177/08850666231188051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/15/2023]
Abstract
Background: It is not known whether anatomical scores perform better than general critical care scores for trauma patients admitted to the intensive care unit (ICU). We compare the predictive performance for hospital mortality of general critical care scores (SAPS 3 and SOFA) with anatomical injury-based scores (Injury Severity Score [ISS] and New ISS [NISS]). Methods: Retrospective cohort study of patients admitted to a specialized trauma ICU from a tertiary hospital in São Paulo, Brazil between May, 2012 and January, 2016. We retrieved data from the ICU database for critical care scores and calculated ISS and NISS from chart data and whole body computed tomography results. We compared the predictive performance for hospital mortality of each model through discrimination, calibration, and decision-curve analysis. Results: The sample comprised 1053 victims of trauma admitted to the ICU, with 84.2% male patients and mean age of 40 (±18) years. Main injury mechanism was blunt trauma (90.7%). Traumatic brain injury was present in 67.8% of patients; 43.3% with severe TBI. At the time of ICU admission, 846 patients (80.3%) were on mechanical ventilation and 644 (64.3%) on vasoactive drugs. Hospital mortality was 23.8% (251). Median SAPS 3 was 41; median maximum SOFA within 24 h of admission, 7; ISS, 29; and NISS, 41. AUROCs (95% CI) were: SAPS 3 = 0.786 (0.756-0.817), SOFA = 0.807 (0.778-0.837), ISS = 0.616 (0.577-0.656), and NISS = 0.689 (0.649-0.729). In pairwise comparisons, SAPS 3 and SOFA did not differ, while both outperformed the anatomical scores (p < .001). Maximum SOFA within 24 h of admission presented the best calibration and net benefit in decision-curve analysis. Conclusions: Trauma-specific anatomical scores have fair performance in critically ill trauma patients and are outperformed by SAPS 3 and SOFA. Illness severity is best characterized by organ dysfunction and physiological variables than anatomical injuries.
Collapse
Affiliation(s)
- Roberta Muriel Longo Roepke
- Trauma and Acute Care Surgery ICU, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
- Intensive Care Unit, AC Camargo Cancer Center, São Paulo, SP, Brazil
| | - Bruno Adler Maccagnan Pinheiro Besen
- Intensive Care Unit, AC Camargo Cancer Center, São Paulo, SP, Brazil
- Medical ICU, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Renato Daltro-Oliveira
- Medical ICU, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | | | - Estevão Bassi
- Trauma and Acute Care Surgery ICU, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | | | - Sérgio Henrique Bastos Damous
- Trauma and Acute Care Surgery ICU, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Edivaldo Massazo Utiyama
- Trauma and Acute Care Surgery ICU, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Luiz Marcelo Sá Malbouisson
- Surgical ICU, Anesthesiology Division, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| |
Collapse
|
174
|
Lee AS, Valero C, Yoo SK, Vos JL, Chowell D, Morris LGT. Validation of a Machine Learning Model to Predict Immunotherapy Response in Head and Neck Squamous Cell Carcinoma. Cancers (Basel) 2023; 16:175. [PMID: 38201602 PMCID: PMC10778506 DOI: 10.3390/cancers16010175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 12/22/2023] [Accepted: 12/27/2023] [Indexed: 01/12/2024] Open
Abstract
Head and neck squamous-cell carcinoma (HNSCC) is a disease with a generally poor prognosis; half of treated patients eventually develop recurrent and/or metastatic (R/M) disease. Patients with R/M HNSCC generally have incurable disease with a median survival of 10 to 15 months. Although immune-checkpoint blockade (ICB) has improved outcomes in patients with R/M HNSCC, identifying patients who are likely to benefit from ICB remains a challenge. Biomarkers in current clinical use include tumor mutational burden and immunohistochemistry for programmed death-ligand 1, both of which have only modest predictive power. Machine learning (ML) has the potential to aid in clinical decision-making as an approach to estimate a tumor's likelihood of response or a patient's likelihood of experiencing clinical benefit from therapies such as ICB. Previously, we described a random forest ML model that had value in predicting ICB response using 11 or 16 clinical, laboratory, and genomic features in a pan-cancer development cohort. However, its applicability to certain cancer types, such as HNSCC, has been unknown, due to a lack of cancer-type-specific validation. Here, we present the first validation of a random forest ML tool to predict the likelihood of ICB response in patients with R/M HNSCC. The tool had adequate predictive power for tumor response (area under the receiver operating characteristic curve = 0.65) and was able to stratify patients by overall (HR = 0.53 [95% CI 0.29-0.99], p = 0.045) and progression-free (HR = 0.49 [95% CI 0.27-0.87], p = 0.016) survival. The overall accuracy was 0.72. Our study validates an ML predictor in HNSCC, demonstrating promising performance in a novel cohort of patients. Further studies are needed to validate the generalizability of this algorithm in larger patient samples from additional multi-institutional contexts.
Collapse
Affiliation(s)
- Andrew Sangho Lee
- Head and Neck Service and Immunogenomic Oncology Platform, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; (A.S.L.); (C.V.); (J.L.V.)
| | - Cristina Valero
- Head and Neck Service and Immunogenomic Oncology Platform, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; (A.S.L.); (C.V.); (J.L.V.)
| | - Seong-keun Yoo
- Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; (S.-k.Y.); (D.C.)
| | - Joris L. Vos
- Head and Neck Service and Immunogenomic Oncology Platform, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; (A.S.L.); (C.V.); (J.L.V.)
| | - Diego Chowell
- Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; (S.-k.Y.); (D.C.)
| | - Luc G. T. Morris
- Head and Neck Service and Immunogenomic Oncology Platform, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA; (A.S.L.); (C.V.); (J.L.V.)
| |
Collapse
|
175
|
Eickelberg G, Sanchez-Pinto LN, Kline AS, Luo Y. Transportability of bacterial infection prediction models for critically ill patients. J Am Med Inform Assoc 2023; 31:98-108. [PMID: 37647884 PMCID: PMC10746321 DOI: 10.1093/jamia/ocad174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 07/20/2023] [Accepted: 08/16/2023] [Indexed: 09/01/2023] Open
Abstract
OBJECTIVE Bacterial infections (BIs) are common, costly, and potentially life-threatening in critically ill patients. Patients with suspected BIs may require empiric multidrug antibiotic regimens and therefore potentially be exposed to prolonged and unnecessary antibiotics. We previously developed a BI risk model to augment practices and help shorten the duration of unnecessary antibiotics to improve patient outcomes. Here, we have performed a transportability assessment of this BI risk model in 2 tertiary intensive care unit (ICU) settings and a community ICU setting. We additionally explored how simple multisite learning techniques impacted model transportability. METHODS Patients suspected of having a community-acquired BI were identified in 3 datasets: Medical Information Mart for Intensive Care III (MIMIC), Northwestern Medicine Tertiary (NM-T) ICUs, and NM "community-based" ICUs. ICU encounters from MIMIC and NM-T datasets were split into 70/30 train and test sets. Models developed on training data were evaluated against the NM-T and MIMIC test sets, as well as NM community validation data. RESULTS During internal validations, models achieved AUROCs of 0.78 (MIMIC) and 0.81 (NM-T) and were well calibrated. In the external community ICU validation, the NM-T model had robust transportability (AUROC 0.81) while the MIMIC model transported less favorably (AUROC 0.74), likely due to case-mix differences. Multisite learning provided no significant discrimination benefit in internal validation studies but offered more stability during transport across all evaluation datasets. DISCUSSION These results suggest that our BI risk models maintain predictive utility when transported to external cohorts. CONCLUSION Our findings highlight the importance of performing external model validation on myriad clinically relevant populations prior to implementation.
Collapse
Affiliation(s)
- Garrett Eickelberg
- Department of Preventive Medicine (Health & Biomedical Informatics), Feinberg School of Medicine, Chicago, IL 60611, United States
| | - Lazaro Nelson Sanchez-Pinto
- Department of Preventive Medicine (Health & Biomedical Informatics), Feinberg School of Medicine, Chicago, IL 60611, United States
- Departments of Pediatrics (Critical Care), Chicago, IL 60611, United States
| | - Adrienne Sarah Kline
- Department of Preventive Medicine (Health & Biomedical Informatics), Feinberg School of Medicine, Chicago, IL 60611, United States
| | - Yuan Luo
- Department of Preventive Medicine (Health & Biomedical Informatics), Feinberg School of Medicine, Chicago, IL 60611, United States
| |
Collapse
|
176
|
Marko M, Goyal M, Ospel JM, Singh N, Venema E, Nogueira RG, Demchuk AM, McTaggart RA, Poppe AY, Menon BK, Zerna C, Mulder M, Dippel DW, Lingsma HF, Roozenbeek B, Tymianski M, Hill MD. Predicting outcome in acute stroke with large vessel occlusion-application and validation of MR PREDICTS in the ESCAPE-NA1 population. Interv Neuroradiol 2023:15910199231221491. [PMID: 38115793 DOI: 10.1177/15910199231221491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2023] Open
Abstract
BACKGROUND Predicting outcome after endovascular treatment for acute ischemic stroke is challenging. We aim to investigate differences between predicted and observed outcomes in patients with acute ischemic stroke treated with endovascular treatment and to evaluate the performance of a validated outcome prediction score. PATIENTS AND METHODS MR PREDICTS is an outcome prediction tool based on a logistic regression model designed to predict the treatment benefit of endovascular treatment based on the MR CLEAN and HERMES populations. ESCAPE-NA1 is a randomized trial of nerinetide vs. placebo in patients with acute stroke and large vessel occlusion. We applied MR PREDICTS to patients in the control arm of ESCAPE-NA1. Model performance was assessed by calculating its discriminative ability and calibration. RESULTS Overall, 556/1105 patients (50.3%) in the ESCAPE-NA1-trial were randomized to the control arm, 435/556 (78.2%) were treated within 6 h of symptom onset. Good outcome (modified Rankin scale 0-2) at 3 months was achieved in 275/435 patients (63.2%), the predicted probability of good outcome was 52.5%. Baseline characteristics were similar in the study and model derivation cohort except for age (ESCAPE-NA1: mean: 70 y vs. HERMES: 66 y), hypertension (72% vs. 57%), and collaterals (good collaterals, 15% vs. 44%). Compared to HERMES we observed higher rates of successful reperfusion (TICI 2b-3, ESCAPE-NA1: 87% vs. HERMES: 71%) and faster times from symptom onset to reperfusion (median: 201 min vs. 286 min). Model performance was good, indicated by a c-statistic of 0.76 (95%confidence interval: 0.71-0.81). CONCLUSION Outcome-prediction using models created from HERMES data, based on information available in the emergency department underestimated the actual outcome in patients with acute ischemic stroke and large vessel occlusion receiving endovascular treatment despite overall good model performance, which might be explained by differences in quality of and time to reperfusion. These findings underline the importance of timely and successful reperfusion for functional outcomes in acute stroke patients.
Collapse
Affiliation(s)
- Martha Marko
- Department of Neurology, Medical University of Vienna, Wien, Austria
| | - Mayank Goyal
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Canada
| | - Johanna M Ospel
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Canada
| | - Nishita Singh
- Department of Clinical Neurosciences, Max Rady College of Medicine, University of Manitoba, Winnipeg, Canada
| | - Esmee Venema
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, the Netherlands
- Department of Emergency Medicine, Erasmus MC University Medical Center, Rotterdam, the Netherlands
| | - Raul G Nogueira
- Emory University School of Medicine, Grady Memorial Hospital, Atlanta, USA
| | - Andrew M Demchuk
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Canada
| | - Ryan A McTaggart
- Warren Alpert School of Medicine, Brown University, Providence, RI, USA
| | - Alexandre Y Poppe
- Department of Medicine (Neurology), Centre Hospitalier de l'Université de Montréal, QC, Calgary, Canada
| | - Bijoy K Menon
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
| | - Charlotte Zerna
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Canada
| | - Maxim Mulder
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, the Netherlands
- Department of Radiology & Nuclear Medicine, Erasmus MC University Medical Center, Rotterdam, the Netherlands
| | - Diederik Wj Dippel
- Department of Emergency Medicine, Erasmus MC University Medical Center, Rotterdam, the Netherlands
| | - Hester F Lingsma
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, the Netherlands
| | - Bob Roozenbeek
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, the Netherlands
- Department of Radiology & Nuclear Medicine, Erasmus MC University Medical Center, Rotterdam, the Netherlands
| | | | - Michael D Hill
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Canada
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, Canada
| |
Collapse
|
177
|
Miller RJH, Gransar H, Rozanski A, Dey D, Al‐Mallah M, Chow BJW, Kaufmann PA, Cademartiri F, Maffei E, Han D, Slomka PJ, Berman DS. Simplified Approach to Predicting Obstructive Coronary Disease With Integration of Coronary Calcium: Development and External Validation. J Am Heart Assoc 2023; 12:e031601. [PMID: 38108259 PMCID: PMC10863788 DOI: 10.1161/jaha.123.031601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 11/13/2023] [Indexed: 12/19/2023]
Abstract
BACKGROUND The Diamond-Forrester model was used extensively to predict obstructive coronary artery disease (CAD) but overestimates probability in current populations. Coronary artery calcium (CAC) is a useful marker of CAD, which is not routinely integrated with other features. We derived simple likelihood tables, integrating CAC with age, sex, and cardiac chest pain to predict obstructive CAD. METHODS AND RESULTS The training population included patients from 3 multinational sites (n=2055), with 2 sites for external testing (n=3321). We determined associations between age, sex, cardiac chest pain, and CAC with the presence of obstructive CAD, defined as any stenosis ≥50% on coronary computed tomography angiography. Prediction performance was assessed using area under the receiver-operating characteristic curves (AUCs) and compared with the CAD Consortium models with and without CAC, which require detailed calculations, and the updated Diamond-Forrester model. In external testing, the proposed likelihood tables had higher AUC (0.875 [95% CI, 0.862-0.889]) than the CAD Consortium clinical+CAC score (AUC, 0.868 [95% CI, 0.855-0.881]; P=0.030) and the updated Diamond-Forrester model (AUC, 0.679 [95% CI, 0.658-0.699]; P<0.001). The calibration for the likelihood tables was better than the CAD Consortium model (Brier score, 0.116 versus 0.121; P=0.005). CONCLUSIONS We have developed and externally validated simple likelihood tables to integrate CAC with age, sex, and cardiac chest pain, demonstrating improved prediction performance compared with other risk models. Our tool affords physicians with the opportunity to rapidly and easily integrate a small number of important features to estimate a patient's likelihood of obstructive CAD as an aid to clinical management.
Collapse
Affiliation(s)
- Robert J. H. Miller
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
- Libin Cardiovascular Institute of AlbertaUniversity of CalgaryCalgaryAlbertaCanada
| | - Heidi Gransar
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
| | - Alan Rozanski
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
- Division of Cardiology and Department of MedicineMount Sinai Morningside HospitalMount Sinai Heart and the Icahn School of Medicine at Mount SinaiNew YorkNY
| | - Damini Dey
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
| | - Mouaz Al‐Mallah
- Houston Methodist DeBakey Heart and Vascular CenterHoustonTX
| | - Benjamin J. W. Chow
- Departments of Medicine (Cardiology and Nuclear Medicine) and RadiologyUniversity of Ottawa Heart InstituteOttawaOntarioCanada
| | - Philipp A. Kaufmann
- Department of Nuclear MedicineUniversity Hospital Zurich, University of ZurichZurichSwitzerland
| | | | - Erica Maffei
- Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) SYNLAB SDNNaplesItaly
| | - Donghee Han
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
| | - Piotr J. Slomka
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
| | - Daniel S. Berman
- Departments of Medicine (Division of Artificial Intelligence in Medicine)Imaging and Biomedical SciencesCedars‐Sinai Medical CenterLos AngelesCA
| |
Collapse
|
178
|
Riley RD, Pate A, Dhiman P, Archer L, Martin GP, Collins GS. Clinical prediction models and the multiverse of madness. BMC Med 2023; 21:502. [PMID: 38110939 PMCID: PMC10729337 DOI: 10.1186/s12916-023-03212-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 12/05/2023] [Indexed: 12/20/2023] Open
Abstract
BACKGROUND Each year, thousands of clinical prediction models are developed to make predictions (e.g. estimated risk) to inform individual diagnosis and prognosis in healthcare. However, most are not reliable for use in clinical practice. MAIN BODY We discuss how the creation of a prediction model (e.g. using regression or machine learning methods) is dependent on the sample and size of data used to develop it-were a different sample of the same size used from the same overarching population, the developed model could be very different even when the same model development methods are used. In other words, for each model created, there exists a multiverse of other potential models for that sample size and, crucially, an individual's predicted value (e.g. estimated risk) may vary greatly across this multiverse. The more an individual's prediction varies across the multiverse, the greater the instability. We show how small development datasets lead to more different models in the multiverse, often with vastly unstable individual predictions, and explain how this can be exposed by using bootstrapping and presenting instability plots. We recommend healthcare researchers seek to use large model development datasets to reduce instability concerns. This is especially important to ensure reliability across subgroups and improve model fairness in practice. CONCLUSIONS Instability is concerning as an individual's predicted value is used to guide their counselling, resource prioritisation, and clinical decision making. If different samples lead to different models with very different predictions for the same individual, then this should cast doubt into using a particular model for that individual. Therefore, visualising, quantifying and reporting the instability in individual-level predictions is essential when proposing a new model.
Collapse
Affiliation(s)
- Richard D Riley
- College of Medical and Dental Sciences, Institute of Applied Health Research, University of Birmingham, Birmingham, B15 2TT, UK.
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK.
| | - Alexander Pate
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Paula Dhiman
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
| | - Lucinda Archer
- College of Medical and Dental Sciences, Institute of Applied Health Research, University of Birmingham, Birmingham, B15 2TT, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Glen P Martin
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
| |
Collapse
|
179
|
Gerry S, Collins GS. A prediction model for venous thromboembolism in the intensive care unit: flawed methods may lead to inaccurate predictions. Crit Care 2023; 27:495. [PMID: 38111055 PMCID: PMC10726531 DOI: 10.1186/s13054-023-04778-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 12/12/2023] [Indexed: 12/20/2023] Open
Affiliation(s)
- Stephen Gerry
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK.
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK
| |
Collapse
|
180
|
Boukebous B, Petrie L, Baker JF. Keeping It Simple: Developing a Prognostic Tool for Spinal Epidural Abscess. Global Spine J 2023:21925682231221497. [PMID: 38105544 DOI: 10.1177/21925682231221497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/19/2023] Open
Abstract
STUDY DESIGN A retrospective study. OBJECTIVE To develop a prognostic score for mortality and treatment failure in Spinal epidural abscess (SEA), based on simplicity and multidimensional assessment principles. METHODS One-hundred-fifty patients were reviewed. Variables assessed included comorbidities, functional status, clinical presentation, Frankel classification, and biochemical and radiological parameters. The main outcomes were the 90-day mortality and treatment failure, corresponding to any intensification of the initial treatment plan. Variables were sorted out with a factorial analysis. Logistic regressions were performed, and the new score was derived from the coefficients. ROC curves with Area Under Curve, calibration plots, and cross-validation were performed. RESULTS Forty-three patients (29%) had treatment failure, and 15 died (10%) by 90 days. Factorization created 3 groups: Comorbidities (C), Severity (S), and Function (F). For 90-day mortality, Odds ratios were 1.20 (P = .0002), 1.15, (P = .03), 1.36, (P < 10-4) for C, S, F, respectively. The new score 'CSF' had 1 point per item, ranging from zero to 3. OR increased by 1.2/point for 90-day mortality (P < 10-4), AUC was .86. For failures OR increased by 1.15/point (P = .014), AUC was .58, and increased to .64 for patients who survived after 90 days, probably due to competing risks. CONCLUSIONS Comorbidities, Severity, and Function is a new simplistic tool, easy to use in daily practice; its performances were excellent for 90-day mortality, and acceptable for failures. Simple tools are more likely to be adopted into practice. External validation of this technique is desirable.
Collapse
Affiliation(s)
- Baptiste Boukebous
- Department of Orthopaedic Surgery, University of Auckland, Waikato Hospital, Hamilton, New Zealand
- ECAMO team, UMR 1153, CRESS (Centre of Research in Epidemiology and StatisticS), University of Paris Cité, INSERM, Paris, France
| | - Liam Petrie
- Department of Orthopaedic Surgery, University of Auckland, Waikato Hospital, Hamilton, New Zealand
- Department of Surgery, University of Auckland, Waikato Hospital, Hamilton, New Zealand
| | - Joseph F Baker
- Department of Orthopaedic Surgery, University of Auckland, Waikato Hospital, Hamilton, New Zealand
- Department of Surgery, University of Auckland, Waikato Hospital, Hamilton, New Zealand
| |
Collapse
|
181
|
Pitt TM, Hetherington E, Adhikari K, Premji S, Racine N, Tough SC, McDonald S. Developing non-response weights to account for attrition-related bias in a longitudinal pregnancy cohort. BMC Med Res Methodol 2023; 23:295. [PMID: 38097944 PMCID: PMC10722744 DOI: 10.1186/s12874-023-02121-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023] Open
Abstract
BACKGROUND Prospective cohorts may be vulnerable to bias due to attrition. Inverse probability weights have been proposed as a method to help mitigate this bias. The current study used the "All Our Families" longitudinal pregnancy cohort of 3351 maternal-infant pairs and aimed to develop inverse probability weights using logistic regression models to predict study continuation versus drop-out from baseline to the three-year data collection wave. METHODS Two methods of variable selection took place. One method was a knowledge-based a priori variable selection approach, while the second used Least Absolute Shrinkage and Selection Operator (LASSO). The ability of each model to predict continuing participation through discrimination and calibration for both approaches were evaluated by examining area under the receiver operating curve (AUROC) and calibration plots, respectively. Stabilized inverse probability weights were generated using predicted probabilities. Weight performance was assessed using standardized differences of baseline characteristics for those who continue in study and those that do not, with and without weights (unadjusted estimates). RESULTS The a priori and LASSO variable selection method prediction models had good and fair discrimination with AUROC of 0.69 (95% Confidence Interval [CI]: 0.67-0.71) and 0.73 (95% CI: 0.71-0.75), respectively. Calibration plots and non-significant Hosmer-Lemeshow Goodness of Fit Tests indicated that both the a priori (p = 0.329) and LASSO model (p = 0.242) were well-calibrated. Unweighted results indicated large (> 10%) standardized differences in 15 demographic variables (range: 11 - 29%), when comparing those who continued in the study with those that did not. Weights derived from the a priori and LASSO models reduced standardized differences relative to unadjusted estimates, with the largest differences of 13% and 5%, respectively. Additionally, when applying the same LASSO variable selection method to develop weights in future data collection waves, standardized differences remained below 10% for each demographic variable. CONCLUSION The LASSO variable selection approach produced robust weights that addressed non-response bias more than the knowledge-driven approach. These weights can be applied to analyses across multiple longitudinal waves of data collection to reduce bias.
Collapse
Affiliation(s)
- Tona M Pitt
- Department of Paediatrics, University of Calgary, 28 Oki Drive NW, Calgary, T3B 6A8, Canada
- Provincial Population and Public Health, Alberta Health Services, 10301 Southport Rd SW, Calgary, T2W 1S7, Canada
| | - Erin Hetherington
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, 2001 McGill College, Montreal, H3A 1G1, Canada
| | - Kamala Adhikari
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, T2N 4Z6, Canada
- Provincial Population and Public Health, Alberta Health Services, 10301 Southport Rd SW, Calgary, T2W 1S7, Canada
| | - Shainur Premji
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, T2N 4Z6, Canada
- Centre for Health Economics, University of York, Heslington, YO10 5DD, York, UK
| | - Nicole Racine
- School of Psychology, Faculty of Social Sciences, University of Ottawa, 136 Jean- Jacques Lussier, Ottawa, K1N 6N5, Canada
- Children's Hospital of Eastern Ontario Research Institute, 401 Smyth Rd, Ottawa, K1H 5B2, Canada
| | - Suzanne C Tough
- Department of Paediatrics, University of Calgary, 28 Oki Drive NW, Calgary, T3B 6A8, Canada
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, T2N 4Z6, Canada
| | - Sheila McDonald
- Department of Paediatrics, University of Calgary, 28 Oki Drive NW, Calgary, T3B 6A8, Canada.
- Department of Community Health Sciences, University of Calgary, 3280 Hospital Drive NW, Calgary, T2N 4Z6, Canada.
- Provincial Population and Public Health, Alberta Health Services, 10301 Southport Rd SW, Calgary, T2W 1S7, Canada.
| |
Collapse
|
182
|
Hrytsenko Y, Shea B, Elgart M, Kurniansyah N, Lyons G, Morrison AC, Carson AP, Haring B, Mitchel BD, Psaty BM, Jaeger BC, Gu CC, Kooperberg C, Levy D, Lloyd-Jones D, Choi E, Brody JA, Smith JA, Rotter JI, Moll M, Fornage M, Simon N, Castaldi P, Casanova R, Chung RH, Kaplan R, Loos RJ, Kardia SLR, Rich SS, Redline S, Kelly T, O’Connor T, Zhao W, Kim W, Guo X, Der Ida Chen Y, Sofer T. Machine learning models for blood pressure phenotypes combining multiple polygenic risk scores. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.13.23299909. [PMID: 38168328 PMCID: PMC10760279 DOI: 10.1101/2023.12.13.23299909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
We construct non-linear machine learning (ML) prediction models for systolic and diastolic blood pressure (SBP, DBP) using demographic and clinical variables and polygenic risk scores (PRSs). We developed a two-model ensemble, consisting of a baseline model, where prediction is based on demographic and clinical variables only, and a genetic model, where we also include PRSs. We evaluate the use of a linear versus a non-linear model at both the baseline and the genetic model levels and assess the improvement in performance when incorporating multiple PRSs. We report the ensemble model's performance as percentage variance explained (PVE) on a held-out test dataset. A non-linear baseline model improved the PVEs from 28.1% to 30.1% (SBP) and 14.3% to 17.4% (DBP) compared with a linear baseline model. Including seven PRSs in the genetic model computed based on the largest available GWAS of SBP/DBP improved the genetic model PVE from 4.8% to 5.1% (SBP) and 4.7% to 5% (DBP) compared to using a single PRS. Adding additional 14 PRSs computed based on two independent GWASs further increased the genetic model PVE to 6.3% (SBP) and 5.7% (DBP). PVE differed across self-reported race/ethnicity groups, with primarily all non-White groups benefitting from the inclusion of additional PRSs.
Collapse
Affiliation(s)
- Yana Hrytsenko
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
- CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA
| | - Benjamin Shea
- CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA
| | - Michael Elgart
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | | | - Genevieve Lyons
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA
| | - Alanna C. Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - April P. Carson
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Bernhard Haring
- Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
- Department of Medicine III, Saarland University, Homburg, Saarland, Germany
| | - Braxton D. Mitchel
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Bruce M. Psaty
- Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
- Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Byron C. Jaeger
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - C Charles Gu
- The Center for Biostatistics and Data Science, Washington University, St. Louis, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Daniel Levy
- The Population Sciences Branch of the National Heart, Lung and Blood Institute, Bethesda, MD, USA
- The Framingham Heart Study, Framingham, MA, USA
| | - Donald Lloyd-Jones
- Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
| | - Eunhee Choi
- Columbia Hypertension Laboratory, Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
| | - Jennifer A Brody
- Department of Medicine, University of Washington, Seattle, WA, USA
- Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Jerome I. Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Matthew Moll
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
- VA Boston Healthcare System, West Roxbury, MA, USA
| | - Myriam Fornage
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Noah Simon
- Department of Biostatistics, School of Public Health, University of Washington, Seattle, WA
| | - Peter Castaldi
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Ramon Casanova
- Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Ren-Hua Chung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Taipei City, Taiwan
| | - Robert Kaplan
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Ruth J.F. Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty for Health and Medical Sciences, University of Copenhagen, Denmark, DK
| | - Sharon L. R. Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Stephen S. Rich
- Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Susan Redline
- Department of Medicine, Harvard Medical School, Boston, MA
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
| | - Tanika Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Timothy O’Connor
- Department of Medicine III, Saarland University, Homburg, Saarland, Germany
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Wonji Kim
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Yii Der Ida Chen
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | | | - Tamar Sofer
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA
- Department of Medicine, Harvard Medical School, Boston, MA
- CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA
| |
Collapse
|
183
|
Tanner KT, Keogh RH, Coupland CAC, Hippisley-Cox J, Diaz-Ordaz K. Dynamic updating of clinical survival prediction models in a changing environment. Diagn Progn Res 2023; 7:24. [PMID: 38082429 PMCID: PMC10714456 DOI: 10.1186/s41512-023-00163-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 10/17/2023] [Indexed: 01/31/2024] Open
Abstract
BACKGROUND Over time, the performance of clinical prediction models may deteriorate due to changes in clinical management, data quality, disease risk and/or patient mix. Such prediction models must be updated in order to remain useful. In this study, we investigate dynamic model updating of clinical survival prediction models. In contrast to discrete or one-time updating, dynamic updating refers to a repeated process for updating a prediction model with new data. We aim to extend previous research which focused largely on binary outcome prediction models by concentrating on time-to-event outcomes. We were motivated by the rapidly changing environment seen during the COVID-19 pandemic where mortality rates changed over time and new treatments and vaccines were introduced. METHODS We illustrate three methods for dynamic model updating: Bayesian dynamic updating, recalibration, and full refitting. We use a simulation study to compare performance in a range of scenarios including changing mortality rates, predictors with low prevalence and the introduction of a new treatment. Next, the updating strategies were applied to a model for predicting 70-day COVID-19-related mortality using patient data from QResearch, an electronic health records database from general practices in the UK. RESULTS In simulated scenarios with mortality rates changing over time, all updating methods resulted in better calibration than not updating. Moreover, dynamic updating outperformed ad hoc updating. In the simulation scenario with a new predictor and a small updating dataset, Bayesian updating improved the C-index over not updating and refitting. In the motivating example with a rare outcome, no single updating method offered the best performance. CONCLUSIONS We found that a dynamic updating process outperformed one-time discrete updating in the simulations. Bayesian updating offered good performance overall, even in scenarios with new predictors and few events. Intercept recalibration was effective in scenarios with smaller sample size and changing baseline hazard. Refitting performance depended on sample size and produced abrupt changes in hazard ratio estimates between periods.
Collapse
Affiliation(s)
- Kamaryn T Tanner
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK.
| | - Ruth H Keogh
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK
| | - Carol A C Coupland
- Nuffield Department of Primary Health Care Sciences, University of Oxford, Oxford, OX2 6HT, UK
- Centre for Academic Primary Care, University of Nottingham, Nottingham, NG7 2UH, UK
| | - Julia Hippisley-Cox
- Nuffield Department of Primary Health Care Sciences, University of Oxford, Oxford, OX2 6HT, UK
| | - Karla Diaz-Ordaz
- Department of Statistical Science, University College London, London, WC1E 6BT, UK
| |
Collapse
|
184
|
Salama V, Godinich B, Geng Y, Humbert-Vidan L, Maule L, Wahid KA, Naser MA, He R, Mohamed ASR, Fuller CD, Moreno AC. Artificial Intelligence and Machine Learning in Cancer Related Pain: A Systematic Review. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.06.23299610. [PMID: 38105979 PMCID: PMC10723503 DOI: 10.1101/2023.12.06.23299610] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Background/objective Pain is a challenging multifaceted symptom reported by most cancer patients, resulting in a substantial burden on both patients and healthcare systems. This systematic review aims to explore applications of artificial intelligence/machine learning (AI/ML) in predicting pain-related outcomes and supporting decision-making processes in pain management in cancer. Methods A comprehensive search of Ovid MEDLINE, EMBASE and Web of Science databases was conducted using terms including "Cancer", "Pain", "Pain Management", "Analgesics", "Opioids", "Artificial Intelligence", "Machine Learning", "Deep Learning", and "Neural Networks" published up to September 7, 2023. The screening process was performed using the Covidence screening tool. Only original studies conducted in human cohorts were included. AI/ML models, their validation and performance and adherence to TRIPOD guidelines were summarized from the final included studies. Results This systematic review included 44 studies from 2006-2023. Most studies were prospective and uni-institutional. There was an increase in the trend of AI/ML studies in cancer pain in the last 4 years. Nineteen studies used AI/ML for classifying cancer patients' pain development after cancer therapy, with median AUC 0.80 (range 0.76-0.94). Eighteen studies focused on cancer pain research with median AUC 0.86 (range 0.50-0.99), and 7 focused on applying AI/ML for cancer pain management decisions with median AUC 0.71 (range 0.47-0.89). Multiple ML models were investigated with. median AUC across all models in all studies (0.77). Random forest models demonstrated the highest performance (median AUC 0.81), lasso models had the highest median sensitivity (1), while Support Vector Machine had the highest median specificity (0.74). Overall adherence of included studies to TRIPOD guidelines was 70.7%. Lack of external validation (14%) and clinical application (23%) of most included studies was detected. Reporting of model calibration was also missing in the majority of studies (5%). Conclusion Implementation of various novel AI/ML tools promises significant advances in the classification, risk stratification, and management decisions for cancer pain. These advanced tools will integrate big health-related data for personalized pain management in cancer patients. Further research focusing on model calibration and rigorous external clinical validation in real healthcare settings is imperative for ensuring its practical and reliable application in clinical practice.
Collapse
|
185
|
Fridgeirsson EA, Sontag D, Rijnbeek P. Attention-based neural networks for clinical prediction modelling on electronic health records. BMC Med Res Methodol 2023; 23:285. [PMID: 38062352 PMCID: PMC10701944 DOI: 10.1186/s12874-023-02112-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 11/27/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. METHODS We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. RESULTS Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. CONCLUSION In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.
Collapse
Affiliation(s)
- Egill A Fridgeirsson
- Department of Medical Informatics, Erasmus University Medical Center, Doctor Molewaterplein 40, 3015 GD, Rotterdam, the Netherlands.
| | - David Sontag
- Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Peter Rijnbeek
- Department of Medical Informatics, Erasmus University Medical Center, Doctor Molewaterplein 40, 3015 GD, Rotterdam, the Netherlands
| |
Collapse
|
186
|
van Royen FS, Asselbergs FW, Alfonso F, Vardas P, van Smeden M. Five critical quality criteria for artificial intelligence-based prediction models. Eur Heart J 2023; 44:4831-4834. [PMID: 37897346 DOI: 10.1093/eurheartj/ehad727] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/30/2023] Open
Abstract
To raise the quality of clinical artificial intelligence (AI) prediction modelling studies in the cardiovascular health domain and thereby improve their impact and relevancy, the editors for digital health, innovation, and quality standards of the European Heart Journal propose five minimal quality criteria for AI-based prediction model development and validation studies: complete reporting, carefully defined intended use of the model, rigorous validation, large enough sample size, and openness of code and software.
Collapse
Affiliation(s)
- Florien S van Royen
- Department of General Practice & Nursing Science, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Folkert W Asselbergs
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
- Health Data Research UK and Institute of Health Informatics, University College London, London, UK
| | - Fernando Alfonso
- Department of Cardiology, Hospital Universitario de la Princesa, Universidad Autónoma de Madrid, IIS-IP. CIVER-CV, Madrid, Spain
| | - Panos Vardas
- Biomedical Research Foundation Academy of Athens (BRFAA) and Hygeia Hospitals Group, Athens, Greece
| | - Maarten van Smeden
- Department of Epidemiology & Health Economics, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG Utrecht, Netherlands
- Department of Data Science & Biostatistics, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG Utrecht, The Netherlands
| |
Collapse
|
187
|
de Hond A, Huisman M, Van Smeden M. Why the grass isn't always greener on the machine learning side. Eur J Intern Med 2023; 118:36-37. [PMID: 37879970 DOI: 10.1016/j.ejim.2023.10.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 10/03/2023] [Indexed: 10/27/2023]
Affiliation(s)
- Anne de Hond
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Menno Huisman
- Department of Thrombosis and Hemostasis Leiden University Medical Center Leiden Netherlands
| | - Maarten Van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, the Netherlands.
| |
Collapse
|
188
|
Pegu B, Subburaj SP, Chaturvedula L, Sarkar S, Nair NS, Keepanasseril A. External validation of prediction models for vaginal delivery after the trial of labour among women with previous one caesarean section - A cohort study. Eur J Obstet Gynecol Reprod Biol 2023; 291:10-15. [PMID: 37801782 DOI: 10.1016/j.ejogrb.2023.09.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/28/2023] [Accepted: 09/30/2023] [Indexed: 10/08/2023]
Abstract
OBJECTIVE To externally validate three predictive models (the Grobman model (2007), the Zhang model (2020), and the Grobman model (2021)) for identifying women with increased chances of a successful trial of labour after caesarean section (TOLAC). METHODS This retrospective observational cohort study was conducted in a tertiary teaching hospital from 2018 to 2021. Individual probabilities were calculated for women with previous one caesarean section who underwent TOLAC at term, using the predicted probabilities from the logistic regression models. The primary outcome of this study was vaginal delivery following attempted TOLAC. The predictive ability of the models was assessed using the area under the receiver operative characteristics curves (AUC) and a calibration graph. RESULTS Of 1515 eligible women who underwent TOLAC, we found an overall rate of successful TOLAC of 60.3 %. No significant difference was noticed in adverse scar outcome and neonatal morbidity while comparing successful and failed TOLAC. The discriminative ability of Grobman-2007 and Grobman-2021 and the Zhang model were fair to poor with the AUC of 0.54(95 % CI 0.51-0.57), 0.62(95 % CI 0.59-0.65) and 0.66(95 % CI 0.63-0.69) respectively. The agreement between the observed rates of TOLAC success and the predicted probabilities for all three models was poor. CONCLUSION The performance of all three models predicting success after TOLAC was poor in the study population. A population-specific model may be needed, with the addition of factors influencing the labour, such as the methods of induction, which may aid in predicting the outcome.
Collapse
Affiliation(s)
- Bhabani Pegu
- Department of Obstetrics & Gynaecology, Jawaharlal Institute of Medical Education & Research, Puducherry 605006, India
| | - Sathiya Priya Subburaj
- Department of Obstetrics & Gynaecology, Jawaharlal Institute of Medical Education & Research, Puducherry 605006, India
| | - Latha Chaturvedula
- Department of Obstetrics & Gynaecology, Jawaharlal Institute of Medical Education & Research, Puducherry 605006, India
| | - Sonali Sarkar
- Preventive and Social Medicine, Jawaharlal Institute of Medical Education & Research, Puducherry 605006, India
| | - N Sreekumaran Nair
- Biostatistics, Jawaharlal Institute of Medical Education & Research, Puducherry 605006, India
| | - Anish Keepanasseril
- Department of Obstetrics & Gynaecology, Jawaharlal Institute of Medical Education & Research, Puducherry 605006, India.
| |
Collapse
|
189
|
Chiasakul T, Lam BD, McNichol M, Robertson W, Rosovsky RP, Lake L, Vlachos IS, Adamski A, Reyes N, Abe K, Zwicker JI, Patell R. Artificial intelligence in the prediction of venous thromboembolism: A systematic review and pooled analysis. Eur J Haematol 2023; 111:951-962. [PMID: 37794526 PMCID: PMC10900245 DOI: 10.1111/ejh.14110] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/16/2023] [Accepted: 09/18/2023] [Indexed: 10/06/2023]
Abstract
BACKGROUND Accurate diagnostic and prognostic predictions of venous thromboembolism (VTE) are crucial for VTE management. Artificial intelligence (AI) enables autonomous identification of the most predictive patterns from large complex data. Although evidence regarding its performance in VTE prediction is emerging, a comprehensive analysis of performance is lacking. AIMS To systematically review the performance of AI in the diagnosis and prediction of VTE and compare it to clinical risk assessment models (RAMs) or logistic regression models. METHODS A systematic literature search was performed using PubMed, MEDLINE, EMBASE, and Web of Science from inception to April 20, 2021. Search terms included "artificial intelligence" and "venous thromboembolism." Eligible criteria were original studies evaluating AI in the prediction of VTE in adults and reporting one of the following outcomes: sensitivity, specificity, positive predictive value, negative predictive value, or area under receiver operating curve (AUC). Risks of bias were assessed using the PROBAST tool. Unpaired t-test was performed to compare the mean AUC from AI versus conventional methods (RAMs or logistic regression models). RESULTS A total of 20 studies were included. Number of participants ranged from 31 to 111 888. The AI-based models included artificial neural network (six studies), support vector machines (four studies), Bayesian methods (one study), super learner ensemble (one study), genetic programming (one study), unspecified machine learning models (two studies), and multiple machine learning models (five studies). Twelve studies (60%) had both training and testing cohorts. Among 14 studies (70%) where AUCs were reported, the mean AUC for AI versus conventional methods were 0.79 (95% CI: 0.74-0.85) versus 0.61 (95% CI: 0.54-0.68), respectively (p < .001). However, the good to excellent discriminative performance of AI methods is unlikely to be replicated when used in clinical practice, because most studies had high risk of bias due to missing data handling and outcome determination. CONCLUSION The use of AI appears to improve the accuracy of diagnostic and prognostic prediction of VTE over conventional risk models; however, there was a high risk of bias observed across studies. Future studies should focus on transparent reporting, external validation, and clinical application of these models.
Collapse
Affiliation(s)
- Thita Chiasakul
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hematology, Faculty of Medicine, Department of Medicine, Center of Excellence in Translational Hematology, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Bangkok, Thailand
| | - Barbara D Lam
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
| | - Megan McNichol
- Division of Knowledge Services, Department of Information Services (M.M.), Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA
| | - William Robertson
- National Blood Clot Alliance, Philadelphia, Pennsylvania, USA
- Department of Emergency Healthcare, College of Health Professions, Weber State University, Ogden, Utah, USA
| | - Rachel P Rosovsky
- Division of Hematology/Oncology, Department of Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Leslie Lake
- National Blood Clot Alliance, Philadelphia, Pennsylvania, USA
| | - Ioannis S Vlachos
- Department of Pathology, Cancer Research Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
| | - Alys Adamski
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Nimia Reyes
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Karon Abe
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Jeffrey I Zwicker
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Department of Medicine, Hematology Service, Memorial Sloan Kettering Cancer Center, New York City, New York, USA
| | - Rushad Patell
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Division of Hemostasis and Thrombosis, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
190
|
Lapp L, Roper M, Kavanagh K, Schraag S. Development and validation of a digital biomarker predicting acute kidney injury following cardiac surgery on an hourly basis. JTCVS OPEN 2023; 16:540-581. [PMID: 38204694 PMCID: PMC10775068 DOI: 10.1016/j.xjon.2023.09.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Revised: 09/01/2023] [Accepted: 09/06/2023] [Indexed: 01/12/2024]
Abstract
Objectives To develop and validate a digital biomarker for predicting the onset of acute kidney injury (AKI) on an hourly basis up to 24 hours in advance in the intensive care unit after cardiac surgery. Methods The study analyzed data from 6056 adult patients undergoing coronary artery bypass graft and/or valve surgery between April 1, 2012, and December 31, 2018 (development phase, training, and testing) and 3572 patients between January 1, 2019, and June 30, 2022 (validation phase). The study used 2 dynamic predictive modeling approaches, namely logistic regression and bootstrap aggregated regression trees machine (BARTm), to predict AKI. The mean area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and positive and negative predictive values across all lead times before the occurrence of AKI were reported. The clinical practicality was assessed using calibration. Results Of all included patients, 8.45% and 16.66% had AKI in the development and validation phases, respectively. When applied to testing data, AKI was predicted with the mean AUC of 0.850 and 0.802 by BARTm and logistic regression, respectively. When applied to validation data, BARTm and LR resulted in a mean AUC of 0.844 and 0.786, respectively. Conclusions This study demonstrated the successful prediction of AKI on an hourly basis up to 24 hours in advance. The digital biomarkers developed and validated in this study have the potential to assist clinicians in optimizing treatment and implementing preventive strategies for patients at risk of developing AKI after cardiac surgery in the intensive care unit.
Collapse
Affiliation(s)
- Linda Lapp
- Department of Computer and Information Sciences, Faculty of Science, University of Strathclyde, Glasgow, Scotland
| | - Marc Roper
- Department of Computer and Information Sciences, Faculty of Science, University of Strathclyde, Glasgow, Scotland
| | - Kimberley Kavanagh
- Department of Mathematics and Statistics, Faculty of Science, University of Strathclyde, Glasgow, Scotland
| | - Stefan Schraag
- Department of Anaesthesia and Perioperative Medicine, Golden Jubilee National Hospital, Clydebank, United Kingdom
| |
Collapse
|
191
|
Riley RD, Collins GS. Stability of clinical prediction models developed using statistical or machine learning methods. Biom J 2023; 65:e2200302. [PMID: 37466257 PMCID: PMC10952221 DOI: 10.1002/bimj.202200302] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 04/26/2023] [Accepted: 05/02/2023] [Indexed: 07/20/2023]
Abstract
Clinical prediction models estimate an individual's risk of a particular health outcome. A developed model is a consequence of the development dataset and model-building strategy, including the sample size, number of predictors, and analysis method (e.g., regression or machine learning). We raise the concern that many models are developed using small datasets that lead to instability in the model and its predictions (estimated risks). We define four levels of model stability in estimated risks moving from the overall mean to the individual level. Through simulation and case studies of statistical and machine learning approaches, we show instability in a model's estimated risks is often considerable, and ultimately manifests itself as miscalibration of predictions in new data. Therefore, we recommend researchers always examine instability at the model development stage and propose instability plots and measures to do so. This entails repeating the model-building steps (those used to develop the original prediction model) in each of multiple (e.g., 1000) bootstrap samples, to produce multiple bootstrap models, and deriving (i) a prediction instability plot of bootstrap model versus original model predictions; (ii) the mean absolute prediction error (mean absolute difference between individuals' original and bootstrap model predictions), and (iii) calibration, classification, and decision curve instability plots of bootstrap models applied in the original sample. A case study illustrates how these instability assessments help reassure (or not) whether model predictions are likely to be reliable (or not), while informing a model's critical appraisal (risk of bias rating), fairness, and further validation requirements.
Collapse
Affiliation(s)
- Richard D. Riley
- Institute of Applied Health ResearchCollege of Medical and Dental SciencesUniversity of BirminghamBirminghamUK
| | - Gary S. Collins
- Centre for Statistics in MedicineNuffield Department of OrthopaedicsRheumatology and Musculoskeletal SciencesUniversity of OxfordOxfordUK
| |
Collapse
|
192
|
Galimzhanov A, Matetic A, Tenekecioglu E, Mamas MA. Prediction of clinical outcomes after percutaneous coronary intervention: Machine-learning analysis of the National Inpatient Sample. Int J Cardiol 2023; 392:131339. [PMID: 37678434 DOI: 10.1016/j.ijcard.2023.131339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 08/08/2023] [Accepted: 09/03/2023] [Indexed: 09/09/2023]
Abstract
BACKGROUND This study aimed to develop a multiclass machine-learning (ML) model to predict all-cause mortality, ischemic and hemorrhagic events in unselected hospitalized patients undergoing percutaneous coronary intervention (PCI). METHODS This retrospective study included 1,815,595 unselected weighted hospitalizations undergoing PCI from the National Inpatient Sample (2016-2019). Five most common ML algorithms (logistic regression, support vector machine (SVM), naive Bayes, random forest (RF), and extreme gradient boosting (XGBoost)) were trained and tested with 101 input features. The study endpoints were different combinations of all-cause mortality, ischemic cerebrovascular events (CVE) and major bleeding. An area under the curve (AUC) with 95% confidence interval (95% CI) was selected as a performance metric. RESULTS The study population was split to a training cohort of 1,186,880 PCI discharges, validation cohort (for calibration) of 296,725 hospitalizations and a test cohort of 331,990 PCI discharges. A total of 98,180 (5.4%) hospital entries included study outcomes. Logistic regression, SVM, naive Bayes, and RF model demonstrated AUCs of 0.83 (95% CI 0.82-0.84), 0.84 (95% CI 0.83-0.86), 0.81 (95% CI 0.80-0.82), and 0.83 (95% CI 0.81-0.84), retrospectively. The XGBoost classifier performed the best with an AUC of 0.86 (95% CI 0.85-0.87) with excellent calibration. We then built a web-based application that provides predictions based on the XGBoost model. CONCLUSION We derived the multi-task XGBoost classifier based on 101 features to predict different combinations of all-cause death, ischemic CVE and major bleeding. Such models may be useful in benchmarking and risk prediction using routinely collected administrative data.
Collapse
Affiliation(s)
- Akhmetzhan Galimzhanov
- Department of Propedeutics of Internal Disease, Semey Medical University, Semey, Kazakhstan; Keele Cardiovascular Research Group, Keele University, Keele, UK.
| | - Andrija Matetic
- Keele Cardiovascular Research Group, Keele University, Keele, UK; Department of Cardiology, University Hospital of Split, Split 21000, Croatia
| | - Erhan Tenekecioglu
- Department of Cardiology, Bursa Education and Research Hospital, Health Sciences University, Bursa,Turkey; Department of Cardiology, Thoraxcenter, Erasmus MC, Erasmus University, Rotterdam, the Netherlands
| | - Mamas A Mamas
- Keele Cardiovascular Research Group, Keele University, Keele, UK
| |
Collapse
|
193
|
Goudsmit BF, Ilaria Prosepe, Tushuizen ME, Mazzaferro V, Alwayn IP, van Hoek B, Braat AE, Putter H. Survival benefit from liver transplantation for patients with and without hepatocellular carcinoma. JHEP Rep 2023; 5:100907. [PMID: 38034881 PMCID: PMC10685016 DOI: 10.1016/j.jhepr.2023.100907] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 08/11/2023] [Accepted: 08/31/2023] [Indexed: 12/02/2023] Open
Abstract
Background & Aims In the USA, inequal liver transplantation (LT) access exists between patients with and without hepatocellular carcinoma (HCC). Survival benefit considers survival without and with LT and could equalise LT access. We calculated bias-corrected LT survival benefit for patients with(out) HCC who underwent a transplant, based on longitudinal data in a recent United States cohort. Methods Adult LT candidates with(out) HCC between 2010 and 2019 were included. Waitlist survival over time was contrasted to post-transplant survival, to estimate 5-year survival benefit from the moment of LT. Waitlist survival was modelled with a bias-corrected Cox regression, and post-transplant survival was estimated through Cox proportional hazards regression. Results Mean HCC survival without LT was always lower than non-HCC waitlist survival. Below model for end-stage liver disease (sodium) (MELD(-Na)) 30, patients with HCC gained more life-years from LT than patients without HCC at the same MELD(-Na) score. Only patients without HCC below MELD(-Na) 9 had negative benefit. Most patients with HCC underwent a transplant below MELD(-Na) 14, and most patients without HCC underwent a transplant above MELD(-Na) 26. Liver function [MELD(-Na), albumin] was the main predictor of 5-year benefit. Therefore, during 5 years, most patients with HCC gained 0.12 to 1.96 years from LT, whereas most patients without HCC gained 2.48 to 3.45 years. Conclusions On an individual level, performing a transplant in patients with HCC resulted in survival benefit. However, on a population level, benefit was indirectly decreased, as patients without HCC were likely to gain more survival owing to decreased liver function. For patients who underwent a transplant, a constructed online calculator estimates 5-year survival benefit given specific patient characteristics. Survival benefit scores could serve to equalise LT access. Impact and implications Benefit is a comparison of the survival with and without liver transplantation, and it is important when deciding who should undergo a transplant. Liver function is most important when predicting possible benefit from transplantation. Patients with liver cancer die sooner on the waiting list than similar patients without liver cancer. However, patients with liver cancer more often have better liver function. Most patients without liver cancer derive more benefit from transplantation than patients with liver cancer.
Collapse
Affiliation(s)
- Ben F.J. Goudsmit
- Department of Gastroenterology and Hepatology, Leiden University Medical Center, Leiden, The Netherlands
| | - Ilaria Prosepe
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - Maarten E. Tushuizen
- Department of Gastroenterology and Hepatology, Leiden University Medical Center, Leiden, The Netherlands
- Transplant Center, Leiden University Medical Center, Leiden, The Netherlands
| | - Vincenzo Mazzaferro
- Department of Oncology, University of Milan, Milan, Italy
- Hepatology and Liver Transplantation Unit, Department of Surgery, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico, Istituto Nazionale Tumori, Milan, Italy
| | - Ian P.J. Alwayn
- Transplant Center, Leiden University Medical Center, Leiden, The Netherlands
- Department of Surgery, Leiden University Medical Center, Leiden, The Netherlands
| | - Bart van Hoek
- Department of Gastroenterology and Hepatology, Leiden University Medical Center, Leiden, The Netherlands
- Transplant Center, Leiden University Medical Center, Leiden, The Netherlands
| | - Andries E. Braat
- Department of Surgery, Leiden University Medical Center, Leiden, The Netherlands
| | - Hein Putter
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
194
|
Schuring N, Donlon NE, Hagens ERC, Gootjes D, Donohoe CL, van Berge Henegouwen MI, Reynolds JV, Gisbertz SS. External validation of a nomogram predicting conditional survival after tri-modality treatment of esophageal cancer. Surgery 2023; 174:1363-1370. [PMID: 37735034 DOI: 10.1016/j.surg.2023.08.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 05/27/2023] [Accepted: 08/03/2023] [Indexed: 09/23/2023]
Abstract
BACKGROUND A conditional survival nomogram was developed at a single high-volume center to predict 5-year overall survival for esophageal cancer patients after neoadjuvant chemoradiation and esophagectomy. The aim of this study was to externally validate the nomogram in a cohort of patients with esophageal adeno- or squamous cell carcinoma from another high-volume center. METHODS Consecutive patients with an esophageal adeno- or squamous cell carcinoma who had undergone esophagectomy after being treated with preoperative chemoradiation between 2004 and 2016 were selected from a prospectively maintained institutional database. The level of discrimination for prediction of 5-year overall survival was quantified by Harrell's C statistic. Calibration of the conditional survival nomogram was visualized by plotting predicted 5-year survival and observed 5-year survival for comparison. RESULTS Of the 296 patients examined, the probability of 5-year overall survival directly after surgery was 45% and increased to 51%, 68%, 78%, and 89% for each additional year survived. The predicted 5-year overall survival differed from the observed survival, with a calibration slope of 0.54, 0.55, 0.59, 0.73, and 1.09 directly after surgery and 1, 2, 3, and 4 years of survival after surgery, respectively. The nomogram's discrimination level for 5-year survival was moderate, with a C statistic of 0.65 compared to the 0.70 reported in the original study. CONCLUSION The nomogram model has moderate predictive discrimination and accuracy, supporting its applicability to external cohorts to predict conditional survival. Further validation studies should empirically assess the model for predictive performance.
Collapse
Affiliation(s)
- Nannet Schuring
- Department of Surgery, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands; Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands.
| | - Noel E Donlon
- Department of Gastro-Intestinal Surgery, National Center for Esophageal and Gastric Cancer, Trinity St. James's Cancer Institute, Dublin, Ireland
| | - Eliza R C Hagens
- Department of Surgery, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands
| | - Didier Gootjes
- Department of Surgery, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands
| | - Claire L Donohoe
- Department of Gastro-Intestinal Surgery, National Center for Esophageal and Gastric Cancer, Trinity St. James's Cancer Institute, Dublin, Ireland
| | - Mark I van Berge Henegouwen
- Department of Surgery, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands; Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
| | - John V Reynolds
- Department of Gastro-Intestinal Surgery, National Center for Esophageal and Gastric Cancer, Trinity St. James's Cancer Institute, Dublin, Ireland
| | - Suzanne S Gisbertz
- Department of Surgery, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands; Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
| |
Collapse
|
195
|
Vigdal ØN, Storheim K, Killingmo RM, Rysstad T, Pripp AH, van der Gaag W, Chiarotto A, Koes B, Grotle M. External validation and updating of prognostic prediction models for nonrecovery among older adults seeking primary care for back pain. Pain 2023; 164:2759-2768. [PMID: 37490100 DOI: 10.1097/j.pain.0000000000002974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 03/23/2023] [Indexed: 07/26/2023]
Abstract
ABSTRACT Prognostic prediction models for 3 different definitions of nonrecovery were developed in the Back Complaints in the Elders study in the Netherlands. The models' performance was good (optimism-adjusted area under receiver operating characteristics [AUC] curve ≥0.77, R2 ≥0.3). This study aimed to assess the external validity of the 3 prognostic prediction models in the Norwegian Back Complaints in the Elders study. We conducted a prospective cohort study, including 452 patients aged ≥55 years, seeking primary care for a new episode of back pain. Nonrecovery was defined for 2 outcomes, combining 6- and 12-month follow-up data: Persistent back pain (≥3/10 on numeric rating scale) and persistent disability (≥4/24 on Roland-Morris Disability Questionnaire). We could not assess the third model (self-reported nonrecovery) because of substantial missing data (>50%). The models consisted of biopsychosocial prognostic factors. First, we assessed Nagelkerke R2 , discrimination (AUC) and calibration (calibration-in-the-large [CITL], slope, and calibration plot). Step 2 was to recalibrate the models based on CITL and slope. Step 3 was to reestimate the model coefficients and assess if this improved performance. The back pain model demonstrated acceptable discrimination (AUC 0.74, 95% confidence interval: 0.69-0.79), and R2 was 0.23. The disability model demonstrated excellent discrimination (AUC 0.81, 95% confidence interval: 0.76-0.85), and R2 was 0.35. Both models had poor calibration (CITL <0, slope <1). Recalibration yielded acceptable calibration for both models, according to the calibration plots. Step 3 did not improve performance substantially. The recalibrated models may need further external validation, and the models' clinical impact should be assessed.
Collapse
Affiliation(s)
- Ørjan Nesse Vigdal
- Department of Rehabilitation Science and Health Technology, Faculty of Health Science, OsloMet-Oslo Metropolitan University, Oslo, Norway
| | - Kjersti Storheim
- Department of Rehabilitation Science and Health Technology, Faculty of Health Science, OsloMet-Oslo Metropolitan University, Oslo, Norway
- Research and Communication Unit for Musculoskeletal Health (FORMI), Division of Clinical Neuroscience, Oslo University Hospital, Oslo, Norway
| | - Rikke Munk Killingmo
- Department of Rehabilitation Science and Health Technology, Faculty of Health Science, OsloMet-Oslo Metropolitan University, Oslo, Norway
| | - Tarjei Rysstad
- Department of Rehabilitation Science and Health Technology, Faculty of Health Science, OsloMet-Oslo Metropolitan University, Oslo, Norway
| | - Are Hugo Pripp
- Department of Rehabilitation Science and Health Technology, Faculty of Health Science, OsloMet-Oslo Metropolitan University, Oslo, Norway
| | - Wendelien van der Gaag
- Department of General Practice, Erasmus MC, University Medical Center, Rotterdam, the Netherlands
| | - Alessandro Chiarotto
- Department of General Practice, Erasmus MC, University Medical Center, Rotterdam, the Netherlands
| | - Bart Koes
- Department of General Practice, Erasmus MC, University Medical Center, Rotterdam, the Netherlands
- Center for Muscle and Health, University of Southern Denmark, Odense, Denmark
| | - Margreth Grotle
- Department of Rehabilitation Science and Health Technology, Faculty of Health Science, OsloMet-Oslo Metropolitan University, Oslo, Norway
- Research and Communication Unit for Musculoskeletal Health (FORMI), Division of Clinical Neuroscience, Oslo University Hospital, Oslo, Norway
| |
Collapse
|
196
|
Sud M, Sivaswamy A, Austin PC, Anderson TJ, Naimark DMJ, Farkouh ME, Lee DS, Roifman I, Thanassoulis G, Tu K, Udell JA, Wijeysundera HC, Ko DT. Development and Validation of the CANHEART Population-Based Laboratory Prediction Models for Atherosclerotic Cardiovascular Disease. Ann Intern Med 2023; 176:1638-1647. [PMID: 38079638 DOI: 10.7326/m23-1345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/20/2023] Open
Abstract
BACKGROUND Prediction of atherosclerotic cardiovascular disease (ASCVD) in primary prevention assessments exclusively with laboratory results may facilitate automated risk reporting and improve uptake of preventive therapies. OBJECTIVE To develop and validate sex-specific prediction models for ASCVD using age and routine laboratory tests and compare their performance with that of the pooled cohort equations (PCEs). DESIGN Derivation and validation of the CANHEART (Cardiovascular Health in Ambulatory Care Research Team) Lab Models. SETTING Population-based cohort study in Ontario, Canada. PARTICIPANTS A derivation and internal validation cohort of adults aged 40 to 75 years without cardiovascular disease from April 2009 to December 2015; an external validation cohort of primary care patients from January 2010 to December 2014. MEASUREMENTS Age and laboratory predictors measured in the outpatient setting included serum total cholesterol, high-density lipoprotein cholesterol, triglycerides, hemoglobin, mean corpuscular volume, platelets, leukocytes, estimated glomerular filtration rate, and glucose. The ASCVD outcomes were defined as myocardial infarction, stroke, and death from ischemic heart or cerebrovascular disease within 5 years. RESULTS Sex-specific models were developed and internally validated in 2 160 497 women and 1 833 147 men. They were well calibrated, with relative differences less than 1% between mean predicted and observed risk for both sexes. The c-statistic was 0.77 in women and 0.71 in men. External validation in 31 697 primary care patients showed a relative difference less than 14% and an absolute difference less than 0.3 percentage points in mean predicted and observed risks for both sexes. The c-statistics for the laboratory models were 0.72 for both sexes and were not statistically significantly different from those for the PCEs in women (change in c-statistic, -0.01 [95% CI, -0.03 to 0.01]) or men (change in c-statistic, -0.01 [CI, -0.04 to 0.02]). LIMITATION Medication use was not available at the population level. CONCLUSION The CANHEART Lab Models predict ASCVD with similar accuracy to more complex models, such as the PCEs. PRIMARY FUNDING SOURCE Canadian Institutes of Health Research.
Collapse
Affiliation(s)
- Maneesh Sud
- Schulich Heart Program, Sunnybrook Health Sciences Centre, University of Toronto; Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; and Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada (M.S., I.R., H.C.W., D.T.K.)
| | | | - Peter C Austin
- Institute of Health Policy, Management and Evaluation, University of Toronto, and ICES, Toronto, Ontario, Canada (P.C.A.)
| | - Todd J Anderson
- Libin Cardiovascular Institute and Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada (T.J.A.)
| | - David M J Naimark
- Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; and Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada (D.M.J.N.)
| | - Michael E Farkouh
- Academic Affairs, Cedars-Sinai Health System, Los Angeles, California (M.E.F.)
| | - Douglas S Lee
- Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; Temerty Faculty of Medicine, University of Toronto; Peter Munk Cardiac Centre, University Health Network, University of Toronto; and Ted Rogers Centre for Heart Research, Toronto, Ontario, Canada (D.S.L.)
| | - Idan Roifman
- Schulich Heart Program, Sunnybrook Health Sciences Centre, University of Toronto; Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; and Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada (M.S., I.R., H.C.W., D.T.K.)
| | - George Thanassoulis
- Department of Medicine, McGill University, and Preventive and Genomic Cardiology, McGill University Health Centre, Montreal, Quebec, Canada (G.T.)
| | - Karen Tu
- Toronto Western Family Health Team, University Health Network, North York General Hospital, and Department of Family and Community Medicine, Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada (K.T.)
| | - Jacob A Udell
- Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; Temerty Faculty of Medicine, University of Toronto; Peter Munk Cardiac Centre, University Health Network, University of Toronto; and Women's College Hospital, University of Toronto, Toronto, Ontario, Canada (J.A.U.)
| | - Harindra C Wijeysundera
- Schulich Heart Program, Sunnybrook Health Sciences Centre, University of Toronto; Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; and Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada (M.S., I.R., H.C.W., D.T.K.)
| | - Dennis T Ko
- Schulich Heart Program, Sunnybrook Health Sciences Centre, University of Toronto; Institute of Health Policy, Management and Evaluation, University of Toronto; ICES; and Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada (M.S., I.R., H.C.W., D.T.K.)
| |
Collapse
|
197
|
Kanwal F, Khaderi S, Singal AG, Marrero JA, Asrani SK, Amos CI, Thrift AP, Kramer JR, Yu X, Cao Y, Luster M, Al-Sarraj A, Ning J, El-Serag HB. Risk Stratification Model for Hepatocellular Cancer in Patients With Cirrhosis. Clin Gastroenterol Hepatol 2023; 21:3296-3304.e3. [PMID: 37390101 PMCID: PMC10661677 DOI: 10.1016/j.cgh.2023.04.019] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 04/07/2023] [Accepted: 04/13/2023] [Indexed: 07/02/2023]
Abstract
BACKGROUND & AIMS The available risk stratification indices for hepatocellular cancer (HCC) have limited applicability. We developed and externally validated an HCC risk stratification index in U.S. cohorts of patients with cirrhosis. METHODS We used data from 2 prospective U.S. cohorts to develop the risk index. Patients with cirrhosis were enrolled from 8 centers and followed until development of HCC, death, or December 31, 2021. We identified an optimal set of predictors with the highest discriminatory ability (C-index) for HCC. The predictors were refit using competing risk regression and its predictive performance was evaluated using the area under the receiver-operating characteristic curve (AUROC). External validation was performed in a cohort of 21,550 patients with cirrhosis seen in the U.S Veterans Affairs system between 2018 and 2019 with follow-up through 2021. RESULTS We developed the model in 2431 patients (mean age 60 years, 31% women, 24% cured hepatitis C, 16% alcoholic liver disease, and 29% nonalcoholic fatty liver disease). The selected model had a C-index of 0.77 (95% confidence interval [CI], 0.73-0.81), and the predictors were age, sex, smoking, alcohol use, body mass index, etiology, α-fetoprotein, albumin, alanine aminotransferase, and platelet levels. The AUROCs were 0.75 (95% CI, 0.65-0.85) at 1 year and 0.77 (95% CI, 0.71-0.83) at 2 years, and the model was well calibrated. In the external validation cohort, the AUROC at 2 years was 0.70 with excellent calibration. CONCLUSION The risk index, including objective and routinely available risk factors, can differentiate patients with cirrhosis who will develop HCC and help guide discussions regarding HCC surveillance and prevention. Future studies are needed for additional external validation and refinement of risk stratification.
Collapse
Affiliation(s)
- Fasiha Kanwal
- Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas; Section of Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas; VA HSR&D Center for Innovatio ns in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas.
| | - Saira Khaderi
- Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas
| | - Amit G Singal
- Division of Digestive and Liver Diseases, Department of Medicine, UT Southwestern Medical Center, Dallas, Texas
| | - Jorge A Marrero
- Division of Digestive and Liver Diseases, Department of Medicine, UT Southwestern Medical Center, Dallas, Texas
| | - Sumeet K Asrani
- Division of Gastroenterology, Department of Medicine, Baylor University Medical Center, Dallas, Texas
| | - Christopher I Amos
- Section of Epidemiology, Department of Medicine, Baylor College of Medicine, Houston, Texas
| | - Aaron P Thrift
- Section of Epidemiology, Department of Medicine, Baylor College of Medicine, Houston, Texas
| | - Jennifer R Kramer
- Section of Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas; VA HSR&D Center for Innovatio ns in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas
| | - Xian Yu
- Section of Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas; VA HSR&D Center for Innovatio ns in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas
| | - Yumei Cao
- Section of Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas; VA HSR&D Center for Innovatio ns in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas
| | - Michelle Luster
- Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas
| | - Abeer Al-Sarraj
- Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas; Section of Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas; VA HSR&D Center for Innovatio ns in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas
| | - Jing Ning
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Hashem B El-Serag
- Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas; Section of Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas; VA HSR&D Center for Innovatio ns in Quality, Effectiveness, and Safety, Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas.
| |
Collapse
|
198
|
Solomon J, Bender S, Durgempudi P, Robar C, Cocchiaro M, Turner S, Watson C, Healy J, Spake A, Szlosek D. Diagnostic validation of vertebral heart score machine learning algorithm for canine lateral chest radiographs. J Small Anim Pract 2023; 64:769-775. [PMID: 37622992 DOI: 10.1111/jsap.13666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 04/26/2023] [Accepted: 07/12/2023] [Indexed: 08/26/2023]
Abstract
OBJECTIVES The vertebral heart score is a measurement used to index heart size relative to thoracic vertebra. Vertebral heart score can be a useful tool for identifying and staging heart disease and providing prognostic information. The purpose of this study is to validate the use of a vertebral heart score algorithm compared to manual vertebral heart scoring by three board-certified veterinary cardiologists. MATERIALS AND METHODS A convolutional neural network centred around semantic segmentation of relevant anatomical features was developed to predict heart size and vertebral bodies. These predictions were used to calculate the vertebral heart score. An external validation study consisting of 1200 canine lateral radiographs was randomly selected to match the underlying distribution of vertebral heart scores. Three American College of Veterinary Internal Medicine board-certified cardiologists were enrolled to manually score 400 images each using the traditional Buchanan method. Post-scoring, the cardiologists evaluated the algorithm for misaligned anatomic landmarks and overall image quality. RESULTS The 95th percentile absolute difference between the cardiologist vertebral heart score and the algorithm vertebral heart score was 1.05 vertebrae (95% confidence interval: 0.97 to 1.20 vertebrae) with a mean bias of -0.09 vertebrae (95% confidence interval: -0.12 to -0.05 vertebrae). In addition, the model was observed to be well calibrated across the predictive range. CLINICAL SIGNIFICANCE We have found the performance of the vertebral heart score algorithm comparable to three board-certified cardiologists. While validation of this vertebral heart score algorithm has shown strong performance compared to veterinarians, further external validation in other clinical settings is warranted before use in those settings.
Collapse
Affiliation(s)
- J Solomon
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - S Bender
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | | | - C Robar
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - M Cocchiaro
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - S Turner
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - C Watson
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - J Healy
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - A Spake
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| | - D Szlosek
- IDEXX Laboratories, Inc., Westbrook, ME, USA
| |
Collapse
|
199
|
Teeple S, Smith A, Toerper M, Levin S, Halpern S, Badaki-Makun O, Hinson J. Exploring the impact of missingness on racial disparities in predictive performance of a machine learning model for emergency department triage. JAMIA Open 2023; 6:ooad107. [PMID: 38638298 PMCID: PMC11025382 DOI: 10.1093/jamiaopen/ooad107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 11/15/2023] [Accepted: 12/06/2023] [Indexed: 04/20/2024] Open
Abstract
Objective To investigate how missing data in the patient problem list may impact racial disparities in the predictive performance of a machine learning (ML) model for emergency department (ED) triage. Materials and Methods Racial disparities may exist in the missingness of EHR data (eg, systematic differences in access, testing, and/or treatment) that can impact model predictions across racialized patient groups. We use an ML model that predicts patients' risk for adverse events to produce triage-level recommendations, patterned after a clinical decision support tool deployed at multiple EDs. We compared the model's predictive performance on sets of observed (problem list data at the point of triage) versus manipulated (updated to the more complete problem list at the end of the encounter) test data. These differences were compared between Black and non-Hispanic White patient groups using multiple performance measures relevant to health equity. Results There were modest, but significant, changes in predictive performance comparing the observed to manipulated models across both Black and non-Hispanic White patient groups; c-statistic improvement ranged between 0.027 and 0.058. The manipulation produced no between-group differences in c-statistic by race. However, there were small between-group differences in other performance measures, with greater change for non-Hispanic White patients. Discussion Problem list missingness impacted model performance for both patient groups, with marginal differences detected by race. Conclusion Further exploration is needed to examine how missingness may contribute to racial disparities in clinical model predictions across settings. The novel manipulation method demonstrated may aid future research.
Collapse
Affiliation(s)
- Stephanie Teeple
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19143, United States
- Palliative and Advanced Illness Research (PAIR) Center, Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, United States
| | - Aria Smith
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD 21218, United States
- Clinical Decision Support Solutions, Beckman Coulter, Brea, CA 92821, United States
| | - Matthew Toerper
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD 21218, United States
- Clinical Decision Support Solutions, Beckman Coulter, Brea, CA 92821, United States
| | - Scott Levin
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD 21218, United States
- Clinical Decision Support Solutions, Beckman Coulter, Brea, CA 92821, United States
| | - Scott Halpern
- Palliative and Advanced Illness Research (PAIR) Center, Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, United States
- Division of Pulmonary, Allergy and Critical Care, Department of Medicine at the Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, United States
| | - Oluwakemi Badaki-Makun
- Department of Pediatric Emergency Medicine, Johns Hopkins University, Baltimore, MD 21218, United States
| | - Jeremiah Hinson
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD 21218, United States
| |
Collapse
|
200
|
Lee T, Hwang EJ, Park CM, Goo JM. Deep Learning-Based Computer-Aided Detection System for Preoperative Chest Radiographs to Predict Postoperative Pneumonia. Acad Radiol 2023; 30:2844-2855. [PMID: 36931951 DOI: 10.1016/j.acra.2023.02.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 02/10/2023] [Accepted: 02/17/2023] [Indexed: 03/18/2023]
Abstract
RATIONALE AND OBJECTIVES The role of preoperative chest radiography (CR) for prediction of postoperative pneumonia remains uncertain. We aimed to develop and validate a prediction model for postoperative pneumonia incorporating findings of preoperative CRs evaluated by a deep learning-based computer-aided detection (DL-CAD) system MATERIALS AND METHODS: This retrospective study included consecutive patients who underwent surgery between January 2019 and March 2020 and divided into development (surgery in 2019) and validation (surgery between January and March 2020) cohorts. Preoperative CRs obtained within 1-month before surgery were analyzed with a commercialized DL-CAD that provided probability values for the presence of 10 different abnormalities in CRs. Logistic regression models to predict postoperative pneumonia were built using clinical variables (clinical model), and both clinical variables and DL-CAD results for preoperative CRs (DL-CAD model). The discriminative performances of the models were evaluated by area under the receiver operating characteristic curves. RESULTS In development cohort (n = 19,349; mean age, 57 years; 11,392 men), DL-CAD results for pulmonary nodules (odds ratio [OR, for 1% increase in probability value], 1.007; p = 0.021), consolidation (OR, 1.019; p < 0.001), and cardiomegaly (OR, 1.013; p < 0.001) were independent predictors of postoperative pneumonia and were included in the DL-CAD model. In validation cohort (n = 4957; mean age, 56 years; 2848 men), the DL-CAD model exhibited a higher AUROC than the clinical model (0.843 vs. 0.815; p = 0.012). CONCLUSION Abnormalities in preoperative CRs evaluated by a DL-CAD were independent risk factors for postoperative pneumonia. Using DL-CAD results for preoperative CRs led to an improved prediction of postoperative pneumonia.
Collapse
Affiliation(s)
- Taehee Lee
- Department of Radiology, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (T.L., E.J.H., C.M.P., J.M.G.)
| | - Eui Jin Hwang
- Department of Radiology, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (T.L., E.J.H., C.M.P., J.M.G.); Department of Radiology, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (E.J.H., C.M.P., J.M.G.).
| | - Chang Min Park
- Department of Radiology, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (T.L., E.J.H., C.M.P., J.M.G.); Department of Radiology, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (E.J.H., C.M.P., J.M.G.)
| | - Jin Mo Goo
- Department of Radiology, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (T.L., E.J.H., C.M.P., J.M.G.); Department of Radiology, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul 03080, Republic of Korea (E.J.H., C.M.P., J.M.G.)
| |
Collapse
|