1
|
Ban JW, Abel L, Stevens R, Perera R. Research inefficiencies in external validation studies of the Framingham Wilson coronary heart disease risk rule: A systematic review. PLoS One 2024; 19:e0310321. [PMID: 39269949 DOI: 10.1371/journal.pone.0310321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 08/28/2024] [Indexed: 09/15/2024] Open
Abstract
BACKGROUND External validation studies create evidence about a clinical prediction rule's (CPR's) generalizability by evaluating and updating the CPR in populations different from those used in the derivation, and also by contributing to estimating its overall performance when meta-analysed in a systematic review. While most cardiovascular CPRs do not have any external validation, some CPRs have been externally validated repeatedly. Hence, we examined whether external validation studies of the Framingham Wilson coronary heart disease (CHD) risk rule contributed to generating evidence to their full potential. METHODS A forward citation search of the Framingham Wilson CHD risk rule's derivation study was conducted to identify studies that evaluated the Framingham Wilson CHD risk rule in different populations. For external validation studies of the Framingham Wilson CHD risk rule, we examined whether authors updated the Framingham Wilson CHD risk rule when it performed poorly. We also assessed the contribution of external validation studies to understanding the Predicted/Observed (P/O) event ratio and c statistic of the Framingham Wilson CHD risk rule. RESULTS We identified 98 studies that evaluated the Framingham Wilson CHD risk rule; 40 of which were external validation studies. Of these 40 studies, 27 (67.5%) concluded the Framingham Wilson CHD risk rule performed poorly but did not update it. Of 23 external validation studies conducted with data that could be included in meta-analyses, 13 (56.5%) could not fully contribute to the meta-analyses of P/O ratio and/or c statistic because these performance measures were neither reported nor could be calculated from provided data. DISCUSSION Most external validation studies failed to generate evidence about the Framingham Wilson CHD risk rule's generalizability to their full potential. Researchers might increase the value of external validation studies by presenting all relevant performance measures and by updating the CPR when it performs poorly.
Collapse
Affiliation(s)
- Jong-Wook Ban
- Centre for Evidence-Based Medicine, University of Oxford, Oxford, United Kingdom
- Department for Continuing Education, University of Oxford, Oxford, United Kingdom
| | - Lucy Abel
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, United Kingdom
| | - Richard Stevens
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, United Kingdom
| | - Rafael Perera
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
2
|
Paraskevas KI, Saba L, Papaioannou V, Suri J. Artificial Intelligence in Cardiovascular Diseases and Vascular Surgery. Angiology 2024:33197241273410. [PMID: 39126672 DOI: 10.1177/00033197241273410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2024]
Affiliation(s)
| | - Luca Saba
- Department of Radiology, Azienda Ospedaliero Universitaria (A.O.U.) di Cagliari, Cagliari, Italy
| | | | - Jasjit Suri
- Stroke Diagnostic and Monitoring Division, AtheropointTM, Roseville, CA, USA
- Department of Computer Engineering, Graphic Era Deemed to Be University Dehradun, India
- Department of Electrical and Computer Engineering, Idaho State University, Pocatello, ID, USA
- Symbiosis Institute of Technology, Nagpur Campus, Symbiosis International (Deemed University), Pune, India
| |
Collapse
|
3
|
Dippel DW, van Klaveren D. The Dilemma of Incomplete Reperfusion After Thrombectomy for Ischemic Stroke: Proceed With Caution. Neurology 2024; 103:e209646. [PMID: 38896811 DOI: 10.1212/wnl.0000000000209646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2024] Open
Affiliation(s)
- Diederik W Dippel
- From the Departments of Neuroloy and Public Health, Erasmus MC, University Medical Center, Rotterdam, the Netherlands
| | - David van Klaveren
- From the Departments of Neuroloy and Public Health, Erasmus MC, University Medical Center, Rotterdam, the Netherlands
| |
Collapse
|
4
|
Lee TY, Price D, Yadav CP, Roy R, Lim LHM, Wang E, Wechsler ME, Jackson DJ, Busby J, Heaney LG, Pfeffer PE, Mahboub B, Perng Steve DW, Cosio BG, Perez-de-Llano L, Al-Lehebi R, Larenas-Linnemann D, Al-Ahmad M, Rhee CK, Iwanaga T, Heffler E, Canonica GW, Costello R, Papadopoulos NG, Papaioannou AI, Porsbjerg CM, Torres-Duque CA, Christoff GC, Popov TA, Hew M, Peters M, Gibson PG, Maspero J, Bergeron C, Cerda S, Contreras-Contreras EA, Chen W, Sadatsafavi M. International Variation in Severe Exacerbation Rates in Patients With Severe Asthma. Chest 2024; 166:28-38. [PMID: 38395297 DOI: 10.1016/j.chest.2024.02.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 12/07/2023] [Accepted: 02/19/2024] [Indexed: 02/25/2024] Open
Abstract
BACKGROUND Exacerbation frequency strongly influences treatment choices in patients with severe asthma. RESEARCH QUESTION What is the extent of the variability of exacerbation rate across countries and its implications in disease management? STUDY DESIGN AND METHODS We retrieved data from the International Severe Asthma Registry, an international observational cohort of patients with a clinical diagnosis of severe asthma. We identified patients aged ≥ 18 years who did not initiate any biologics prior to baseline visit. A severe exacerbation was defined as the use of oral corticosteroids for ≥ 3 days or asthma-related hospitalization/ED visit. A series of negative binomial models were applied to estimate country-specific severe exacerbation rates during 365 days of follow-up, starting from a naive model with country as the only variable to an adjusted model with country as a random-effect term and patient and disease characteristics as independent variables. RESULTS The final sample included 7,510 patients from 17 countries (56% from the United States), contributing to 1,939 severe exacerbations (0.27/person-year). There was large between-country variation in observed severe exacerbation rate (minimum, 0.04 [Argentina]; maximum, 0.88 [Saudi Arabia]; interquartile range, 0.13-0.54), which remained substantial after adjusting for patient characteristics and sampling variability (interquartile range, 0.16-0.39). INTERPRETATION Individuals with similar patient characteristics but coming from different jurisdictions have varied severe exacerbation risks, even after controlling for patient and disease characteristics. This suggests unknown patient factors or system-level variations at play. Disease management guidelines should recognize such between-country variability. Risk prediction models that are calibrated for each jurisdiction will be needed to optimize treatment strategies.
Collapse
Affiliation(s)
- Tae Yoon Lee
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore; Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, University of British Columbia, Canada
| | - David Price
- Optimum Patient Care Global, Cambridge, England; Observational and Pragmatic Research Institute, Singapore, Singapore; Centre of Academic Primary Care, Division of Applied Health Sciences, University of Aberdeen, Aberdeen, Scotland
| | | | - Rupsa Roy
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore
| | - Laura Huey Mien Lim
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore
| | - Eileen Wang
- Division of Allergy & Clinical Immunology, Department of Medicine, National Jewish Health, Denver, CO; Division of Allergy & Clinical Immunology, Department of Medicine, University of Colorado School of Medicine, Aurora, CO
| | - Michael E Wechsler
- NJH Cohen Family Asthma Institute, Department of Medicine, National Jewish Health, Denver, CO
| | - David J Jackson
- UK Severe Asthma Network and National Registry, Guy's and St Thomas' NHS Trust, London, England; School of Immunology & Microbial Sciences, King's College London, London, England
| | - John Busby
- Centre for Public Health, Queen's University Belfast, Belfast, Northern Ireland
| | - Liam G Heaney
- Wellcome-Wolfson Centre for Experimental Medicine, Queen's University Belfast, Belfast, Northern Ireland
| | - Paul E Pfeffer
- Department of Respiratory Medicine, Barts Health NHS Trust, London, England; Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, England
| | - Bassam Mahboub
- College of Medicine, University of Sharjah, Sharjah, United Arab Emirates; Rashid Hospital, Dubai Health Authority, Dubai, United Arab Emirates
| | - Diahn-Warng Perng Steve
- Division of Clinical Respiratory, Physiology Chest Department, Taipei Veterans General Hospital, Taipei City, Taiwan; COPD Assembly of the Asian Pacific Society of Respirology, Tokyo, Japan
| | - Borja G Cosio
- Son Espases University Hospital-IdISBa-Ciberes, Mallorca, Spain
| | - Luis Perez-de-Llano
- Pneumology Service, Lucus Augusti University Hospital, EOXI Lugo, Monforte, Cervo, Spain; Biodiscovery Research Group, Health Research Institute of Santiago de Compostela, Spain
| | - Riyad Al-Lehebi
- Department of Pulmonology, King Fahad Medical City, Riyadh, Saudi Arabia; College of Medicine, Alfaisal University, Riyadh, Saudi Arabia
| | | | - Mona Al-Ahmad
- Microbiology Department, Faculty of Medicine, Kuwait University, Al-Rashed Allergy Center, Ministry of Health, Kuwait
| | - Chin Kook Rhee
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, South Korea
| | - Takashi Iwanaga
- Center for General Medical Education and Clinical Training, Kindai University Hospital, Osakasayama, Japan
| | - Enrico Heffler
- Personalized Medicine, Asthma and Allergy, Humanitas Clinical and Research Center IRCCS, Rozzano, Milan, Italy; Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Giorgio Walter Canonica
- Personalized Medicine, Asthma and Allergy, Humanitas Clinical and Research Center IRCCS, Rozzano, Milan, Italy; Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Richard Costello
- Clinical Research Centre, Smurfit Building Beaumont Hospital, Department of Respiratory Medicine, RCSI, Dublin, Ireland
| | - Nikolaos G Papadopoulos
- Division of Infection, Immunity & Respiratory Medicine, University of Manchester, Manchester, England; Allergy Department, 2nd Pediatric Clinic, University of Athens, Athens, Greece
| | - Andriana I Papaioannou
- 2nd Respiratory Medicine Department, National and Kapodistrian University of Athens Medical School, Attikon University Hospital, Athens, Greece
| | - Celeste M Porsbjerg
- Respiratory Research Unit, Bispebjerg University Hospital, Copenhagen, Denmark
| | - Carlos A Torres-Duque
- CINEUMO, Respiratory Research Center, Fundación Neumológica Colombiana, Bogotá, Colombia
| | | | - Todor A Popov
- Clinic of Occupational Diseases, University Hospital "Sv. Ivan Rilski", Sofia, Bulgaria
| | - Mark Hew
- Allergy, Asthma & Clinical Immunology Service, Alfred Health, Melbourne, Australia; Public Health and Preventive Medicine, Monash University, Melbourne, Australia
| | - Matthew Peters
- Department of Thoracic Medicine, Concord Hospital, Sydney, Australia
| | - Peter G Gibson
- Australian Severe Asthma Network, Priority Research Centre for Healthy Lungs, University of Newcastle, Newcastle, Australia; Hunter Medical Research Institute, Department of Respiratory and Sleep Medicine, John Hunter Hospital, New Lambton Heights, Australia
| | - Jorge Maspero
- Clinical Research for Allergy and Respiratory Medicine, CIDEA Foundation, Buenos Aires, Argentina; University Career of Specialists in Allergy and Clinical Immunology, Buenos Aires University School of Medicine, Buenos Aires, Argentina
| | - Celine Bergeron
- Centre for Lung Health, Vancouver General Hospital, University of British Columbia, Vancouver, BC, Canada
| | - Saraid Cerda
- Medical Specialties Unit, Secretary of National Defense, Mexico City, Mexico
| | - Elvia Angelica Contreras-Contreras
- Mexican Council of Clinical Immunology and Allergy, Mexico City Office, Mexico City, Mexico; Department of Allergy and Clinical Immunology, Lic. Adolfo López Mateos Regional Hospital of the Institute of Security and Social Services for State Workers (ISSSTE), Mexico City, Mexico
| | - Wenjia Chen
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore.
| | - Mohsen Sadatsafavi
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, University of British Columbia, Canada
| |
Collapse
|
5
|
Hardenberg JHB. [Data-driven intensive care: a lack of comprehensive datasets]. Med Klin Intensivmed Notfmed 2024; 119:352-357. [PMID: 38668882 DOI: 10.1007/s00063-024-01141-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 03/26/2024] [Indexed: 05/28/2024]
Abstract
Intensive care units provide a data-rich environment with the potential to generate datasets in the realm of big data, which could be utilized to train powerful machine learning (ML) models. However, the currently available datasets are too small and exhibit too little diversity due to their limitation to individual hospitals. This lack of extensive and varied datasets is a primary reason for the limited generalizability and resulting low clinical utility of current ML models. Often, these models are based on data from single centers and suffer from poor external validity. There is an urgent need for the development of large-scale, multicentric, and multinational datasets. Ensuring data protection and minimizing re-identification risks pose central challenges in this process. The "Amsterdam University Medical Center database (AmsterdamUMCdb)" and the "Salzburg Intensive Care database (SICdb)" demonstrate that open access datasets are possible in Europe while complying with the data protection regulations of the General Data Protection Regulation (GDPR). Another challenge in building intensive care datasets is the absence of semantic definitions in the source data and the heterogeneity of data formats. Establishing binding industry standards for the semantic definition is crucial to ensure seamless semantic interoperability between datasets.
Collapse
Affiliation(s)
- Jan-Hendrik B Hardenberg
- Medizinische Klinik mit Schwerpunkt Nephrologie und internistische Intensivmedizin, Charité - Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Deutschland.
| |
Collapse
|
6
|
Haimovich AD, Burke RC, Nathanson LA, Rubins D, Taylor RA, Kross EK, Ouchi K, Shapiro NI, Schonberg MA. Geriatric End-of-Life Screening Tool Prediction of 6-Month Mortality in Older Patients. JAMA Netw Open 2024; 7:e2414213. [PMID: 38819823 PMCID: PMC11143461 DOI: 10.1001/jamanetworkopen.2024.14213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 03/31/2024] [Indexed: 06/01/2024] Open
Abstract
Importance Emergency department (ED) visits by older adults with life-limiting illnesses are a critical opportunity to establish patient care end-of-life preferences, but little is known about the optimal screening criteria for resource-constrained EDs. Objectives To externally validate the Geriatric End-of-Life Screening Tool (GEST) in an independent population and compare it with commonly used serious illness diagnostic criteria. Design, Setting, and Participants This prognostic study assessed a cohort of patients aged 65 years and older who were treated in a tertiary care ED in Boston, Massachusetts, from 2017 to 2021. Patients arriving in cardiac arrest or who died within 1 day of ED arrival were excluded. Data analysis was performed from August 1, 2023, to March 27, 2024. Exposure GEST, a logistic regression algorithm that uses commonly available electronic health record (EHR) datapoints and was developed and validated across 9 EDs, was compared with serious illness diagnoses as documented in the EHR. Serious illnesses included stroke/transient ischemic attack, liver disease, cancer, lung disease, and age greater than 80 years, among others. Main Outcomes and Measures The primary outcome was 6-month mortality following an ED encounter. Statistical analyses included area under the receiver operating characteristic curve, calibration analyses, Kaplan-Meier survival curves, and decision curves. Results This external validation included 82 371 ED encounters by 40 505 unique individuals (mean [SD] age, 76.8 [8.4] years; 54.3% women, 13.8% 6-month mortality rate). GEST had an external validation area under the receiver operating characteristic curve of 0.79 (95% CI, 0.78-0.79) that was stable across years and demographic subgroups. Of included encounters, 53.4% had a serious illness, with a sensitivity of 77.4% (95% CI, 76.6%-78.2%) and specificity of 50.5% (95% CI, 50.1%-50.8%). Varying GEST cutoffs from 5% to 30% increased specificity (5%: 49.1% [95% CI, 48.7%-49.5%]; 30%: 92.2% [95% CI, 92.0%-92.4%]) at the cost of sensitivity (5%: 89.3% [95% CI, 88.8-89.9]; 30%: 36.2% [95% CI, 35.3-37.1]). In a decision curve analysis, GEST outperformed serious illness criteria across all tested thresholds. When comparing patients referred to intervention by GEST with serious illness criteria, GEST reclassified 45.1% of patients with serious illness as having low risk of mortality with an observed mortality rate 8.1% and 2.6% of patients without serious illness as having high mortality risk with an observed mortality rate of 34.3% for a total reclassification rate of 25.3%. Conclusions and Relevance The findings of this study suggest that both serious illness criteria and GEST identified older ED patients at risk for 6-month mortality, but GEST offered more useful screening characteristics. Future trials of serious illness interventions for high mortality risk in older adults may consider transitioning from diagnosis code criteria to GEST, an automatable EHR-based algorithm.
Collapse
Affiliation(s)
- Adrian D. Haimovich
- Department of Emergency Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| | - Ryan C. Burke
- Department of Emergency Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| | - Larry A. Nathanson
- Department of Emergency Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| | - David Rubins
- Department of Internal Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| | - R. Andrew Taylor
- Department of Emergency Medicine, Yale School of Medicine, New Haven, Connecticut
| | - Erin K. Kross
- Division of Pulmonary, Critical Care, and Sleep Medicine, University of Washington, Seattle
- Cambia Palliative Care Center of Excellence at UW Medicine, Seattle, Washington
| | - Kei Ouchi
- Department of Emergency Medicine, Brigham and Women’s Hospital, Boston, Massachusetts
| | - Nathan I. Shapiro
- Department of Emergency Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| | - Mara A. Schonberg
- Department of Internal Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
| |
Collapse
|
7
|
Grebe TA, Khushf G, Greally JM, Turley P, Foyouzi N, Rabin-Havt S, Berkman BE, Pope K, Vatta M, Kaur S. Clinical utility of polygenic risk scores for embryo selection: A points to consider statement of the American College of Medical Genetics and Genomics (ACMG). Genet Med 2024; 26:101052. [PMID: 38393332 DOI: 10.1016/j.gim.2023.101052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 12/12/2023] [Indexed: 02/25/2024] Open
Affiliation(s)
- Theresa A Grebe
- Phoenix Children's, Phoenix, AZ; Department of Child Health, University of Arizona College of Medicine-Phoenix, Phoenix, AZ
| | - George Khushf
- Department of Philosophy, University of South Carolina, Columbia, SC
| | - John M Greally
- Departments of Genetics and Pediatrics, Albert Einstein College of Medicine, Bronx, NY
| | - Patrick Turley
- Center for Economic and Social Research, University of Southern California, Los Angeles, CA; Department of Economics, University of Southern California, Los Angeles, CA
| | | | - Sara Rabin-Havt
- Department of OB/GYN, Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, NY
| | - Benjamin E Berkman
- Department of Bioethics, National Institutes of Health; National Human Genome Research Institute, Bethesda, MD
| | - Kathleen Pope
- Department of Pediatrics, Nemours Children's Hospital, Orlando, FL; University of South Florida College of Public Health, Tampa, FL
| | | | - Shagun Kaur
- Phoenix Children's, Phoenix, AZ; Department of Child Health, University of Arizona College of Medicine-Phoenix, Phoenix, AZ
| |
Collapse
|
8
|
Lou SS, Liu Y, Cohen ME, Ko CY, Hall BL, Kannampallil T. National Multi-Institutional Validation of a Surgical Transfusion Risk Prediction Model. J Am Coll Surg 2024; 238:99-105. [PMID: 37737660 DOI: 10.1097/xcs.0000000000000874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/23/2023]
Abstract
BACKGROUND Accurate estimation of surgical transfusion risk is important for many aspects of surgical planning, yet few methods for estimating are available for estimating such risk. There is a need for reliable validated methods for transfusion risk stratification to support effective perioperative planning and resource stewardship. STUDY DESIGN This study was conducted using the American College of Surgeons NSQIP datafile from 2019. S-PATH performance was evaluated at each contributing hospital, with and without hospital-specific model tuning. Linear regression was used to assess the relationship between hospital characteristics and area under the receiver operating characteristic (AUROC) curve. RESULTS A total of 1,000,927 surgical cases from 414 hospitals were evaluated. Aggregate AUROC was 0.910 (95% CI 0.904 to 0.916) without model tuning and 0.925 (95% CI 0.919 to 0.931) with model tuning. AUROC varied across individual hospitals (median 0.900, interquartile range 0.849 to 0.944), but no statistically significant relationships were found between hospital-level characteristics studied and model AUROC. CONCLUSIONS S-PATH demonstrated excellent discriminative performance, although there was variation across hospitals that was not well-explained by hospital-level characteristics. These results highlight the S-PATH's viability as a generalizable surgical transfusion risk prediction tool.
Collapse
Affiliation(s)
- Sunny S Lou
- From the Department of Anesthesiology, Washington University School of Medicine, St Louis, MO (Lou, Kannampallil)
| | - Yaoming Liu
- Division of Research and Optimal Patient Care, American College of Surgeons, Chicago, IL (Liu, Ko, Hall, Cohen)
| | - Mark E Cohen
- Division of Research and Optimal Patient Care, American College of Surgeons, Chicago, IL (Liu, Ko, Hall, Cohen)
| | - Clifford Y Ko
- Division of Research and Optimal Patient Care, American College of Surgeons, Chicago, IL (Liu, Ko, Hall, Cohen)
- Department of Surgery, David Geffen School of Medicine, University of California Los Angeles, and the VA Greater Los Angeles Health System, Los Angeles, CA (Ko)
| | - Bruce L Hall
- Division of Research and Optimal Patient Care, American College of Surgeons, Chicago, IL (Liu, Ko, Hall, Cohen)
- Department of Surgery, Washington University School of Medicine; Center for Health Policy and the Olin Business School at Washington University in St Louis; John Cochran Veterans Affairs Medical Center; and BJC Healthcare, St Louis, MO (Hall)
| | - Thomas Kannampallil
- From the Department of Anesthesiology, Washington University School of Medicine, St Louis, MO (Lou, Kannampallil)
| |
Collapse
|
9
|
van Royen FS, Asselbergs FW, Alfonso F, Vardas P, van Smeden M. Five critical quality criteria for artificial intelligence-based prediction models. Eur Heart J 2023; 44:4831-4834. [PMID: 37897346 DOI: 10.1093/eurheartj/ehad727] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/30/2023] Open
Abstract
To raise the quality of clinical artificial intelligence (AI) prediction modelling studies in the cardiovascular health domain and thereby improve their impact and relevancy, the editors for digital health, innovation, and quality standards of the European Heart Journal propose five minimal quality criteria for AI-based prediction model development and validation studies: complete reporting, carefully defined intended use of the model, rigorous validation, large enough sample size, and openness of code and software.
Collapse
Affiliation(s)
- Florien S van Royen
- Department of General Practice & Nursing Science, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Folkert W Asselbergs
- Department of Cardiology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
- Health Data Research UK and Institute of Health Informatics, University College London, London, UK
| | - Fernando Alfonso
- Department of Cardiology, Hospital Universitario de la Princesa, Universidad Autónoma de Madrid, IIS-IP. CIVER-CV, Madrid, Spain
| | - Panos Vardas
- Biomedical Research Foundation Academy of Athens (BRFAA) and Hygeia Hospitals Group, Athens, Greece
| | - Maarten van Smeden
- Department of Epidemiology & Health Economics, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG Utrecht, Netherlands
- Department of Data Science & Biostatistics, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG Utrecht, The Netherlands
| |
Collapse
|
10
|
Kurlansky PA, Bittl JA. Learning From Machines to Predict Mortality After Surgical or Percutaneous Revascularization. J Am Coll Cardiol 2023; 82:2125-2127. [PMID: 37993204 DOI: 10.1016/j.jacc.2023.10.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 10/06/2023] [Indexed: 11/24/2023]
Affiliation(s)
- Paul A Kurlansky
- Division of Cardiothoracic and Vascular Surgery, Columbia University Irving Medical Center, New York, New York, USA.
| | - John A Bittl
- Scientific Publications Committee, American College of Cardiology, Washington, DC, USA
| |
Collapse
|
11
|
Stam WT, Ingwersen EW, Ali M, Spijkerman JT, Kazemier G, Bruns ERJ, Daams F. Machine learning models in clinical practice for the prediction of postoperative complications after major abdominal surgery. Surg Today 2023; 53:1209-1215. [PMID: 36840764 PMCID: PMC10520164 DOI: 10.1007/s00595-023-02662-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 02/07/2023] [Indexed: 02/26/2023]
Abstract
Complications after surgery have a major impact on short- and long-term outcomes, and decades of technological advancement have not yet led to the eradication of their risk. The accurate prediction of complications, recently enhanced by the development of machine learning algorithms, has the potential to completely reshape surgical patient management. In this paper, we reflect on multiple issues facing the implementation of machine learning, from the development to the actual implementation of machine learning models in daily clinical practice, providing suggestions on the use of machine learning models for predicting postoperative complications after major abdominal surgery.
Collapse
Affiliation(s)
- Wessel T Stam
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
- AGEM Amsterdam Gastroenterology, Endocrinology and Metabolism, Amsterdam, The Netherlands
| | - Erik W Ingwersen
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
- AGEM Amsterdam Gastroenterology, Endocrinology and Metabolism, Amsterdam, The Netherlands
| | - Mahsoem Ali
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
| | - Jorik T Spijkerman
- Independent Consultant in Computational Intelligence, Amsterdam, The Netherlands
| | - Geert Kazemier
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
| | - Emma R J Bruns
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands
| | - Freek Daams
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands.
- Cancer Center Amsterdam, Cancer Treatment and Quality of Life, Amsterdam, The Netherlands.
| |
Collapse
|
12
|
Amegadzie JE, Sadatsafavi M. A long overdue recognition: COPD as a distinct predictor of cardiovascular disease risk. Eur Respir J 2023; 62:2301167. [PMID: 37652564 DOI: 10.1183/13993003.01167-2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 07/30/2023] [Indexed: 09/02/2023]
Affiliation(s)
- Joseph Emil Amegadzie
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Mohsen Sadatsafavi
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
13
|
de Kok JWTM, de la Hoz MÁA, de Jong Y, Brokke V, Elbers PWG, Thoral P, Castillejo A, Trenor T, Castellano JM, Bronchalo AE, Merz TM, Faltys M, van der Horst ICC, Xu M, Celi LA, van Bussel BCT, Borrat X. A guide to sharing open healthcare data under the General Data Protection Regulation. Sci Data 2023; 10:404. [PMID: 37355751 PMCID: PMC10290652 DOI: 10.1038/s41597-023-02256-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 05/17/2023] [Indexed: 06/26/2023] Open
Abstract
Sharing healthcare data is increasingly essential for developing data-driven improvements in patient care at the Intensive Care Unit (ICU). However, it is also very challenging under the strict privacy legislation of the European Union (EU). Therefore, we explored four successful open ICU healthcare databases to determine how open healthcare data can be shared appropriately in the EU. A questionnaire was constructed based on the Delphi method. Then, follow-up questions were discussed with experts from the four databases. These experts encountered similar challenges and regarded ethical and legal aspects to be the most challenging. Based on the approaches of the databases, expert opinion, and literature research, we outline four distinct approaches to openly sharing healthcare data, each with varying implications regarding data security, ease of use, sustainability, and implementability. Ultimately, we formulate seven recommendations for sharing open healthcare data to guide future initiatives in sharing open healthcare data to improve patient care and advance healthcare.
Collapse
Affiliation(s)
- Jip W T M de Kok
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, the Netherlands
- Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, the Netherlands
| | - Miguel Á Armengol de la Hoz
- Big Data Department, PMC, Fundacion Progreso y Salud (FPS), Regional Ministry of Health of Andalucia, Seville, Andalucia, Spain
| | | | | | - Paul W G Elbers
- Center for Critical Care Computational Intelligence (C4I), Department of Intensive Care Medicine, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands
| | - Patrick Thoral
- Center for Critical Care Computational Intelligence (C4I), Department of Intensive Care Medicine, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands
| | | | | | - Jose M Castellano
- Fundación de Investigación HM Hospitales, Grupo HM Hospitales, Madrid, Spain
| | - Alberto E Bronchalo
- Fundación de Investigación HM Hospitales, Grupo HM Hospitales, Madrid, Spain
| | - Tobias M Merz
- Cardiovascular Intensive Care Unit, Auckland City Hospital, Auckland, New Zealand
| | - Martin Faltys
- Department of Intensive Care Medicine, University Hospital, University of Bern, Bern, Switzerland
| | - Iwan C C van der Horst
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, the Netherlands
- Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, the Netherlands
| | - Minnan Xu
- Philips Research North America, Cambridge, MA, USA
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Harvard-MIT Division of Health Sciences & Technology, Cambridge, Massachusetts, USA
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Department of Biostatistics Harvard T.H, Chan School of Public Health, Boston, Massachusetts, USA
| | - Bas C T van Bussel
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, the Netherlands.
- Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, the Netherlands.
- Care and Public Health Research Institute (CAPHRI), Maastricht University, Maastricht, the Netherlands.
| | - Xavier Borrat
- Department of Biostatistics Harvard T.H, Chan School of Public Health, Boston, Massachusetts, USA.
- Anaesthesiology and Critical Care Department, Hospital Clinic de Barcelona, Barcelona, Spain.
- Medical Informatics Department, Hospital Clinic de Barcelona, Barcelona, Spain.
| |
Collapse
|
14
|
de Hond AAH, Shah VB, Kant IMJ, Van Calster B, Steyerberg EW, Hernandez-Boussard T. Perspectives on validation of clinical predictive algorithms. NPJ Digit Med 2023; 6:86. [PMID: 37149704 PMCID: PMC10163568 DOI: 10.1038/s41746-023-00832-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Accepted: 04/28/2023] [Indexed: 05/08/2023] Open
Affiliation(s)
- Anne A H de Hond
- Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, the Netherlands.
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA, USA.
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, the Netherlands.
| | - Vaibhavi B Shah
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA, USA
| | - Ilse M J Kant
- Department of Digital Health, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Ben Van Calster
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, the Netherlands
- Department of Development & Regeneration, KU Leuven, Leuven, Belgium
| | - Ewout W Steyerberg
- Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, the Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, the Netherlands
| | - Tina Hernandez-Boussard
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
- Department of Epidemiology & Population Health (by courtesy), Stanford University, Stanford, CA, USA
| |
Collapse
|
15
|
Kendzerska T, Gershon AS. One Size Does Not Fit All: Risk Stratification for COPD Exacerbations. Chest 2023; 163:733-735. [PMID: 37031974 DOI: 10.1016/j.chest.2023.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Accepted: 01/04/2023] [Indexed: 04/11/2023] Open
Affiliation(s)
- Tetyana Kendzerska
- Department of Medicine, Faculty of Medicine, Ottawa Hospital Research Institute/University of Ottawa, Ottawa, ON, Canada.
| | - Andrea S Gershon
- Department of Medicine, University of Toronto, Toronto, ON, Canada; Sunnybrook Health Sciences Center, Toronto, ON, Canada
| |
Collapse
|
16
|
Van Calster B, Steyerberg EW, Wynants L, van Smeden M. There is no such thing as a validated prediction model. BMC Med 2023; 21:70. [PMID: 36829188 PMCID: PMC9951847 DOI: 10.1186/s12916-023-02779-w] [Citation(s) in RCA: 61] [Impact Index Per Article: 61.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 02/10/2023] [Indexed: 02/26/2023] Open
Abstract
BACKGROUND Clinical prediction models should be validated before implementation in clinical practice. But is favorable performance at internal validation or one external validation sufficient to claim that a prediction model works well in the intended clinical context? MAIN BODY We argue to the contrary because (1) patient populations vary, (2) measurement procedures vary, and (3) populations and measurements change over time. Hence, we have to expect heterogeneity in model performance between locations and settings, and across time. It follows that prediction models are never truly validated. This does not imply that validation is not important. Rather, the current focus on developing new models should shift to a focus on more extensive, well-conducted, and well-reported validation studies of promising models. CONCLUSION Principled validation strategies are needed to understand and quantify heterogeneity, monitor performance over time, and update prediction models when appropriate. Such strategies will help to ensure that prediction models stay up-to-date and safe to support clinical decision-making.
Collapse
Affiliation(s)
- Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- EPI-Center, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| | | | - Laure Wynants
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- EPI-Center, KU Leuven, Leuven, Belgium
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, Netherlands.
| |
Collapse
|
17
|
External Validation of Mortality Prediction Models for Critical Illness Reveals Preserved Discrimination but Poor Calibration. Crit Care Med 2023; 51:80-90. [PMID: 36378565 DOI: 10.1097/ccm.0000000000005712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
OBJECTIVES In a recent scoping review, we identified 43 mortality prediction models for critically ill patients. We aimed to assess the performances of these models through external validation. DESIGN Multicenter study. SETTING External validation of models was performed in the Simple Intensive Care Studies-I (SICS-I) and the Finnish Acute Kidney Injury (FINNAKI) study. PATIENTS The SICS-I study consisted of 1,075 patients, and the FINNAKI study consisted of 2,901 critically ill patients. MEASUREMENTS AND MAIN RESULTS For each model, we assessed: 1) the original publications for the data needed for model reconstruction, 2) availability of the variables, 3) model performance in two independent cohorts, and 4) the effects of recalibration on model performance. The models were recalibrated using data of the SICS-I and subsequently validated using data of the FINNAKI study. We evaluated overall model performance using various indexes, including the (scaled) Brier score, discrimination (area under the curve of the receiver operating characteristics), calibration (intercepts and slopes), and decision curves. Eleven models (26%) could be externally validated. The Acute Physiology And Chronic Health Evaluation (APACHE) II, APACHE IV, Simplified Acute Physiology Score (SAPS)-Reduced (SAPS-R)' and Simplified Mortality Score for the ICU models showed the best scaled Brier scores of 0.11' 0.10' 0.10' and 0.06' respectively. SAPS II, APACHE II, and APACHE IV discriminated best; overall discrimination of models ranged from area under the curve of the receiver operating characteristics of 0.63 (0.61-0.66) to 0.83 (0.81-0.85). We observed poor calibration in most models, which improved to at least moderate after recalibration of intercepts and slopes. The decision curve showed a positive net benefit in the 0-60% threshold probability range for APACHE IV and SAPS-R. CONCLUSIONS In only 11 out of 43 available mortality prediction models, the performance could be studied using two cohorts of critically ill patients. External validation showed that the discriminative ability of APACHE II, APACHE IV, and SAPS II was acceptable to excellent, whereas calibration was poor.
Collapse
|
18
|
Stroke risk in older British men: Comparing performance of stroke-specific and composite-CVD risk prediction tools. Prev Med Rep 2022; 31:102098. [PMID: 36820364 PMCID: PMC9938339 DOI: 10.1016/j.pmedr.2022.102098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 12/14/2022] [Accepted: 12/22/2022] [Indexed: 12/25/2022] Open
Abstract
Stroke risk is currently estimated as part of the composite risk of cardiovascular disease (CVD). We investigated if composite-CVD risk prediction tools QRISK3 and Pooled Cohort Equations-PCE, derived from middle-aged adults, are as good as stroke-specific Framingham Stroke Risk Profile-FSRP and QStroke for capturing the true risk of stroke in older adults. External validation for 10y stroke outcomes was performed in men (60-79y) of the British Regional Heart Study. Discrimination and calibration were assessed in separate validation samples (FSRP n = 3762, QStroke n = 3376, QRISK3 n = 2669 and PCE n = 3047) with/without adjustment for competing risks. Sensitivity/specificity were examined using observed and clinically recommended thresholds. Performance of FSRP, QStroke and QRISK3 was further compared head-to-head in 2441 men free of a range of CVD, including across age-groups. Observed 10y risk (/1000PY) ranged from 6.8 (hard strokes) to 11 (strokes/transient ischemic attacks). All tools discriminated weakly, C-indices 0.63-0.66. FSRP and QStroke overestimated risk at higher predicted probabilities. QRISK3 and PCE showed reasonable calibration overall with minor mis-estimations across the risk range. Performance worsened on adjusting for competing non-stroke deaths. However, in men without CVD, QRISK3 displayed relatively better calibration for stroke events, even after adjustment for competing deaths, including in oldest men. All tools displayed similar sensitivity (63-73 %) and specificity (52-54 %) using observed risks as cut-offs. When QRISK3 and PCE were evaluated using thresholds for CVD prevention, sensitivity for stroke events was 99 %, with false positive rate 97 % suggesting existing intervention thresholds may need to be re-examined to reflect age-related stroke burden.
Collapse
Key Words
- AF, atrial fibrillation
- BRHS, British Regional Heart Study
- CHD, coronary heart disease
- CIF, cumulative incidence function
- CPI, centred prognostic index
- CVD, cardiovascular disease
- Calibration
- Cardiovascular disease
- Discrimination
- FSRP, Framingham stroke risk profile
- HF, heart failure
- KM, Kaplan-Meier
- MI, myocardial infarction
- NICE, National Institute For Health And Care Excellence
- Older adults
- PCE, pooled cohort equations
- PI, prognostic index
- Risk prediction
- SCORE, systematic coronary risk evaluation
- Sn/Sp, percent sensitivity/percent specificity
- Stroke
- TIA, transient ischemic attack
Collapse
|
19
|
Ho JK, Safari A, Adibi A, Sin DD, Johnson K, Sadatsafavi M. Generalizability of Risk Stratification Algorithms for Exacerbations in COPD. Chest 2022; 163:790-798. [PMID: 36509123 DOI: 10.1016/j.chest.2022.11.041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 11/02/2022] [Accepted: 11/18/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Contemporary management of COPD relies on exacerbation history to risk-stratify patients for future exacerbations. Multivariate prediction models can improve the performance of risk stratification. However, the clinical usefulness of risk stratification can vary from one population to another. RESEARCH QUESTION How do two validated exacerbation risk prediction models (Acute COPD Exacerbation Prediction Tool [ACCEPT] and the Bertens model) compared with exacerbation history alone perform in different patient populations? STUDY DESIGN AND METHODS We used data from three clinical studies representing populations at different levels of moderate to severe exacerbation risk: the Study to Understand Mortality and Morbidity in COPD (SUMMIT; N = 2,421; annual risk, 0.22), the Long-term Oxygen Treatment Trial (LOTT; N = 595; annual risk, 0.38), and Towards a Revolution in COPD Health (TORCH; N = 1,091; annual risk, 0.52). We compared the area under the receiver operating characteristic curve (AUC) and net benefit (measure of clinical usefulness) among three risk stratification algorithms for predicting exacerbations in the next 12 months. We also evaluated the effect of model recalibration on clinical usefulness. RESULTS Compared with exacerbation history, ACCEPT showed better performance in all three samples (change in AUC, 0.08, 0.07, and 0.10, respectively; P ≤ .001 for all). The Bertens model showed better performance compared with exacerbation history in SUMMIT and TORCH (change in AUC, 0.10 and 0.05, respectively; P < .001 for both), but not in LOTT. No algorithm was superior in clinical usefulness across all samples. Before recalibration, the Bertens model generally outperformed the other algorithms in low-risk settings, whereas ACCEPT outperformed others in high-risk settings. All three algorithms showed the risk of harm (providing lower net benefit than not using any risk stratification). After recalibration, risk of harm was mitigated substantially for both prediction models. INTERPRETATION Exacerbation history alone is unlikely to provide clinical usefulness for predicting COPD exacerbations in all settings and could be associated with a risk of harm. Prediction models have superior predictive performance, but require setting-specific recalibration to confer higher clinical usefulness.
Collapse
Affiliation(s)
- Joseph Khoa Ho
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada; Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Abdollah Safari
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada; Centre for Heart Lung Innovation, The University of British Columbia, Vancouver, BC, Canada
| | - Amin Adibi
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada; Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Don D Sin
- Department of Medicine (Respirology), The University of British Columbia, Vancouver, BC, Canada; Department of Mathematics, Statistics, and Computer Science, University of Tehran, Tehran, Iran
| | - Kate Johnson
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada; Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Mohsen Sadatsafavi
- Respiratory Evaluation Sciences Program, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada; Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, The University of British Columbia, Vancouver, BC, Canada.
| | | |
Collapse
|
20
|
de Hond AAH, Steyerberg EW, van Calster B. Interpreting area under the receiver operating characteristic curve. Lancet Digit Health 2022; 4:e853-e855. [PMID: 36270955 DOI: 10.1016/s2589-7500(22)00188-1] [Citation(s) in RCA: 96] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 08/19/2022] [Accepted: 09/20/2022] [Indexed: 11/07/2022]
Affiliation(s)
- Anne A H de Hond
- Clinical AI Implementation and Research Lab, Leiden University Medical Centre, 2333 ZA Leiden, Netherlands; Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA, USA.
| | - Ewout W Steyerberg
- Clinical AI Implementation and Research Lab, Leiden University Medical Centre, 2333 ZA Leiden, Netherlands; Department of Biomedical Data Sciences, Leiden University Medical Centre, 2333 ZA Leiden, Netherlands
| | - Ben van Calster
- Department of Biomedical Data Sciences, Leiden University Medical Centre, 2333 ZA Leiden, Netherlands; Department of Development & Regeneration, KU Leuven, Leuven, Belgium
| |
Collapse
|
21
|
van Klaveren D, Zanos TP, Nelson J, Levy TJ, Park JG, Retel Helmrich IRA, Rietjens JAC, Basile MJ, Hajizadeh N, Lingsma HF, Kent DM. Prognostic models for COVID-19 needed updating to warrant transportability over time and space. BMC Med 2022; 20:456. [PMID: 36424619 PMCID: PMC9686462 DOI: 10.1186/s12916-022-02651-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 11/04/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Supporting decisions for patients who present to the emergency department (ED) with COVID-19 requires accurate prognostication. We aimed to evaluate prognostic models for predicting outcomes in hospitalized patients with COVID-19, in different locations and across time. METHODS We included patients who presented to the ED with suspected COVID-19 and were admitted to 12 hospitals in the New York City (NYC) area and 4 large Dutch hospitals. We used second-wave patients who presented between September and December 2020 (2137 and 3252 in NYC and the Netherlands, respectively) to evaluate models that were developed on first-wave patients who presented between March and August 2020 (12,163 and 5831). We evaluated two prognostic models for in-hospital death: The Northwell COVID-19 Survival (NOCOS) model was developed on NYC data and the COVID Outcome Prediction in the Emergency Department (COPE) model was developed on Dutch data. These models were validated on subsequent second-wave data at the same site (temporal validation) and at the other site (geographic validation). We assessed model performance by the Area Under the receiver operating characteristic Curve (AUC), by the E-statistic, and by net benefit. RESULTS Twenty-eight-day mortality was considerably higher in the NYC first-wave data (21.0%), compared to the second-wave (10.1%) and the Dutch data (first wave 10.8%; second wave 10.0%). COPE discriminated well at temporal validation (AUC 0.82), with excellent calibration (E-statistic 0.8%). At geographic validation, discrimination was satisfactory (AUC 0.78), but with moderate over-prediction of mortality risk, particularly in higher-risk patients (E-statistic 2.9%). While discrimination was adequate when NOCOS was tested on second-wave NYC data (AUC 0.77), NOCOS systematically overestimated the mortality risk (E-statistic 5.1%). Discrimination in the Dutch data was good (AUC 0.81), but with over-prediction of risk, particularly in lower-risk patients (E-statistic 4.0%). Recalibration of COPE and NOCOS led to limited net benefit improvement in Dutch data, but to substantial net benefit improvement in NYC data. CONCLUSIONS NOCOS performed moderately worse than COPE, probably reflecting unique aspects of the early pandemic in NYC. Frequent updating of prognostic models is likely to be required for transportability over time and space during a dynamic pandemic.
Collapse
Affiliation(s)
- David van Klaveren
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands. .,Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA.
| | - Theodoros P Zanos
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Jason Nelson
- Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
| | - Todd J Levy
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Jinny G Park
- Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
| | - Isabel R A Retel Helmrich
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - Judith A C Rietjens
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - Melissa J Basile
- Division of Pulmonary Critical Care and Sleep Medicine, Department of Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell Health, Hempstead, NY, USA
| | - Negin Hajizadeh
- Division of Pulmonary Critical Care and Sleep Medicine, Department of Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell Health, Hempstead, NY, USA
| | - Hester F Lingsma
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - David M Kent
- Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
| |
Collapse
|
22
|
Levy TJ, Coppa K, Cang J, Barnaby DP, Paradis MD, Cohen SL, Makhnevich A, van Klaveren D, Kent DM, Davidson KW, Hirsch JS, Zanos TP. Development and validation of self-monitoring auto-updating prognostic models of survival for hospitalized COVID-19 patients. Nat Commun 2022; 13:6812. [PMID: 36357420 PMCID: PMC9648888 DOI: 10.1038/s41467-022-34646-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 11/02/2022] [Indexed: 11/12/2022] Open
Abstract
Clinical prognostic models can assist patient care decisions. However, their performance can drift over time and location, necessitating model monitoring and updating. Despite rapid and significant changes during the pandemic, prognostic models for COVID-19 patients do not currently account for these drifts. We develop a framework for continuously monitoring and updating prognostic models and apply it to predict 28-day survival in COVID-19 patients. We use demographic, laboratory, and clinical data from electronic health records of 34912 hospitalized COVID-19 patients from March 2020 until May 2022 and compare three modeling methods. Model calibration performance drift is immediately detected with minor fluctuations in discrimination. The overall calibration on the prospective validation cohort is significantly improved when comparing the dynamically updated models against their static counterparts. Our findings suggest that, using this framework, models remain accurate and well-calibrated across various waves, variants, race and sex and yield positive net-benefits.
Collapse
Affiliation(s)
- Todd J Levy
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
| | - Kevin Coppa
- Clinical Digital Solutions, Northwell Health, New Hyde Park, NY, 11042, USA
| | - Jinxuan Cang
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
| | - Douglas P Barnaby
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Marc D Paradis
- Northwell Holdings, Northwell Health, Manhasset, NY, 11030, USA
| | - Stuart L Cohen
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Alex Makhnevich
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - David van Klaveren
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, Netherlands
- Predictive Analytics and Comparative Effectiveness Center, Tufts Medical Center, Boston, MA, USA
| | - David M Kent
- Predictive Analytics and Comparative Effectiveness Center, Tufts Medical Center, Boston, MA, USA
| | - Karina W Davidson
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Jamie S Hirsch
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Clinical Digital Solutions, Northwell Health, New Hyde Park, NY, 11042, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Theodoros P Zanos
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA.
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA.
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA.
| |
Collapse
|
23
|
Safari A, Adibi A, Sin DD, Lee TY, Ho JK, Sadatsafavi M. ACCEPT 2·0: Recalibrating and externally validating the Acute COPD exacerbation prediction tool (ACCEPT). EClinicalMedicine 2022; 51:101574. [PMID: 35898315 PMCID: PMC9309408 DOI: 10.1016/j.eclinm.2022.101574] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 06/30/2022] [Accepted: 07/01/2022] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND The Acute Chronic Obstructive Pulmonary Disease (COPD) Exacerbation Prediction Tool (ACCEPT) was developed for individualised prediction of COPD exacerbations. ACCEPT was well calibrated overall and had a high discriminatory power, but overestimated risk among individuals without recent exacerbations. The objectives of this study were to 1) fine-tune ACCEPT to make better predictions for individuals with a negative exacerbation history, 2) develop more parsimonious models, and 3) externally validate the models in a new dataset. METHODS We recalibrated ACCEPT using data from the Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points (ECLIPSE, a three-year observational study, 1,803 patients, 2,117 exacerbations) study by applying non-parametric regression splines to the predicted rates. We developed three reduced versions of ACCEPT by removing symptom score and/or baseline medications as predictors. We examined the discrimination, calibration, and net benefit of ACCEPT 2·0 in the placebo arm of the Towards a Revolution in COPD Health (TORCH, a three-year randomised clinical trial of inhaled therapies in COPD, 1,091 patients, 1,064 exacerbations) study. The primary outcome for prediction was the occurrence of ≥2 moderate or ≥1 severe exacerbation in the next 12 months; the secondary outcomes were prediction of the occurrence of any moderate/severe exacerbation or any severe exacerbation. FINDINGS ACCEPT 2·0 had an area-under-the-curve (AUC) of 0·76 for predicting the primary outcome. Exacerbation history alone (current standard of care) had an AUC of 0·68. The model was well calibrated in patients with positive or negative exacerbation histories. Changes in AUC in reduced versions were minimal for the primary outcome as well as for predicting the occurrence of any moderate/severe exacerbations (ΔAUC<0·011), but more substantial for predicting the occurrence of any severe exacerbations (ΔAUC<0·020). All versions of ACCEPT 2·0 provided positive net benefit over the use of exacerbation history alone for some range of thresholds. INTERPRETATION ACCEPT 2·0 showed good calibration regardless of exacerbation history, and predicts exacerbation risk better than current standard of care for a range of thresholds. Future studies need to investigate the utility of exacerbation prediction in various subgroups of patients. FUNDING This study was funded by a team grant from the Canadian Institutes of Health Research (PHT 178432).
Collapse
Affiliation(s)
- Abdollah Safari
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
- Department of Mathematics, Statistics, and Computer Science, University of Tehran, Tehran, Iran
| | - Amin Adibi
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
| | - Don D. Sin
- Centre for Heart Lung Innovation, St. Paul's Hospital and Department of Medicine (Division of Respirology), The University of British Columbia, Vancouver, Canada
| | - Tae Yoon Lee
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
| | - Joseph Khoa Ho
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
| | - Mohsen Sadatsafavi
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
- Centre for Heart Lung Innovation, St. Paul's Hospital and Department of Medicine (Division of Respirology), The University of British Columbia, Vancouver, Canada
- Corresponding author at: Room 4110, Faculty of Pharmaceutical Sciences, 2405 Wesbrook Mall, Vancouver, BC, V6T1Z3, Canada.
| |
Collapse
|
24
|
Moving from “Surgeries” to Patients: Progress and Pitfalls While Using Machine Learning to Personalize Transfusion Prediction. Anesthesiology 2022; 137:9-12. [DOI: 10.1097/aln.0000000000004250] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|