1
|
Gonzalez R, Saha A, Campbell CJ, Nejat P, Lokker C, Norgan AP. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities. J Pathol Inform 2024; 15:100347. [PMID: 38162950 PMCID: PMC10755052 DOI: 10.1016/j.jpi.2023.100347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/06/2023] [Accepted: 11/01/2023] [Indexed: 01/03/2024] Open
Abstract
This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and those that require a conceptual reappraisal from a critical perspective. Then, a novel opportunity to support "Learning Health Systems" by integrating hidden information extracted by ML models from digitalized histopathology slides with other healthcare big data is presented.
Collapse
Affiliation(s)
- Ricardo Gonzalez
- DeGroote School of Business, McMaster University, Hamilton, Ontario, Canada
- Division of Computational Pathology and Artificial Intelligence, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| | - Ashirbani Saha
- Department of Oncology, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
- Escarpment Cancer Research Institute, McMaster University and Hamilton Health Sciences, Hamilton, Ontario, Canada
| | - Clinton J.V. Campbell
- William Osler Health System, Brampton, Ontario, Canada
- Department of Pathology and Molecular Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
| | - Peyman Nejat
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States
| | - Cynthia Lokker
- Health Information Research Unit, Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, Ontario, Canada
| | - Andrew P. Norgan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
| |
Collapse
|
2
|
Smith KA, Hardy A, Vinnikova A, Blease C, Milligan L, Hidalgo-Mazzei D, Lambe S, Marzano L, Uhlhaas PJ, Ostinelli EG, Anmella G, Zangani C, Aronica R, Dwyer B, Torous J, Cipriani A. Digital Mental Health for Schizophrenia and Other Severe Mental Illnesses: An International Consensus on Current Challenges and Potential Solutions. JMIR Ment Health 2024; 11:e57155. [PMID: 38717799 PMCID: PMC11112473 DOI: 10.2196/57155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 03/08/2024] [Accepted: 03/21/2024] [Indexed: 05/25/2024] Open
Abstract
BACKGROUND Digital approaches may be helpful in augmenting care to address unmet mental health needs, particularly for schizophrenia and severe mental illness (SMI). OBJECTIVE An international multidisciplinary group was convened to reach a consensus on the challenges and potential solutions regarding collecting data, delivering treatment, and the ethical challenges in digital mental health approaches for schizophrenia and SMI. METHODS The consensus development panel method was used, with an in-person meeting of 2 groups: the expert group and the panel. Membership was multidisciplinary including those with lived experience, with equal participation at all stages and coproduction of the consensus outputs and summary. Relevant literature was shared in advance of the meeting, and a systematic search of the recent literature on digital mental health interventions for schizophrenia and psychosis was completed to ensure that the panel was informed before the meeting with the expert group. RESULTS Four broad areas of challenge and proposed solutions were identified: (1) user involvement for real coproduction; (2) new approaches to methodology in digital mental health, including agreed standards, data sharing, measuring harms, prevention strategies, and mechanistic research; (3) regulation and funding issues; and (4) implementation in real-world settings (including multidisciplinary collaboration, training, augmenting existing service provision, and social and population-focused approaches). Examples are provided with more detail on human-centered research design, lived experience perspectives, and biomedical ethics in digital mental health approaches for SMI. CONCLUSIONS The group agreed by consensus on a number of recommendations: (1) a new and improved approach to digital mental health research (with agreed reporting standards, data sharing, and shared protocols), (2) equal emphasis on social and population research as well as biological and psychological approaches, (3) meaningful collaborations across varied disciplines that have previously not worked closely together, (4) increased focus on the business model and product with planning and new funding structures across the whole development pathway, (5) increased focus and reporting on ethical issues and potential harms, and (6) organizational changes to allow for true communication and coproduction with those with lived experience of SMI. This study approach, combining an international expert meeting with patient and public involvement and engagement throughout the process, consensus methodology, discussion, and publication, is a helpful way to identify directions for future research and clinical implementation in rapidly evolving areas and can be combined with measurements of real-world clinical impact over time. Similar initiatives will be helpful in other areas of digital mental health and similarly fast-evolving fields to focus research and organizational change and effect improved real-world clinical implementation.
Collapse
Affiliation(s)
- Katharine A Smith
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Oxford Health NHS Foundation Trust, Warneford Hospital, Oxford, United Kingdom
- Oxford Precision Psychiatry Lab, NIHR Oxford Health Biomedical Research Centre, Oxford, United Kingdom
| | - Amy Hardy
- Department of Psychology, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom
- South London & Maudsley NHS Foundation Trust, London, United Kingdom
| | | | - Charlotte Blease
- Participatory eHealth and Health Data Research Group, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States
| | - Lea Milligan
- MQ Mental Health Research, London, United Kingdom
| | - Diego Hidalgo-Mazzei
- Department of Psychiatry and Psychology, Institute of Neuroscience, Hospital Clínic de Barcelona, Barcelona, Spain
- Bipolar and Depressive Disorders Unit, Digital Innovation Group, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain
- Department of Medicine, School of Medicine and Health Sciences, Institute of Neurosciences, University of Barcelona, Barcelona, Spain
| | - Sinéad Lambe
- Oxford Health NHS Foundation Trust, Warneford Hospital, Oxford, United Kingdom
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Lisa Marzano
- School of Science and Technology, Middlesex University, London, United Kingdom
| | - Peter J Uhlhaas
- Institute of Neuroscience and Psychology, University of Glasgow, Glasgow, United Kingdom
- Department of Child and Adolescent Psychiatry, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Edoardo G Ostinelli
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Oxford Health NHS Foundation Trust, Warneford Hospital, Oxford, United Kingdom
- Oxford Precision Psychiatry Lab, NIHR Oxford Health Biomedical Research Centre, Oxford, United Kingdom
| | - Gerard Anmella
- Department of Psychiatry and Psychology, Institute of Neuroscience, Hospital Clínic de Barcelona, Barcelona, Spain
- Bipolar and Depressive Disorders Unit, Digital Innovation Group, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain
- Department of Medicine, School of Medicine and Health Sciences, Institute of Neurosciences, University of Barcelona, Barcelona, Spain
| | - Caroline Zangani
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Oxford Health NHS Foundation Trust, Warneford Hospital, Oxford, United Kingdom
- Oxford Precision Psychiatry Lab, NIHR Oxford Health Biomedical Research Centre, Oxford, United Kingdom
| | - Rosario Aronica
- Psychiatry Unit, Department of Neurosciences and Mental Health, Ospedale Maggiore Policlinico Ca' Granda, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico, Milan, Italy
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
| | - Bridget Dwyer
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States
| | - John Torous
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, United States
| | - Andrea Cipriani
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
- Oxford Health NHS Foundation Trust, Warneford Hospital, Oxford, United Kingdom
- Oxford Precision Psychiatry Lab, NIHR Oxford Health Biomedical Research Centre, Oxford, United Kingdom
| |
Collapse
|
3
|
Scott J, White A, Walsh C, Aslett L, Rutherford MA, Ng J, Judge C, Sebastian K, O'Brien S, Kelleher J, Power J, Conlon N, Moran SM, Luqmani RA, Merkel PA, Tesar V, Hruskova Z, Little MA. Computable phenotype for real-world, data-driven retrospective identification of relapse in ANCA-associated vasculitis. RMD Open 2024; 10:e003962. [PMID: 38688690 PMCID: PMC11086371 DOI: 10.1136/rmdopen-2023-003962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 03/29/2024] [Indexed: 05/02/2024] Open
Abstract
OBJECTIVE ANCA-associated vasculitis (AAV) is a relapsing-remitting disease, resulting in incremental tissue injury. The gold-standard relapse definition (Birmingham Vasculitis Activity Score, BVAS>0) is often missing or inaccurate in registry settings, leading to errors in ascertainment of this key outcome. We sought to create a computable phenotype (CP) to automate retrospective identification of relapse using real-world data in the research setting. METHODS We studied 536 patients with AAV and >6 months follow-up recruited to the Rare Kidney Disease registry (a national longitudinal, multicentre cohort study). We followed five steps: (1) independent encounter adjudication using primary medical records to assign the ground truth, (2) selection of data elements (DEs), (3) CP development using multilevel regression modelling, (4) internal validation and (5) development of additional models to handle missingness. Cut-points were determined by maximising the F1-score. We developed a web application for CP implementation, which outputs an individualised probability of relapse. RESULTS Development and validation datasets comprised 1209 and 377 encounters, respectively. After classifying encounters with diagnostic histopathology as relapse, we identified five key DEs; DE1: change in ANCA level, DE2: suggestive blood/urine tests, DE3: suggestive imaging, DE4: immunosuppression status, DE5: immunosuppression change. F1-score, sensitivity and specificity were 0.85 (95% CI 0.77 to 0.92), 0.89 (95% CI 0.80 to 0.99) and 0.96 (95% CI 0.93 to 0.99), respectively. Where DE5 was missing, DE2 plus either DE1/DE3 were required to match the accuracy of BVAS. CONCLUSIONS This CP accurately quantifies the individualised probability of relapse in AAV retrospectively, using objective, readily accessible registry data. This framework could be leveraged for other outcomes and relapsing diseases.
Collapse
Affiliation(s)
- Jennifer Scott
- Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
| | - Arthur White
- School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
- ADAPT SFI centre, Trinity College Dublin, Dublin, Ireland
| | - Cathal Walsh
- Department of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
- National Centre for Pharmacoeconomics, St James's Hospital, Dublin, Ireland
| | - Louis Aslett
- Department of Mathematical Science, University of Durham, Durham, UK
| | | | - James Ng
- School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
| | - Conor Judge
- School of Medicine, College of Medicine, Nursing and Health Science, University of Galway, Galway, Ireland
| | - Kuruvilla Sebastian
- Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
| | - Sorcha O'Brien
- Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
| | - John Kelleher
- Department of Statistics, Dublin Institute of Technology, Dublin, Ireland
| | - Julie Power
- Vasculitis Ireland Awareness, Dublin, Ireland
| | - Niall Conlon
- Department of Immunology, St James's Hospital, Dublin, Ireland
| | - Sarah M Moran
- Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
- Department of Nephrology, Cork University Hospital, Cork, Ireland
| | - Raashid Ahmed Luqmani
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Science (NDORMs), University of Oxford, Oxford, UK
| | - Peter A Merkel
- Division of Rheumatology, Department of Medicine, Division of Epidemiology, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Vladimir Tesar
- Department of Nephrology, General University Hospital, Prague, Czech Republic
- 1st Faculty of Medicine, Charles University, Prague, Czech Republic
| | - Zdenka Hruskova
- 1st Faculty of Medicine, Charles University, Prague, Czech Republic
- General University Hospital, Prague, Czech Republic
| | - Mark A Little
- Trinity Kidney Centre, Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, Ireland
- ADAPT SFI centre, Trinity College Dublin, Dublin, Ireland
| |
Collapse
|
4
|
Zahra A, van Smeden M, Abbink EJ, van den Berg JM, Blom MT, van den Dries CJ, Gussekloo J, Wouters F, Joling KJ, Melis R, Mooijaart SP, Peters JB, Polinder-Bos HA, van Raaij BFM, Appelman B, la Roi-Teeuw HM, Moons KGM, Luijken K. External validation of six COVID-19 prognostic models for predicting mortality risk in older populations in a hospital, primary care, and nursing home setting. J Clin Epidemiol 2024; 168:111270. [PMID: 38311188 DOI: 10.1016/j.jclinepi.2024.111270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/26/2024] [Accepted: 01/26/2024] [Indexed: 02/10/2024]
Abstract
OBJECTIVES To systematically evaluate the performance of COVID-19 prognostic models and scores for mortality risk in older populations across three health-care settings: hospitals, primary care, and nursing homes. STUDY DESIGN AND SETTING This retrospective external validation study included 14,092 older individuals of ≥70 years of age with a clinical or polymerase chain reaction-confirmed COVID-19 diagnosis from March 2020 to December 2020. The six validation cohorts include three hospital-based (CliniCo, COVID-OLD, COVID-PREDICT), two primary care-based (Julius General Practitioners Network/Academisch network huisartsgeneeskunde/Network of Academic general Practitioners, PHARMO), and one nursing home cohort (YSIS) in the Netherlands. Based on a living systematic review of COVID-19 prediction models using Prediction model Risk Of Bias ASsessment Tool for quality and risk of bias assessment and considering predictor availability in validation cohorts, we selected six prognostic models predicting mortality risk in adults with COVID-19 infection (GAL-COVID-19 mortality, 4C Mortality Score, National Early Warning Score 2-extended model, Xie model, Wang clinical model, and CURB65 score). All six prognostic models were validated in the hospital cohorts and the GAL-COVID-19 mortality model was validated in all three healthcare settings. The primary outcome was in-hospital mortality for hospitals and 28-day mortality for primary care and nursing home settings. Model performance was evaluated in each validation cohort separately in terms of discrimination, calibration, and decision curves. An intercept update was performed in models indicating miscalibration followed by predictive performance re-evaluation. MAIN OUTCOME MEASURE In-hospital mortality for hospitals and 28-day mortality for primary care and nursing home setting. RESULTS All six prognostic models performed poorly and showed miscalibration in the older population cohorts. In the hospital settings, model performance ranged from calibration-in-the-large -1.45 to 7.46, calibration slopes 0.24-0.81, and C-statistic 0.55-0.71 with 4C Mortality Score performing as the most discriminative and well-calibrated model. Performance across health-care settings was similar for the GAL-COVID-19 model, with a calibration-in-the-large in the range of -2.35 to -0.15 indicating overestimation, calibration slopes of 0.24-0.81 indicating signs of overfitting, and C-statistic of 0.55-0.71. CONCLUSION Our results show that most prognostic models for predicting mortality risk performed poorly in the older population with COVID-19, in each health-care setting: hospital, primary care, and nursing home settings. Insights into factors influencing predictive model performance in the older population are needed for pandemic preparedness and reliable prognostication of health-related outcomes in this demographic.
Collapse
Affiliation(s)
- Anum Zahra
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
| | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Evertine J Abbink
- Department of Internal Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Jesse M van den Berg
- Department of General Practice, Amsterdam UMC Location Vrije Universiteit Amsterdam, Amsterdam, The Netherlands; Amsterdam Public Health, Health Behaviors & Chronic Diseases, Amsterdam, The Netherlands; PHARMO Institute for Drug Outcomes Research, Utrecht, The Netherlands
| | - Marieke T Blom
- Department of General Practice, Amsterdam UMC Location Vrije Universiteit Amsterdam, Amsterdam, The Netherlands; Amsterdam Public Health, Health Behaviors & Chronic Diseases, Amsterdam, The Netherlands
| | - Carline J van den Dries
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Jacobijn Gussekloo
- Section Gerontology and Geriatrics, LUMC Center for Medicine for Older People & Department of Public Health and Primary Care & Department of Internal Medicine, Leiden University Medical Center, Leiden, The Netherlands
| | - Fenne Wouters
- Department of Medicine for Older People, Amsterdam UMC, Location Vrije Universiteit Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands; Amsterdam Public Health Research Institute, Aging & Later Life, Amsterdam, The Netherlands
| | - Karlijn J Joling
- Department of Medicine for Older People, Amsterdam UMC, Location Vrije Universiteit Amsterdam, De Boelelaan 1117, Amsterdam, The Netherlands; Amsterdam Public Health Research Institute, Aging & Later Life, Amsterdam, The Netherlands
| | - René Melis
- Department of Geriatric Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Simon P Mooijaart
- LUMC Center for Medicine for Older People, LUMC, Leiden, The Netherlands
| | - Jeannette B Peters
- Department of Pulmonary Diseases, Radboud University Medical Center, Radboud Institute for Health Sciences, Nijmegen, The Netherlands
| | - Harmke A Polinder-Bos
- Section of Geriatric Medicine, Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, The Netherlands
| | - Bas F M van Raaij
- LUMC Center for Medicine for Older People, LUMC, Leiden, The Netherlands
| | - Brent Appelman
- Amsterdam UMC Location University of Amsterdam, Center for Experimental and Molecular Medicine, Amsterdam, The Netherlands
| | - Hannah M la Roi-Teeuw
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Kim Luijken
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
5
|
Arakaki D, Iwata M, Terasawa T. External validation and update of the International Medical Prevention Registry on Venous Thromboembolism bleeding risk score for predicting bleeding in acutely ill hospitalized medical patients: a retrospective single-center cohort study in Japan. Thromb J 2024; 22:31. [PMID: 38549086 PMCID: PMC10976666 DOI: 10.1186/s12959-024-00603-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Accepted: 03/21/2024] [Indexed: 04/10/2024] Open
Abstract
BACKGROUND The International Medical Prevention Registry for Venous Thromboembolism (IMPROVE) Bleeding Risk Score is the recommended risk assessment model (RAM) for predicting bleeding risk in acutely ill medical inpatients in Western countries. However, few studies have assessed its predictive performance in local Asian settings. METHODS We retrospectively identified acutely ill adolescents and adults (aged ≥ 15 years) who were admitted to our general internal medicine department between July 5, 2016 and July 5, 2021, and extracted data from their electronic medical records. The outcome of interest was the cumulative incidence of major and nonmajor but clinically relevant bleeding 14 days after admission. For the two-risk-group model, we estimated sensitivity, specificity, and positive and negative predictive values (PPV and NPV, respectively). For the 11-risk-group model, we estimated C statistic, expected and observed event ratio (E/O), calibration-in-the-large (CITL), and calibration slope. In addition, we recalibrated the intercept using local data to update the RAM. RESULTS Among the 3,876 included patients, 998 (26%) were aged ≥ 85 years, while 656 (17%) were hospitalized in the intensive care unit. The median length of hospital stay was 14 days. Clinically relevant bleeding occurred in 58 patients (1.5%), 49 (1.3%) of whom experienced major bleeding. Sensitivity, specificity, NPV, and PPV were 26.1% (95% confidence interval [CI]: 15.8-40.0%), 84.8% (83.6-85.9%), 98.7% (98.2-99.0%), and 2.5% (1.5-4.3%) for any bleeding and 30.9% (95% CI: 18.8-46.3%), 84.9% (83.7-86.0%), 99.0% (98.5-99.3%), and 2.5% (1.5-4.3%) for major bleeding, respectively. The C statistic, E/O, CITL, and calibration slope were 0.64 (95% CI: 0.58-0.71), 1.69 (1.45-2.05), - 0.55 (- 0.81 to - 0.29), and 0.58 (0.29-0.87) for any bleeding and 0.67 (95% CI: 0.60-0.74), 0.76 (0.61-0.87), 0.29 (0.00-0.58), and 0.42 (0.19-0.64) for major bleeding, respectively. Updating the model substantially corrected the poor calibration observed. CONCLUSIONS In our Japanese cohort, the IMPROVE bleeding RAM retained the reported moderate discriminative performance. Model recalibration substantially improved the poor calibration obtained using the original RAM. Before its introduction into clinical practice, the updated RAM needs further validation studies and an optimized threshold.
Collapse
Affiliation(s)
- Daichi Arakaki
- Department of Emergency Medicine and General Internal Medicine, Fujita Health University School of Medicine, 1-98 Dengakugakubo, Kutsukakecho, Toyoake, Achi, 470-1192, Toyoake, Aichi, Japan
- Department of Emergency and Critical Care, Nagoya University Hospital, Nagoya, Aichi, Japan
| | - Mitsunaga Iwata
- Department of Emergency Medicine and General Internal Medicine, Fujita Health University School of Medicine, 1-98 Dengakugakubo, Kutsukakecho, Toyoake, Achi, 470-1192, Toyoake, Aichi, Japan
| | - Teruhiko Terasawa
- Department of Emergency Medicine and General Internal Medicine, Fujita Health University School of Medicine, 1-98 Dengakugakubo, Kutsukakecho, Toyoake, Achi, 470-1192, Toyoake, Aichi, Japan.
| |
Collapse
|
6
|
Collins GS, Dhiman P, Ma J, Schlussel MM, Archer L, Van Calster B, Harrell FE, Martin GP, Moons KGM, van Smeden M, Sperrin M, Bullock GS, Riley RD. Evaluation of clinical prediction models (part 1): from development to external validation. BMJ 2024; 384:e074819. [PMID: 38191193 PMCID: PMC10772854 DOI: 10.1136/bmj-2023-074819] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/04/2023] [Indexed: 01/10/2024]
Affiliation(s)
- Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Paula Dhiman
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Jie Ma
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Michael M Schlussel
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Lucinda Archer
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, UK
| | - Ben Van Calster
- KU Leuven, Department of Development and Regeneration, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, Netherlands
- EPI-Centre, KU Leuven, Belgium
| | - Frank E Harrell
- Department of Biostatistics, Vanderbilt University, Nashville, TN, USA
| | - Glen P Martin
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - Karel G M Moons
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Maarten van Smeden
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Matthew Sperrin
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - Garrett S Bullock
- Department of Orthopaedic Surgery, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Centre for Sport, Exercise and Osteoarthritis Research Versus Arthritis, University of Oxford, Oxford, UK
| | - Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, UK
| |
Collapse
|
7
|
Tanner KT, Keogh RH, Coupland CAC, Hippisley-Cox J, Diaz-Ordaz K. Dynamic updating of clinical survival prediction models in a changing environment. Diagn Progn Res 2023; 7:24. [PMID: 38082429 PMCID: PMC10714456 DOI: 10.1186/s41512-023-00163-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 10/17/2023] [Indexed: 01/31/2024] Open
Abstract
BACKGROUND Over time, the performance of clinical prediction models may deteriorate due to changes in clinical management, data quality, disease risk and/or patient mix. Such prediction models must be updated in order to remain useful. In this study, we investigate dynamic model updating of clinical survival prediction models. In contrast to discrete or one-time updating, dynamic updating refers to a repeated process for updating a prediction model with new data. We aim to extend previous research which focused largely on binary outcome prediction models by concentrating on time-to-event outcomes. We were motivated by the rapidly changing environment seen during the COVID-19 pandemic where mortality rates changed over time and new treatments and vaccines were introduced. METHODS We illustrate three methods for dynamic model updating: Bayesian dynamic updating, recalibration, and full refitting. We use a simulation study to compare performance in a range of scenarios including changing mortality rates, predictors with low prevalence and the introduction of a new treatment. Next, the updating strategies were applied to a model for predicting 70-day COVID-19-related mortality using patient data from QResearch, an electronic health records database from general practices in the UK. RESULTS In simulated scenarios with mortality rates changing over time, all updating methods resulted in better calibration than not updating. Moreover, dynamic updating outperformed ad hoc updating. In the simulation scenario with a new predictor and a small updating dataset, Bayesian updating improved the C-index over not updating and refitting. In the motivating example with a rare outcome, no single updating method offered the best performance. CONCLUSIONS We found that a dynamic updating process outperformed one-time discrete updating in the simulations. Bayesian updating offered good performance overall, even in scenarios with new predictors and few events. Intercept recalibration was effective in scenarios with smaller sample size and changing baseline hazard. Refitting performance depended on sample size and produced abrupt changes in hazard ratio estimates between periods.
Collapse
Affiliation(s)
- Kamaryn T Tanner
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK.
| | - Ruth H Keogh
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK
| | - Carol A C Coupland
- Nuffield Department of Primary Health Care Sciences, University of Oxford, Oxford, OX2 6HT, UK
- Centre for Academic Primary Care, University of Nottingham, Nottingham, NG7 2UH, UK
| | - Julia Hippisley-Cox
- Nuffield Department of Primary Health Care Sciences, University of Oxford, Oxford, OX2 6HT, UK
| | - Karla Diaz-Ordaz
- Department of Statistical Science, University College London, London, WC1E 6BT, UK
| |
Collapse
|
8
|
Booth S, Mozumder SI, Archer L, Ensor J, Riley RD, Lambert PC, Rutherford MJ. Using temporal recalibration to improve the calibration of risk prediction models in competing risk settings when there are trends in survival over time. Stat Med 2023; 42:5007-5024. [PMID: 37705296 PMCID: PMC10946485 DOI: 10.1002/sim.9898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 07/31/2023] [Accepted: 08/23/2023] [Indexed: 09/15/2023]
Abstract
We have previously proposed temporal recalibration to account for trends in survival over time to improve the calibration of predictions from prognostic models for new patients. This involves first estimating the predictor effects using data from all individuals (full dataset) and then re-estimating the baseline using a subset of the most recent data whilst constraining the predictor effects to remain the same. In this article, we demonstrate how temporal recalibration can be applied in competing risk settings by recalibrating each cause-specific (or subdistribution) hazard model separately. We illustrate this using an example of colon cancer survival with data from the Surveillance Epidemiology and End Results (SEER) program. Data from patients diagnosed in 1995-2004 were used to fit two models for deaths due to colon cancer and other causes respectively. We discuss considerations that need to be made in order to apply temporal recalibration such as the choice of data used in the recalibration step. We also demonstrate how to assess the calibration of these models in new data for patients diagnosed subsequently in 2005. Comparison was made to a standard analysis (when improvements over time are not taken into account) and a period analysis which is similar to temporal recalibration but differs in the data used to estimate the predictor effects. The 10-year calibration plots demonstrated that using the standard approach over-estimated the risk of death due to colon cancer and the total risk of death and that calibration was improved using temporal recalibration or period analysis.
Collapse
Affiliation(s)
- Sarah Booth
- Biostatistics Research Group, Department of Population Health SciencesUniversity of LeicesterLeicesterUK
| | - Sarwar I. Mozumder
- Biostatistics Research Group, Department of Population Health SciencesUniversity of LeicesterLeicesterUK
- Oncology Biometrics Statistical Innovation, AstraZenecaCambridgeUK
| | - Lucinda Archer
- Institute of Applied Health Research, College of Medical and Dental SciencesUniversity of BirminghamBirminghamUK
| | - Joie Ensor
- Institute of Applied Health Research, College of Medical and Dental SciencesUniversity of BirminghamBirminghamUK
| | - Richard D. Riley
- Institute of Applied Health Research, College of Medical and Dental SciencesUniversity of BirminghamBirminghamUK
| | - Paul C. Lambert
- Biostatistics Research Group, Department of Population Health SciencesUniversity of LeicesterLeicesterUK
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetStockholmSweden
| | - Mark J. Rutherford
- Biostatistics Research Group, Department of Population Health SciencesUniversity of LeicesterLeicesterUK
| |
Collapse
|
9
|
Riley S, Tam K, Tse WY, Connor A, Wei Y. An external validation of the Kidney Donor Risk Index in the UK transplant population in the presence of semi-competing events. Diagn Progn Res 2023; 7:20. [PMID: 37986130 PMCID: PMC10662562 DOI: 10.1186/s41512-023-00159-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 09/11/2023] [Indexed: 11/22/2023] Open
Abstract
BACKGROUND Transplantation represents the optimal treatment for many patients with end-stage kidney disease. When a donor kidney is available to a waitlisted patient, clinicians responsible for the care of the potential recipient must make the decision to accept or decline the offer based upon complex and variable information about the donor, the recipient and the transplant process. A clinical prediction model may be able to support clinicians in their decision-making. The Kidney Donor Risk Index (KDRI) was developed in the United States to predict graft failure following kidney transplantation. The survival process following transplantation consists of semi-competing events where death precludes graft failure, but not vice-versa. METHODS We externally validated the KDRI in the UK kidney transplant population and assessed whether validation under a semi-competing risks framework impacted predictive performance. Additionally, we explored whether the KDRI requires updating. We included 20,035 adult recipients of first, deceased donor, single, kidney-only transplants between January 1, 2004, and December 31, 2018, collected by the UK Transplant Registry and held by NHS Blood and Transplant. The outcomes of interest were 1- and 5-year graft failure following transplantation. In light of the semi-competing events, recipient death was handled in two ways: censoring patients at the time of death and modelling death as a competing event. Cox proportional hazard models were used to validate the KDRI when censoring graft failure by death, and cause-specific Cox models were used to account for death as a competing event. RESULTS The KDRI underestimated event probabilities for those at higher risk of graft failure. For 5-year graft failure, discrimination was poorer in the semi-competing risks model (0.625, 95% CI 0.611 to 0.640;0.611, 95% CI 0.597 to 0.625), but predictions were more accurate (Brier score 0.117, 95% CI 0.112 to 0.121; 0.114, 95% CI 0.109 to 0.118). Calibration plots were similar regardless of whether the death was modelled as a competing event or not. Updating the KDRI worsened calibration, but marginally improved discrimination. CONCLUSIONS Predictive performance for 1-year graft failure was similar between death-censored and competing event graft failure, but differences appeared when predicting 5-year graft failure. The updated index did not have superior performance and we conclude that updating the KDRI in the present form is not required.
Collapse
Affiliation(s)
- Stephanie Riley
- Centre for Mathematical Sciences, School of Engineering, Computing and Mathematics, University of Plymouth, Plymouth, UK.
| | - Kimberly Tam
- School of Engineering, Computing and Mathematics, University of Plymouth, Plymouth, UK
| | - Wai-Yee Tse
- Department of Renal Medicine, South West Transplant Centre, University Hospitals Plymouth NHS Trust, Plymouth, UK
| | - Andrew Connor
- Department of Renal Medicine, South West Transplant Centre, University Hospitals Plymouth NHS Trust, Plymouth, UK
| | - Yinghui Wei
- Centre for Mathematical Sciences, School of Engineering, Computing and Mathematics, University of Plymouth, Plymouth, UK.
| |
Collapse
|
10
|
Vartiainen P, Jukarainen S, Rhedin SA, Prinz A, Hartonen T, Vabalas A, Viippola E, Rodosthenous RS, Koskelainen S, Liu A, Lundholm C, Smew AI, Osvald EC, Helle E, Perola M, Almqvist C, Heinonen S, Ganna A. Risk factors for severe respiratory syncytial virus infection during the first year of life: development and validation of a clinical prediction model. Lancet Digit Health 2023; 5:e821-e830. [PMID: 37890904 DOI: 10.1016/s2589-7500(23)00175-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 07/13/2023] [Accepted: 08/16/2023] [Indexed: 10/29/2023]
Abstract
BACKGROUND Novel immunisation methods against respiratory syncytial virus (RSV) are emerging, but knowledge of risk factors for severe RSV disease is insufficient for optimal targeting of interventions against them. Our aims were to identify predictors for RSV hospital admission from registry-based data and to develop and validate a clinical prediction model to guide RSV immunoprophylaxis for infants younger than 1 year. METHODS In this model development and validation study, we studied all infants born in Finland between June 1, 1997, and May 31, 2020, and in Sweden between June 1, 2006, and May 31, 2020, along with the data for their parents and siblings. Infants were excluded if they died or were admitted to hospital for RSV within the first 7 days of life. The outcome was hospital admission due to RSV bronchiolitis during the first year of life. The Finnish study population was divided into a development dataset (born between June 1, 1997, and May 31, 2017) and a temporal hold-out validation dataset (born between June 1, 2017, and May 31, 2020). The development dataset was used for predictor discovery and selection in which we screened 1511 candidate predictors from the infants', parents', and siblings' data, and developed a logistic regression model with the 16 most important predictors. This model was then validated using the Finnish hold-out validation dataset and the Swedish dataset. FINDINGS In total, there were 1 124 561 infants in the Finnish development dataset, 130 352 infants in the Finnish hold-out validation dataset, and 1 459 472 infants in the Swedish dataset. In addition to known predictors such as severe congenital heart defects (adjusted odds ratio 2·89, 95% CI 2·28-3·65), we confirmed some less established predictors for RSV hospital admission, most notably oesophageal malformations (3·11, 1·86-5·19) and lower complexity congenital heart defects (1·43, 1·25-1·63). The prediction model's C-statistic was 0·766 (95% CI 0·742-0·789) in Finnish data and 0·737 (0·710-0·762) in Swedish validation data. The infants in the highest decile of predicted RSV hospital admission probability had 4·5 times higher observed risk compared with others. Calibration varied according to epidemic intensity. The model's performance was similar to a machine learning (XGboost) model using all 1511 candidate predictors (C-statistic in Finland 0·771, 95% CI 0·754-0·788). The prediction model showed clinical utility in decision curve analysis and in hypothetical number needed to treat calculations for immunisation, and its C-statistic was similar across different strata of parental income. INTERPRETATION The identified predictors and the prediction model can be used in guiding RSV immunoprophylaxis in infants, or as a basis for further immunoprophylaxis targeting tools. FUNDING Sigrid Jusélius Foundation, European Research Council, Pediatric Research Foundation, and Academy of Finland.
Collapse
Affiliation(s)
- Pekka Vartiainen
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland; Pediatric Research Center, Helsinki University Hospital, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
| | - Sakari Jukarainen
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
| | - Samuel Arthur Rhedin
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden; Sachs' Children and Youth Hospital, Stockholm, Sweden
| | - Alexandra Prinz
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
| | - Tuomo Hartonen
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
| | - Andrius Vabalas
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
| | - Essi Viippola
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
| | | | - Sara Koskelainen
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland; Finnish Institute for Health and Welfare (THL), Helsinki, Finland
| | - Aoxing Liu
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland; Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cecilia Lundholm
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Awad I Smew
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Emma Caffrey Osvald
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden; Pediatric Allergy and Pulmonology Unit at Astrid Lindgren Children's Hospital, Karolinska University Hospital, Stockholm, Sweden
| | - Emmi Helle
- Pediatric Research Center, Helsinki University Hospital, Faculty of Medicine, University of Helsinki, Helsinki, Finland; Stem Cells and Metabolism Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland; Finnish Institute for Health and Welfare (THL), Helsinki, Finland; Department of Paediatrics, Labatt Family Heart Centre, The Hospital for Sick Children, University of Toronto, Toronto, ON, Canada
| | - Markus Perola
- Research Program for Clinical and Molecular Metabolism, Faculty of Medicine, University of Helsinki, Helsinki, Finland; Finnish Institute for Health and Welfare (THL), Helsinki, Finland
| | - Catarina Almqvist
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden; Pediatric Allergy and Pulmonology Unit at Astrid Lindgren Children's Hospital, Karolinska University Hospital, Stockholm, Sweden
| | - Santtu Heinonen
- Pediatric Research Center, Helsinki University Hospital, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Andrea Ganna
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Analytical and Translational Genetic Unit, Massachusetts General Hospital, Cambridge, MA, USA
| |
Collapse
|
11
|
de Boer A, van Beek PE, Andriessen P, Groenendaal F, Hogeveen M, Meijer JS, Obermann-Borst SA, Onland W, Scheepers L(HCJ, Vermeulen MJ, Verweij EJT(J, De Proost L, Geurtzen R. Opportunities and Challenges of Prognostic Models for Extremely Preterm Infants. CHILDREN (BASEL, SWITZERLAND) 2023; 10:1712. [PMID: 37892375 PMCID: PMC10605480 DOI: 10.3390/children10101712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 10/06/2023] [Accepted: 10/19/2023] [Indexed: 10/29/2023]
Abstract
Predicting the short- and long-term outcomes of extremely preterm infants remains a challenge. Multivariable prognostic models might be valuable tools for clinicians, parents, and policymakers for providing accurate outcome estimates. In this perspective, we discuss the opportunities and challenges of using prognostic models in extremely preterm infants at population and individual levels. At a population level, these models could support the development of guidelines for decisions about treatment limits and may support policy processes such as benchmarking and resource allocation. At an individual level, these models may enhance prenatal counselling conversations by considering multiple variables and improving transparency about expected outcomes. Furthermore, they may improve consistency in projections shared with parents. For the development of prognostic models, we discuss important considerations such as predictor and outcome measure selection, clinical impact assessment, and generalizability. Lastly, future recommendations for developing and using prognostic models are suggested. Importantly, the purpose of a prognostic model should be clearly defined, and integrating these models into prenatal counselling requires thoughtful consideration.
Collapse
Affiliation(s)
- Angret de Boer
- Department of Neonatology, Amalia Children’s Hospital, Radboud University Medical Center, Geert Grooteplein Zuid 32, 6525 GA Nijmegen, The Netherlands; (P.E.v.B.); (M.H.); (R.G.)
- Department of Obstetrics and Gynecology, Leiden University Medical Centre, 2333 ZA Leiden, The Netherlands;
| | - Pauline E. van Beek
- Department of Neonatology, Amalia Children’s Hospital, Radboud University Medical Center, Geert Grooteplein Zuid 32, 6525 GA Nijmegen, The Netherlands; (P.E.v.B.); (M.H.); (R.G.)
- Department of Neonatology, Máxima Medical Center, 5504 DB Veldhoven, The Netherlands; (P.A.); (J.S.M.)
| | - Peter Andriessen
- Department of Neonatology, Máxima Medical Center, 5504 DB Veldhoven, The Netherlands; (P.A.); (J.S.M.)
| | - Floris Groenendaal
- Department of Neonatology, Wilhelmina Children’s Hospital, University Medical Center Utrecht, 3584 EA Utrecht, The Netherlands;
| | - Marije Hogeveen
- Department of Neonatology, Amalia Children’s Hospital, Radboud University Medical Center, Geert Grooteplein Zuid 32, 6525 GA Nijmegen, The Netherlands; (P.E.v.B.); (M.H.); (R.G.)
| | - Julia S. Meijer
- Department of Neonatology, Máxima Medical Center, 5504 DB Veldhoven, The Netherlands; (P.A.); (J.S.M.)
| | - Sylvia A. Obermann-Borst
- Care4Neo, Dutch Neonatal Patient and Parent Advocacy Organization, 3068 JN Rotterdam, The Netherlands; (S.A.O.-B.); (M.J.V.)
| | - Wes Onland
- Department of Neonatology, Emma Children’s Hospital, Amsterdam University Medical Center, 1105 AZ Amsterdam, The Netherlands;
- Amsterdam Reproduction & Development, 1105 AZ Amsterdam, The Netherlands
| | | | - Marijn J. Vermeulen
- Care4Neo, Dutch Neonatal Patient and Parent Advocacy Organization, 3068 JN Rotterdam, The Netherlands; (S.A.O.-B.); (M.J.V.)
- Department of Neonatal and Pediatric Intensive Care, Division of Neonatology, Sophia Children’s Hospital, Erasmus Medical Center, 3015 CN Rotterdam, The Netherlands
| | - E. J. T. (Joanne) Verweij
- Department of Obstetrics and Gynecology, Leiden University Medical Centre, 2333 ZA Leiden, The Netherlands;
| | - Lien De Proost
- Department of Ethics and Law, Leiden University Medical Centre, 2333 ZA Leiden, The Netherlands;
| | - Rosa Geurtzen
- Department of Neonatology, Amalia Children’s Hospital, Radboud University Medical Center, Geert Grooteplein Zuid 32, 6525 GA Nijmegen, The Netherlands; (P.E.v.B.); (M.H.); (R.G.)
| |
Collapse
|
12
|
Roschewitz M, Khara G, Yearsley J, Sharma N, James JJ, Ambrózay É, Heroux A, Kecskemethy P, Rijken T, Glocker B. Automatic correction of performance drift under acquisition shift in medical image classification. Nat Commun 2023; 14:6608. [PMID: 37857643 PMCID: PMC10587231 DOI: 10.1038/s41467-023-42396-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 10/10/2023] [Indexed: 10/21/2023] Open
Abstract
Image-based prediction models for disease detection are sensitive to changes in data acquisition such as the replacement of scanner hardware or updates to the image processing software. The resulting differences in image characteristics may lead to drifts in clinically relevant performance metrics which could cause harm in clinical decision making, even for models that generalise in terms of area under the receiver-operating characteristic curve. We propose Unsupervised Prediction Alignment, a generic automatic recalibration method that requires no ground truth annotations and only limited amounts of unlabelled example images from the shifted data distribution. We illustrate the effectiveness of the proposed method to detect and correct performance drift in mammography-based breast cancer screening and on publicly available histopathology data. We show that the proposed method can preserve the expected performance in terms of sensitivity/specificity under various realistic scenarios of image acquisition shift, thus offering an important safeguard for clinical deployment.
Collapse
Affiliation(s)
- Mélanie Roschewitz
- Kheiron Medical Technologies, London, UK.
- Imperial College London, Department of Computing, London, UK.
| | | | | | - Nisha Sharma
- Leeds Teaching Hospital NHS Trust, Department of Radiology, Leeds, UK
| | - Jonathan J James
- Nottingham University Hospitals NHS Trust, Nottingham City Hospital, Nottingham Breast Institute, Nottingham, UK
| | | | | | | | | | - Ben Glocker
- Kheiron Medical Technologies, London, UK.
- Imperial College London, Department of Computing, London, UK.
| |
Collapse
|
13
|
Vaid A, Sawant A, Suarez-Farinas M, Lee J, Kaul S, Kovatch P, Freeman R, Jiang J, Jayaraman P, Fayad Z, Argulian E, Lerakis S, Charney AW, Wang F, Levin M, Glicksberg B, Narula J, Hofer I, Singh K, Nadkarni GN. Implications of the Use of Artificial Intelligence Predictive Models in Health Care Settings : A Simulation Study. Ann Intern Med 2023; 176:1358-1369. [PMID: 37812781 DOI: 10.7326/m23-0949] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/11/2023] Open
Abstract
BACKGROUND Substantial effort has been directed toward demonstrating uses of predictive models in health care. However, implementation of these models into clinical practice may influence patient outcomes, which in turn are captured in electronic health record data. As a result, deployed models may affect the predictive ability of current and future models. OBJECTIVE To estimate changes in predictive model performance with use through 3 common scenarios: model retraining, sequentially implementing 1 model after another, and intervening in response to a model when 2 are simultaneously implemented. DESIGN Simulation of model implementation and use in critical care settings at various levels of intervention effectiveness and clinician adherence. Models were either trained or retrained after simulated implementation. SETTING Admissions to the intensive care unit (ICU) at Mount Sinai Health System (New York, New York) and Beth Israel Deaconess Medical Center (Boston, Massachusetts). PATIENTS 130 000 critical care admissions across both health systems. INTERVENTION Across 3 scenarios, interventions were simulated at varying levels of clinician adherence and effectiveness. MEASUREMENTS Statistical measures of performance, including threshold-independent (area under the curve) and threshold-dependent measures. RESULTS At fixed 90% sensitivity, in scenario 1 a mortality prediction model lost 9% to 39% specificity after retraining once and in scenario 2 a mortality prediction model lost 8% to 15% specificity when created after the implementation of an acute kidney injury (AKI) prediction model; in scenario 3, models for AKI and mortality prediction implemented simultaneously, each led to reduced effective accuracy of the other by 1% to 28%. LIMITATIONS In real-world practice, the effectiveness of and adherence to model-based recommendations are rarely known in advance. Only binary classifiers for tabular ICU admissions data were simulated. CONCLUSION In simulated ICU settings, a universally effective model-updating approach for maintaining model performance does not seem to exist. Model use may have to be recorded to maintain viability of predictive modeling. PRIMARY FUNDING SOURCE National Center for Advancing Translational Sciences.
Collapse
Affiliation(s)
- Akhil Vaid
- Division of Data-Driven and Digital Medicine, Department of Medicine, and The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (A.V., P.J.)
| | - Ashwin Sawant
- Division of Data-Driven and Digital Medicine, Department of Medicine; The Charles Bronfman Institute of Personalized Medicine; and Division of Hospital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (A.S.)
| | - Mayte Suarez-Farinas
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, New York (M.S., J.L.)
| | - Juhee Lee
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, New York (M.S., J.L.)
| | - Sanjeev Kaul
- Department of Surgery, Hackensack Meridian School of Medicine, Nutley, New Jersey (S.K.)
| | - Patricia Kovatch
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York (P.K., B.G.)
| | - Robert Freeman
- Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (R.F.)
| | - Joy Jiang
- The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (J.J.)
| | - Pushkala Jayaraman
- Division of Data-Driven and Digital Medicine, Department of Medicine, and The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (A.V., P.J.)
| | - Zahi Fayad
- BioMedical Engineering and Imaging Institute and Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (Z.F.)
| | - Edgar Argulian
- Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (E.A., S.L., J.N.)
| | - Stamatios Lerakis
- Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (E.A., S.L., J.N.)
| | - Alexander W Charney
- The Charles Bronfman Institute of Personalized Medicine and Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York, and Department of Surgery, Hackensack Meridian School of Medicine, Nutley, New Jersey (A.W.C.)
| | - Fei Wang
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York (F.W.)
| | - Matthew Levin
- Department of Anesthesiology, Perioperative and Pain Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (M.L.)
| | - Benjamin Glicksberg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York (P.K., B.G.)
| | - Jagat Narula
- Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (E.A., S.L., J.N.)
| | - Ira Hofer
- Division of Data-Driven and Digital Medicine, Department of Medicine; The Charles Bronfman Institute of Personalized Medicine; and Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York (I.H.)
| | - Karandeep Singh
- Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, Michigan (K.S.)
| | - Girish N Nadkarni
- Division of Data-Driven and Digital Medicine, Department of Medicine; The Charles Bronfman Institute of Personalized Medicine; and Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (G.N.N.)
| |
Collapse
|
14
|
Spencer KL, Absolom KL, Allsop MJ, Relton SD, Pearce J, Liao K, Naseer S, Salako O, Howdon D, Hewison J, Velikova G, Faivre-Finn C, Bekker HL, van der Veer SN. Fixing the Leaky Pipe: How to Improve the Uptake of Patient-Reported Outcomes-Based Prognostic and Predictive Models in Cancer Clinical Practice. JCO Clin Cancer Inform 2023; 7:e2300070. [PMID: 37976441 PMCID: PMC10681558 DOI: 10.1200/cci.23.00070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 09/08/2023] [Accepted: 09/29/2023] [Indexed: 11/19/2023] Open
Abstract
PURPOSE This discussion paper outlines challenges and proposes solutions for successfully implementing prediction models that incorporate patient-reported outcomes (PROs) in cancer practice. METHODS We organized a full-day multidisciplinary meeting of people with expertise in cancer care delivery, PRO collection, PRO use in prediction modeling, computing, implementation, and decision science. The discussions presented here focused on identifying challenges to the development, implementation and use of prediction models incorporating PROs, and suggesting possible solutions. RESULTS Specific challenges and solutions were identified across three broad areas. (1) Understanding decision making and implementation: necessitating multidisciplinary collaboration in the early stages and throughout; early stakeholder engagement to define the decision problem and ensure acceptability of PROs in prediction; understanding patient/clinician interpretation of PRO predictions and uncertainty to optimize prediction impact; striving for model integration into existing electronic health records; and early regulatory alignment. (2) Recognizing the limitations to PRO collection and their impact on prediction: incorporating validated, clinically important PROs to maximize model generalizability and clinical engagement; and minimizing missing PRO data (resulting from both structural digital exclusion and time-varying factors) to avoid exacerbating existing inequalities. (3) Statistical and modeling challenges: incorporating statistical methods to address missing data; ensuring predictive modeling recognizes complex causal relationships; and considering temporal and geographic recalibration so that model predictions reflect the relevant population. CONCLUSION Developing and implementing PRO-based prediction models in cancer care requires extensive multidisciplinary working from the earliest stages, recognition of implementation challenges because of PRO collection and model presentation, and robust statistical methods to manage missing data, causality, and calibration. Prediction models incorporating PROs should be viewed as complex interventions, with their development and impact assessment carried out to reflect this.
Collapse
Affiliation(s)
- Katie L. Spencer
- Leeds Institute of Health Sciences, University of Leeds, Leeds, United Kingdom
- Leeds Cancer Centre, Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom
| | - Kate L. Absolom
- Leeds Institute of Health Sciences, University of Leeds, Leeds, United Kingdom
| | - Matthew J. Allsop
- Leeds Institute of Health Sciences, University of Leeds, Leeds, United Kingdom
| | - Samuel D. Relton
- Leeds Institute of Data Analytics, University of Leeds, Leeds, United Kingdom
| | - Jessica Pearce
- Leeds Cancer Centre, Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom
- Leeds Institute of Medical Research, University of Leeds, Leeds, United Kingdom
| | - Kuan Liao
- Division of Informatics, Imaging and Data Sciences, Faculty of Biology, Medicine and Health, Centre for Health Informatics, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom
| | - Sairah Naseer
- School of Medicine, University of Leeds, Leeds, United Kingdom
| | - Omolola Salako
- College of Medicine, University of Lagos, Lagos, Nigeria
| | - Daniel Howdon
- Leeds Institute of Health Sciences, University of Leeds, Leeds, United Kingdom
| | - Jenny Hewison
- Leeds Institute of Health Sciences, University of Leeds, Leeds, United Kingdom
| | - Galina Velikova
- Leeds Cancer Centre, Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom
- Leeds Institute of Medical Research, University of Leeds, Leeds, United Kingdom
| | - Corinne Faivre-Finn
- Institute of Cancer Sciences, University of Manchester, Manchester, United Kingdom
| | - Hilary L. Bekker
- Leeds Institute of Health Sciences, University of Leeds, Leeds, United Kingdom
| | - Sabine N. van der Veer
- Division of Informatics, Imaging and Data Sciences, Faculty of Biology, Medicine and Health, Centre for Health Informatics, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom
| |
Collapse
|
15
|
Fabricius J, Pedersen AR. Subacute prognosis of oral nutrition (SPOON): Development of a multivariable prognostic model for complete oral intake in tube-fed subjects with acquired brain injury. Clin Nutr 2023; 42:1770-1777. [PMID: 37572580 DOI: 10.1016/j.clnu.2023.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 06/16/2023] [Accepted: 07/10/2023] [Indexed: 08/14/2023]
Abstract
BACKGROUND & AIMS Enteral feeding is very common following an acquired brain injury. However, no prognostic models for oral food intake have been developed for subacute rehabilitation. The aim of this study was to develop a prognostic model and online tool, coined "subacute prognosis of oral nutrition" (SPOON), for complete oral intake in tube-fed subjects with acquired brain injury. METHODS The model was developed using routinely gathered clinical data from a cohort of 1233 adult patients who were tube-fed at admission for sub-acute inpatient rehabilitation. Candidate predictors were included based on scientific evidence and their availability in the medical records within the first days following admission. The outcome was time until achieving complete oral food intake without any tube-feeding supplements. Time until complete oral intake was analyzed by discrete time-to-event analysis with logit-link and presented as daily odds ratios (OR) with 95% confidence intervals (CI). RESULTS The following predictors of complete oral intake were included in the model: age, diagnosis, cuffed tracheostomy tube, days from injury to admission for rehabilitation, and the Early Functional Abilities (EFA) sum score. Multiple adjusted analyses were performed stratified by cuffed tracheostomy tube status. Some of the strongest predictors of complete oral intake were age 18-40 years, OR 1.99 (95%CI: 1.53; 2.59); 0-2 weeks since injury, OR 3.75 (95%CI: 2.72; 5.16); and EFA 61-100 (slight/no disturbance in function), OR 5.81 (95%CI: 4.47; 7.55). The online prognostic tool SPOON was evaluated in a usability study. Based on feedback from clinicians, the tool was further refined to enable extraction of data for prediction directly from medical records. CONCLUSIONS The objective of SPOON is to complement the planning of rehabilitation initiatives and inform discussions to determine if a percutaneous endoscopic gastrostomy (PEG) tube should be inserted. SPOON is being implemented locally, but external validation based on appropriate data modeling is warranted before further clinical implementation.
Collapse
Affiliation(s)
- Jesper Fabricius
- Hammel Neurorehabilitation Centre and University Research Clinic, Aarhus University, Voldbyvej 15, 8450, Hammel, Denmark.
| | - Asger Roer Pedersen
- Hammel Neurorehabilitation Centre and University Research Clinic, Aarhus University, Voldbyvej 15, 8450, Hammel, Denmark; University Research Clinic for Innovative Patient Pathways, Diagnostic Centre, Silkeborg, Denmark
| |
Collapse
|
16
|
Shah DA, De Wolf ED, Paul PA, Madden LV. Into the Trees: Random Forests for Predicting Fusarium Head Blight Epidemics of Wheat in the United States. PHYTOPATHOLOGY 2023; 113:1483-1493. [PMID: 36880796 DOI: 10.1094/phyto-10-22-0380-r] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Constructing models that accurately predict Fusarium head blight (FHB) epidemics and are also amenable to large-scale deployment is a challenging task. In the United States, the emphasis has been on simple logistic regression (LR) models, which are easy to implement but may suffer from lower accuracies when compared with more complicated, harder-to-deploy (over large geographies) model frameworks such as functional or boosted regressions. This article examined the plausibility of random forests (RFs) for the binary prediction of FHB epidemics as a possible mediation between model simplicity and complexity without sacrificing accuracy. A minimalist set of predictors was also desirable rather than having the RF model use all 90 candidate variables as predictors. The input predictor set was filtered with the aid of three RF variable selection algorithms (Boruta, varSelRF, and VSURF), using resampling techniques to quantify the variability and stability of selected variable sets. Post-selection filtering produced 58 competitive RF models with no more than 14 predictors each. One variable representing temperature stability in the 20 days before anthesis was the most frequently selected predictor. This was a departure from the prominence of relative humidity-based variables previously reported in LR models for FHB. The RF models had overall superior predictive performance over the LR models and may be suitable candidates for use by the Fusarium Head Blight Prediction Center.
Collapse
Affiliation(s)
- Denis A Shah
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506
| | - Erick D De Wolf
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506
| | - Pierce A Paul
- Department of Plant Pathology, The Ohio State University, Ohio Agricultural Research and Development Center, Wooster, OH 44691
| | - Laurence V Madden
- Department of Plant Pathology, The Ohio State University, Ohio Agricultural Research and Development Center, Wooster, OH 44691
| |
Collapse
|
17
|
Rockenschaub P, Gill MJ, McNulty D, Carroll O, Freemantle N, Shallcross L. Can the application of machine learning to electronic health records guide antibiotic prescribing decisions for suspected urinary tract infection in the Emergency Department? PLOS DIGITAL HEALTH 2023; 2:e0000261. [PMID: 37310941 DOI: 10.1371/journal.pdig.0000261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 04/25/2023] [Indexed: 06/15/2023]
Abstract
Urinary tract infections (UTIs) are a major cause of emergency hospital admissions, but it remains challenging to diagnose them reliably. Application of machine learning (ML) to routine patient data could support clinical decision-making. We developed a ML model predicting bacteriuria in the ED and evaluated its performance in key patient groups to determine scope for its future use to improve UTI diagnosis and thus guide antibiotic prescribing decisions in clinical practice. We used retrospective electronic health records from a large UK hospital (2011-2019). Non-pregnant adults who attended the ED and had a urine sample cultured were eligible for inclusion. The primary outcome was predominant bacterial growth ≥104 cfu/mL in urine. Predictors included demography, medical history, ED diagnoses, blood tests, and urine flow cytometry. Linear and tree-based models were trained via repeated cross-validation, re-calibrated, and validated on data from 2018/19. Changes in performance were investigated by age, sex, ethnicity, and suspected ED diagnosis, and compared to clinical judgement. Among 12,680 included samples, 4,677 (36.9%) showed bacterial growth. Relying primarily on flow cytometry parameters, our best model achieved an area under the ROC curve (AUC) of 0.813 (95% CI 0.792-0.834) in the test data, and achieved both higher sensitivity and specificity compared to proxies of clinician's judgement. Performance remained stable for white and non-white patients but was lower during a period of laboratory procedure change in 2015, in patients ≥65 years (AUC 0.783, 95% CI 0.752-0.815), and in men (AUC 0.758, 95% CI 0.717-0.798). Performance was also slightly reduced in patients with recorded suspicion of UTI (AUC 0.797, 95% CI 0.765-0.828). Our results suggest scope for use of ML to inform antibiotic prescribing decisions by improving diagnosis of suspected UTI in the ED, but performance varied with patient characteristics. Clinical utility of predictive models for UTI is therefore likely to differ for important patient subgroups including women <65 years, women ≥65 years, and men. Tailored models and decision thresholds may be required that account for differences in achievable performance, background incidence, and risks of infectious complications in these groups.
Collapse
Affiliation(s)
- Patrick Rockenschaub
- Institute of Health Informatics, University College London, London, United Kingdom
| | - Martin J Gill
- Department of Clinical Microbiology, Queen Elizabeth Hospital Birmingham, University Hospitals Birmingham NHS Foundation Trust, Birmingham, United Kingdom
| | - Dave McNulty
- Health Informatics, University Hospitals Birmingham NHS Foundation Trust, Birmingham, United Kingdom
| | - Orlagh Carroll
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Nick Freemantle
- Institute of Clinical Trials and Methodology, University College London, London, United Kingdom
| | - Laura Shallcross
- Institute of Health Informatics, University College London, London, United Kingdom
| |
Collapse
|
18
|
Naye F, Décary S, Houle C, LeBlanc A, Cook C, Dugas M, Skidmore B, Tousignant-Laflamme Y. Six Externally Validated Prognostic Models Have Potential Clinical Value to Predict Patient Health Outcomes in the Rehabilitation of Musculoskeletal Conditions: A Systematic Review. Phys Ther 2023; 103:7066982. [PMID: 37245218 DOI: 10.1093/ptj/pzad021] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 06/21/2022] [Accepted: 01/06/2023] [Indexed: 05/30/2023]
Abstract
OBJECTIVE The purpose of this systematic review was to identify and appraise externally validated prognostic models to predict a patient's health outcomes relevant to physical rehabilitation of musculoskeletal (MSK) conditions. METHODS We systematically reviewed 8 databases and reported our findings according to Preferred Reporting Items for Systematic Reviews and Meta-Analysis 2020. An information specialist designed a search strategy to identify externally validated prognostic models for MSK conditions. Paired reviewers independently screened the title, abstract, and full text and conducted data extraction. We extracted characteristics of included studies (eg, country and study design), prognostic models (eg, performance measures and type of model) and predicted clinical outcomes (eg, pain and disability). We assessed the risk of bias and concerns of applicability using the prediction model risk of bias assessment tool. We proposed and used a 5-step method to determine which prognostic models were clinically valuable. RESULTS We found 4896 citations, read 300 full-text articles, and included 46 papers (37 distinct models). Prognostic models were externally validated for the spine, upper limb, lower limb conditions, and MSK trauma, injuries, and pain. All studies presented a high risk of bias. Half of the models showed low concerns for applicability. Reporting of calibration and discrimination performance measures was often lacking. We found 6 externally validated models with adequate measures, which could be deemed clinically valuable [ie, (1) STart Back Screening Tool, (2) Wallis Occupational Rehabilitation RisK model, (3) Da Silva model, (4) PICKUP model, (5) Schellingerhout rule, and (6) Keene model]. Despite having a high risk of bias, which is mostly explained by the very conservative properties of the PROBAST tool, the 6 models remain clinically relevant. CONCLUSION We found 6 externally validated prognostic models developed to predict patients' health outcomes that were clinically relevant to the physical rehabilitation of MSK conditions. IMPACT Our results provide clinicians with externally validated prognostic models to help them better predict patients' clinical outcomes and facilitate personalized treatment plans. Incorporating clinically valuable prognostic models could inherently improve the value of care provided by physical therapists.
Collapse
Affiliation(s)
- Florian Naye
- School of Rehabilitation, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, Quebec, Canada
- Clinical Research of the Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), Sherbrooke, Quebec, Canada
| | - Simon Décary
- Department of Family Medicine and Emergency Medicine, Pavillon Ferdinand-Vandry, Université Laval, Quebec, Quebec, Canada
- Tier 1 Canada Research Chair in Shared Decision Making and Knowledge Translation, Centre de recherche sur les soins et les services de première ligne de l'Université Laval (CERSSPL-UL), Quebec, Quebec, Canada
| | - Catherine Houle
- School of Rehabilitation, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, Quebec, Canada
- Clinical Research of the Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), Sherbrooke, Quebec, Canada
| | - Annie LeBlanc
- Department of Family Medicine and Emergency Medicine, Pavillon Ferdinand-Vandry, Université Laval, Quebec, Quebec, Canada
| | - Chad Cook
- Physical Therapy Division, Duke University, Durham, North Carolina, USA
| | - Michèle Dugas
- VITAM Research Center, Centre Intégré Universitaire de Santé et de Services Sociaux de la Capitale-Nationale, Quebec, Quebec, Canada
| | - Becky Skidmore
- Independent Information Specialist, Ottawa, Ontario, Canada
| | - Yannick Tousignant-Laflamme
- School of Rehabilitation, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, Quebec, Canada
- Clinical Research of the Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), Sherbrooke, Quebec, Canada
| |
Collapse
|
19
|
Gaskins AJ, Zhang Y, Chang J, Kissin DM. Predicted probabilities of live birth following assisted reproductive technology using United States national surveillance data from 2016 to 2018. Am J Obstet Gynecol 2023; 228:557.e1-557.e10. [PMID: 36702210 PMCID: PMC11057011 DOI: 10.1016/j.ajog.2023.01.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 01/02/2023] [Accepted: 01/14/2023] [Indexed: 01/24/2023]
Abstract
BACKGROUND As the use of in vitro fertilization continues to increase in the United States, up-to-date models that estimate cumulative live birth rates after multiple oocyte retrievals and embryo transfers (fresh and frozen) are valuable for patients and clinicians weighing treatment options. OBJECTIVE This study aimed to develop models that generate predicted probabilities of live birth in individuals considering in vitro fertilization based on demographic and reproductive characteristics. STUDY DESIGN Our population-based cohort study used data from the National Assisted Reproductive Technology Surveillance System 2016 to 2018, including 196,916 women who underwent 207,766 autologous embryo transfer cycles and 25,831 women who underwent 36,909 donor oocyte transfer cycles. We used data on autologous in vitro fertilization cycles to develop models that estimate a patient's cumulative live birth rate after all embryo transfers (fresh and frozen) within 12 months after 1, 2, and 3 oocyte retrievals in new and returning patients. Among patients using donor oocytes, we estimated the cumulative live birth rate after their first, second, and third embryo transfers. Multinomial logistic regression models adjusted for age, prepregnancy body mass index (imputed for 18% of missing values), parity, gravidity, and infertility diagnoses were used to estimate the cumulative live birth rate. RESULTS Among new and returning patients undergoing autologous in vitro fertilization, female age had the strongest association with cumulative live birth rate. Other factors associated with higher cumulative live birth rates were lower body mass index and parity or gravidity ≥1, although results were inconsistent. Infertility diagnoses of diminished ovarian reserve, uterine factor, and other reasons were associated with a lower cumulative live birth rate, whereas male factor, tubal factor, ovulatory disorders, and unexplained infertility were associated with a higher cumulative live birth rate. Based on our models, a new patient who is 35 years old, with a body mass index of 25 kg/m2, no previous pregnancy, and unexplained infertility diagnoses, has a 48%, 69%, and 80% cumulative live birth rate after the first, second, and third oocyte retrieval, respectively. Cumulative live birth rates are 29%, 48%, and 62%, respectively, if the patient had diminished ovarian reserve, and 25%, 41%, and 52%, respectively, if the patient was 40 years old (with unexplained infertility). Very few recipient characteristics were associated with cumulative live birth rate in donor oocyte patients. CONCLUSION Our models provided estimates of cumulative live birth rate based on demographic and reproductive characteristics to help inform patients and providers of a woman's probability of success after in vitro fertilization.
Collapse
Affiliation(s)
- Audrey J Gaskins
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA; Division of Reproductive Health, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA.
| | - Yujia Zhang
- Division of Reproductive Health, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA
| | - Jeani Chang
- Division of Reproductive Health, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA
| | - Dmitry M Kissin
- Division of Reproductive Health, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA
| |
Collapse
|
20
|
Nusman CM, Snoek L, van Leeuwen LM, Dierikx TH, van der Weijden BM, Achten NB, Bijlsma MW, Visser DH, van Houten MA, Bekker V, de Meij TGJ, van Rossem E, Felderhof M, Plötz FB. Group B Streptococcus Early-Onset Disease: New Preventive and Diagnostic Tools to Decrease the Burden of Antibiotic Use. Antibiotics (Basel) 2023; 12:antibiotics12030489. [PMID: 36978356 PMCID: PMC10044457 DOI: 10.3390/antibiotics12030489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/18/2023] [Accepted: 02/24/2023] [Indexed: 03/05/2023] Open
Abstract
The difficulty in recognizing early-onset neonatal sepsis (EONS) in a timely manner due to non-specific symptoms and the limitations of diagnostic tests, combined with the risk of serious consequences if EONS is not treated in a timely manner, has resulted in a low threshold for starting empirical antibiotic treatment. New guideline strategies, such as the neonatal sepsis calculator, have been proven to reduce the antibiotic burden related to EONS, but lack sensitivity for detecting EONS. In this review, the potential of novel, targeted preventive and diagnostic methods for EONS is discussed from three different perspectives: maternal, umbilical cord and newborn perspectives. Promising strategies from the maternal perspective include Group B Streptococcus (GBS) prevention, exploring the virulence factors of GBS, maternal immunization and antepartum biomarkers. The diagnostic methods obtained from the umbilical cord are preliminary but promising. Finally, promising fields from the newborn perspective include biomarkers, new microbiological techniques and clinical prediction and monitoring strategies. Consensus on the definition of EONS and the standardization of research on novel diagnostic biomarkers are crucial for future implementation and to reduce current antibiotic overexposure in newborns.
Collapse
Affiliation(s)
- Charlotte M. Nusman
- Department of Paediatrics, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Linde Snoek
- Department of Neurology, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Lisanne M. van Leeuwen
- Department of Paediatrics and Department of Vaccin, Infection and Immunology, Spaarne Hospital, Boerhaavelaan 22, 2035 RC Haarlem, The Netherlands
- Department of Paediatrics, Willem Alexander Children Hospital, Leiden University Medical Center, Albinusdreef 2, 2333 ZA Leiden, The Netherlands
| | - Thomas H. Dierikx
- Department of Pediatric Gastroenterology, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- Amsterdam Gastroenterology Endocrinology Metabolism Research Institute, Meibergdreef 69-71, 1105 BK Amsterdam, The Netherlands
| | - Bo M. van der Weijden
- Department of Paediatrics, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- Department of Paediatrics, Tergooi Hospital, Rijksstraatweg 1, 1261 AN Blaricum, The Netherlands
| | - Niek B. Achten
- Department of Paediatrics, Erasmus University Medical Centre, Sophia Children’s Hospital, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| | - Merijn W. Bijlsma
- Department of Paediatrics, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Douwe H. Visser
- Department of Neonatology, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
| | - Marlies A. van Houten
- Department of Paediatrics and Department of Vaccin, Infection and Immunology, Spaarne Hospital, Boerhaavelaan 22, 2035 RC Haarlem, The Netherlands
| | - Vincent Bekker
- Division of Neonatology, Department of Pediatrics, Willem Alexander Children’s Hospital, Leiden University Medical Center, Albinusdreef 2, 2333 ZA Leiden, The Netherlands
| | - Tim G. J. de Meij
- Department of Pediatric Gastroenterology, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- Amsterdam Gastroenterology Endocrinology Metabolism Research Institute, Meibergdreef 69-71, 1105 BK Amsterdam, The Netherlands
| | - Ellen van Rossem
- Department of Paediatrics, Flevo Hospital, Hospitaalweg 1, 1315 RA Almere, The Netherlands
| | - Mariet Felderhof
- Department of Paediatrics, Flevo Hospital, Hospitaalweg 1, 1315 RA Almere, The Netherlands
| | - Frans B. Plötz
- Department of Paediatrics, Emma Children’s Hospital, Amsterdam University Medical Centre, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- Department of Paediatrics, Tergooi Hospital, Rijksstraatweg 1, 1261 AN Blaricum, The Netherlands
- Correspondence: ; Tel.: +31-88-753-3664
| |
Collapse
|
21
|
Abstract
BACKGROUND Clinical prediction models should be validated before implementation in clinical practice. But is favorable performance at internal validation or one external validation sufficient to claim that a prediction model works well in the intended clinical context? MAIN BODY We argue to the contrary because (1) patient populations vary, (2) measurement procedures vary, and (3) populations and measurements change over time. Hence, we have to expect heterogeneity in model performance between locations and settings, and across time. It follows that prediction models are never truly validated. This does not imply that validation is not important. Rather, the current focus on developing new models should shift to a focus on more extensive, well-conducted, and well-reported validation studies of promising models. CONCLUSION Principled validation strategies are needed to understand and quantify heterogeneity, monitor performance over time, and update prediction models when appropriate. Such strategies will help to ensure that prediction models stay up-to-date and safe to support clinical decision-making.
Collapse
|
22
|
Cavallazzi R, Bradley J, Chandler T, Furmanek S, Ramirez JA. Severity of Illness Scores and Biomarkers for Prognosis of Patients with Coronavirus Disease 2019. Semin Respir Crit Care Med 2023; 44:75-90. [PMID: 36646087 DOI: 10.1055/s-0042-1759567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The spectrum of disease severity and the insidiousness of clinical presentation make it difficult to recognize patients with coronavirus disease 2019 (COVID-19) at higher risk of worse outcomes or death when they are seen in the early phases of the disease. There are now well-established risk factors for worse outcomes in patients with COVID-19. These should be factored in when assessing the prognosis of these patients. However, a more precise prognostic assessment in an individual patient may warrant the use of predictive tools. In this manuscript, we conduct a literature review on the severity of illness scores and biomarkers for the prognosis of patients with COVID-19. Several COVID-19-specific scores have been developed since the onset of the pandemic. Some of them are promising and can be integrated into the assessment of these patients. We also found that the well-known pneumonia severity index (PSI) and CURB-65 (confusion, uremia, respiratory rate, BP, age ≥ 65 years) are good predictors of mortality in hospitalized patients with COVID-19. While neither the PSI nor the CURB-65 should be used for the triage of outpatient versus inpatient treatment, they can be integrated by a clinician into the assessment of disease severity and can be used in epidemiological studies to determine the severity of illness in patient populations. Biomarkers also provide valuable prognostic information and, importantly, may depict the main physiological derangements in severe disease. We, however, do not advocate the isolated use of severity of illness scores or biomarkers for decision-making in an individual patient. Instead, we suggest the use of these tools on a case-by-case basis with the goal of enhancing clinician judgment.
Collapse
Affiliation(s)
- Rodrigo Cavallazzi
- Division of Pulmonary, Critical Care Medicine, and Sleep Disorders, University of Louisville, Norton Healthcare, Louisville, Kentucky
| | - James Bradley
- Division of Pulmonary, Critical Care Medicine, and Sleep Disorders, University of Louisville, Norton Healthcare, Louisville, Kentucky
| | - Thomas Chandler
- Norton Infectious Diseases Institute, Norton Healthcare, Louisville, Kentucky
| | - Stephen Furmanek
- Norton Infectious Diseases Institute, Norton Healthcare, Louisville, Kentucky
| | - Julio A Ramirez
- Norton Infectious Diseases Institute, Norton Healthcare, Louisville, Kentucky
| |
Collapse
|
23
|
Hochstenbach LM, Determann D, Fijten RR, Bloemen-van Gurp EJ, Verwey R. Taking shared decision making for prostate cancer to the next level: Requirements for a Dutch treatment decision aid with personalized risks on side effects. Internet Interv 2023; 31:100606. [PMID: 36844795 PMCID: PMC9945792 DOI: 10.1016/j.invent.2023.100606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 12/16/2022] [Accepted: 01/30/2023] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND Different curative treatment modalities need to be considered in case of localized prostate cancer, all comparable in terms of survival and recurrence though different in side effects. To better inform patients and support shared decision making, the development of a web-based patient decision aid including personalized risk information was proposed. This paper reports on requirements in terms of content of information, visualization of risk profiles, and use in practice. METHODS Based on a Dutch 10-step guide about the setup of a decision aid next to a practice guideline, an iterative and co-creative design process was followed. In collaboration with various groups of experts (health professionals, usability and linguistic experts, patients and the general public), research and development activities were continuously alternated. RESULTS Content requirements focused on presenting information only about conventional treatments and main side effects; based on risk group; and including clear explanations about personalized risks. Visual requirements involved presenting general and personalized risks separately; through bar charts or icon arrays; and along with numbers or words, and legends. Organizational requirements included integration into local clinical pathways; agreement about information input and output; and focus on patients' numeracy and graph literacy skills. CONCLUSIONS The iterative and co-creative development process was challenging, though extremely valuable. The translation of requirements resulted in a decision aid about four conventional treatment options, including general or personalized risks for erection, urinary and intestinal problems that are communicated with icon arrays and numbers. Future implementation and validation studies need to inform about use and value in practice.
Collapse
Affiliation(s)
- Laura M.J. Hochstenbach
- Center of Expertise for Innovative Care and Technology (EIZT), School of Nursing, Zuyd University of Applied Sciences, P.O. Box 550, 6400 AN Heerlen, the Netherlands,Department of Health Services Research, Care and Public Health Research Institute (CAPHRI), Faculty of Health Medicine and Life Sciences, Maastricht University, P.O. Box 616, 6200 MD Maastricht, the Netherlands,Corresponding author at: Maastricht University, Department of Health Services Research, P.O. Box 616, 6200 MD, the Netherlands.
| | | | - Rianne R.R. Fijten
- Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, P.O. Box 616, 6200 MD Maastricht, the Netherlands
| | - Esther J. Bloemen-van Gurp
- Center of Expertise for Innovative Care and Technology (EIZT), School of Nursing, Zuyd University of Applied Sciences, P.O. Box 550, 6400 AN Heerlen, the Netherlands,Expertise Center Empowering Healthy Behavior, Fontys University of Applied Sciences, P.O. Box 347, 5600 AH Eindhoven, the Netherlands
| | - Renée Verwey
- Center of Expertise for Innovative Care and Technology (EIZT), School of Nursing, Zuyd University of Applied Sciences, P.O. Box 550, 6400 AN Heerlen, the Netherlands
| |
Collapse
|
24
|
Binuya MAE, Engelhardt EG, Schats W, Schmidt MK, Steyerberg EW. Methodological guidance for the evaluation and updating of clinical prediction models: a systematic review. BMC Med Res Methodol 2022; 22:316. [PMID: 36510134 PMCID: PMC9742671 DOI: 10.1186/s12874-022-01801-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 11/22/2022] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Clinical prediction models are often not evaluated properly in specific settings or updated, for instance, with information from new markers. These key steps are needed such that models are fit for purpose and remain relevant in the long-term. We aimed to present an overview of methodological guidance for the evaluation (i.e., validation and impact assessment) and updating of clinical prediction models. METHODS We systematically searched nine databases from January 2000 to January 2022 for articles in English with methodological recommendations for the post-derivation stages of interest. Qualitative analysis was used to summarize the 70 selected guidance papers. RESULTS Key aspects for validation are the assessment of statistical performance using measures for discrimination (e.g., C-statistic) and calibration (e.g., calibration-in-the-large and calibration slope). For assessing impact or usefulness in clinical decision-making, recent papers advise using decision-analytic measures (e.g., the Net Benefit) over simplistic classification measures that ignore clinical consequences (e.g., accuracy, overall Net Reclassification Index). Commonly recommended methods for model updating are recalibration (i.e., adjustment of intercept or baseline hazard and/or slope), revision (i.e., re-estimation of individual predictor effects), and extension (i.e., addition of new markers). Additional methodological guidance is needed for newer types of updating (e.g., meta-model and dynamic updating) and machine learning-based models. CONCLUSION Substantial guidance was found for model evaluation and more conventional updating of regression-based models. An important development in model evaluation is the introduction of a decision-analytic framework for assessing clinical usefulness. Consensus is emerging on methods for model updating.
Collapse
Affiliation(s)
- M. A. E. Binuya
- grid.430814.a0000 0001 0674 1393Division of Molecular Pathology, the Netherlands Cancer Institute – Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands ,grid.10419.3d0000000089452978Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands ,grid.10419.3d0000000089452978Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - E. G. Engelhardt
- grid.430814.a0000 0001 0674 1393Division of Molecular Pathology, the Netherlands Cancer Institute – Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands ,grid.430814.a0000 0001 0674 1393Division of Psychosocial Research and Epidemiology, the Netherlands Cancer Institute – Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
| | - W. Schats
- grid.430814.a0000 0001 0674 1393Scientific Information Service, The Netherlands Cancer Institute – Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
| | - M. K. Schmidt
- grid.430814.a0000 0001 0674 1393Division of Molecular Pathology, the Netherlands Cancer Institute – Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands ,grid.10419.3d0000000089452978Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - E. W. Steyerberg
- grid.10419.3d0000000089452978Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
25
|
Zhang X, Liu K, Yuan B, Wang H, Chen S, Xue Y, Chen W, Liu M, Hu Y. A hybrid adaptive approach for instance transfer learning with dynamic and imbalanced data. INT J INTELL SYST 2022; 37:11582-11599. [PMID: 36816520 PMCID: PMC9936919 DOI: 10.1002/int.23055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Accepted: 08/16/2022] [Indexed: 11/06/2022]
Abstract
Machine learning has demonstrated success in clinical risk prediction modeling with complex electronic health record data. However, the evolving nature of clinical practices can dynamically change the underlying data distribution over time, leading to model performance drift. Adopting an outdated model is potentially risky and may result in unintentional losses. In this paper, we propose a novel Hybrid Adaptive Boosting approach (HA-Boost) for transfer learning. HA-Boost is characterized by the domain similarity-based and class imbalance-based adaptation mechanisms, which simultaneously address two critical limitations of the classical TrAdaBoost algorithm. We validated HA-Boost in predicting hospital-acquired acute kidney injury using real-world longitudinal electronic health records data. The experiment results demonstrate that HA-Boost stably outperforms the competing baselines in terms of both AUROC and AUPRC across a 7-year time span. This study has confirmed the effectiveness of transfer learning as a superior model updating approach in dynamic environment.
Collapse
Affiliation(s)
- Xiangzhou Zhang
- Big Data Decision Institute, Jinan University, Guangzhou, China
| | - Kang Liu
- Big Data Decision Institute, Jinan University, Guangzhou, China
- School of Management, Jinan University, Guangzhou, China
| | - Borong Yuan
- Big Data Decision Institute, Jinan University, Guangzhou, China
- College of Information Science and Technology, Jinan University, Guangzhou, China
| | - Hongnian Wang
- Big Data Decision Institute, Jinan University, Guangzhou, China
- School of Management, Jinan University, Guangzhou, China
| | - Shaoyong Chen
- Big Data Decision Institute, Jinan University, Guangzhou, China
- College of Information Science and Technology, Jinan University, Guangzhou, China
| | - Yunfei Xue
- Big Data Decision Institute, Jinan University, Guangzhou, China
- College of Information Science and Technology, Jinan University, Guangzhou, China
| | - Weiqi Chen
- Big Data Decision Institute, Jinan University, Guangzhou, China
| | - Mei Liu
- Division of Medical Informatics, University of Kansas Medical Center, Kansas City, KS, United States of America
| | - Yong Hu
- Big Data Decision Institute, Jinan University, Guangzhou, China
| |
Collapse
|
26
|
Ebrahimi A, Wiil UK, Naemi A, Mansourvar M, Andersen K, Nielsen AS. Identification of clinical factors related to prediction of alcohol use disorder from electronic health records using feature selection methods. BMC Med Inform Decis Mak 2022; 22:304. [PMID: 36424597 PMCID: PMC9686074 DOI: 10.1186/s12911-022-02051-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Accepted: 11/16/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND High dimensionality in electronic health records (EHR) causes a significant computational problem for any systematic search for predictive, diagnostic, or prognostic patterns. Feature selection (FS) methods have been indicated to be effective in feature reduction as well as in identifying risk factors related to prediction of clinical disorders. This paper examines the prediction of patients with alcohol use disorder (AUD) using machine learning (ML) and attempts to identify risk factors related to the diagnosis of AUD. METHODS A FS framework consisting of two operational levels, base selectors and ensemble selectors. The first level consists of five FS methods: three filter methods, one wrapper method, and one embedded method. Base selector outputs are aggregated to develop four ensemble FS methods. The outputs of FS method were then fed into three ML algorithms: support vector machine (SVM), K-nearest neighbor (KNN), and random forest (RF) to compare and identify the best feature subset for the prediction of AUD from EHRs. RESULTS In terms of feature reduction, the embedded FS method could significantly reduce the number of features from 361 to 131. In terms of classification performance, RF based on 272 features selected by our proposed ensemble method (Union FS) with the highest accuracy in predicting patients with AUD, 96%, outperformed all other models in terms of AUROC, AUPRC, Precision, Recall, and F1-Score. Considering the limitations of embedded and wrapper methods, the best overall performance was achieved by our proposed Union Filter FS, which reduced the number of features to 223 and improved Precision, Recall, and F1-Score in RF from 0.77, 0.65, and 0.71 to 0.87, 0.81, and 0.84, respectively. Our findings indicate that, besides gender, age, and length of stay at the hospital, diagnosis related to digestive organs, bones, muscles and connective tissue, and the nervous systems are important clinical factors related to the prediction of patients with AUD. CONCLUSION Our proposed FS method could improve the classification performance significantly. It could identify clinical factors related to prediction of AUD from EHRs, thereby effectively helping clinical staff to identify and treat AUD patients and improving medical knowledge of the AUD condition. Moreover, the diversity of features among female and male patients as well as gender disparity were investigated using FS methods and ML techniques.
Collapse
Affiliation(s)
- Ali Ebrahimi
- grid.10825.3e0000 0001 0728 0170SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | - Uffe Kock Wiil
- grid.10825.3e0000 0001 0728 0170SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | - Amin Naemi
- grid.10825.3e0000 0001 0728 0170SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | - Marjan Mansourvar
- grid.10825.3e0000 0001 0728 0170Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - Kjeld Andersen
- grid.10825.3e0000 0001 0728 0170Unit for Clinical Alcohol Research, Clinical Institute, University of Southern Denmark, Odense, Denmark
| | - Anette Søgaard Nielsen
- grid.10825.3e0000 0001 0728 0170Unit for Clinical Alcohol Research, Clinical Institute, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
27
|
van Klaveren D, Zanos TP, Nelson J, Levy TJ, Park JG, Retel Helmrich IRA, Rietjens JAC, Basile MJ, Hajizadeh N, Lingsma HF, Kent DM. Prognostic models for COVID-19 needed updating to warrant transportability over time and space. BMC Med 2022; 20:456. [PMID: 36424619 PMCID: PMC9686462 DOI: 10.1186/s12916-022-02651-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 11/04/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Supporting decisions for patients who present to the emergency department (ED) with COVID-19 requires accurate prognostication. We aimed to evaluate prognostic models for predicting outcomes in hospitalized patients with COVID-19, in different locations and across time. METHODS We included patients who presented to the ED with suspected COVID-19 and were admitted to 12 hospitals in the New York City (NYC) area and 4 large Dutch hospitals. We used second-wave patients who presented between September and December 2020 (2137 and 3252 in NYC and the Netherlands, respectively) to evaluate models that were developed on first-wave patients who presented between March and August 2020 (12,163 and 5831). We evaluated two prognostic models for in-hospital death: The Northwell COVID-19 Survival (NOCOS) model was developed on NYC data and the COVID Outcome Prediction in the Emergency Department (COPE) model was developed on Dutch data. These models were validated on subsequent second-wave data at the same site (temporal validation) and at the other site (geographic validation). We assessed model performance by the Area Under the receiver operating characteristic Curve (AUC), by the E-statistic, and by net benefit. RESULTS Twenty-eight-day mortality was considerably higher in the NYC first-wave data (21.0%), compared to the second-wave (10.1%) and the Dutch data (first wave 10.8%; second wave 10.0%). COPE discriminated well at temporal validation (AUC 0.82), with excellent calibration (E-statistic 0.8%). At geographic validation, discrimination was satisfactory (AUC 0.78), but with moderate over-prediction of mortality risk, particularly in higher-risk patients (E-statistic 2.9%). While discrimination was adequate when NOCOS was tested on second-wave NYC data (AUC 0.77), NOCOS systematically overestimated the mortality risk (E-statistic 5.1%). Discrimination in the Dutch data was good (AUC 0.81), but with over-prediction of risk, particularly in lower-risk patients (E-statistic 4.0%). Recalibration of COPE and NOCOS led to limited net benefit improvement in Dutch data, but to substantial net benefit improvement in NYC data. CONCLUSIONS NOCOS performed moderately worse than COPE, probably reflecting unique aspects of the early pandemic in NYC. Frequent updating of prognostic models is likely to be required for transportability over time and space during a dynamic pandemic.
Collapse
Affiliation(s)
- David van Klaveren
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands. .,Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA.
| | - Theodoros P Zanos
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Jason Nelson
- Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
| | - Todd J Levy
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Jinny G Park
- Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
| | - Isabel R A Retel Helmrich
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - Judith A C Rietjens
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - Melissa J Basile
- Division of Pulmonary Critical Care and Sleep Medicine, Department of Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell Health, Hempstead, NY, USA
| | - Negin Hajizadeh
- Division of Pulmonary Critical Care and Sleep Medicine, Department of Medicine, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell Health, Hempstead, NY, USA
| | - Hester F Lingsma
- Department of Public Health, Erasmus MC University Medical Center Rotterdam, Dr. Molewaterplein 50, 3015 GE, Rotterdam, The Netherlands
| | - David M Kent
- Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, USA
| |
Collapse
|
28
|
Levy TJ, Coppa K, Cang J, Barnaby DP, Paradis MD, Cohen SL, Makhnevich A, van Klaveren D, Kent DM, Davidson KW, Hirsch JS, Zanos TP. Development and validation of self-monitoring auto-updating prognostic models of survival for hospitalized COVID-19 patients. Nat Commun 2022; 13:6812. [PMID: 36357420 PMCID: PMC9648888 DOI: 10.1038/s41467-022-34646-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 11/02/2022] [Indexed: 11/12/2022] Open
Abstract
Clinical prognostic models can assist patient care decisions. However, their performance can drift over time and location, necessitating model monitoring and updating. Despite rapid and significant changes during the pandemic, prognostic models for COVID-19 patients do not currently account for these drifts. We develop a framework for continuously monitoring and updating prognostic models and apply it to predict 28-day survival in COVID-19 patients. We use demographic, laboratory, and clinical data from electronic health records of 34912 hospitalized COVID-19 patients from March 2020 until May 2022 and compare three modeling methods. Model calibration performance drift is immediately detected with minor fluctuations in discrimination. The overall calibration on the prospective validation cohort is significantly improved when comparing the dynamically updated models against their static counterparts. Our findings suggest that, using this framework, models remain accurate and well-calibrated across various waves, variants, race and sex and yield positive net-benefits.
Collapse
Affiliation(s)
- Todd J Levy
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
| | - Kevin Coppa
- Clinical Digital Solutions, Northwell Health, New Hyde Park, NY, 11042, USA
| | - Jinxuan Cang
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
| | - Douglas P Barnaby
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Marc D Paradis
- Northwell Holdings, Northwell Health, Manhasset, NY, 11030, USA
| | - Stuart L Cohen
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Alex Makhnevich
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - David van Klaveren
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, Netherlands
- Predictive Analytics and Comparative Effectiveness Center, Tufts Medical Center, Boston, MA, USA
| | - David M Kent
- Predictive Analytics and Comparative Effectiveness Center, Tufts Medical Center, Boston, MA, USA
| | - Karina W Davidson
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Jamie S Hirsch
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA
- Clinical Digital Solutions, Northwell Health, New Hyde Park, NY, 11042, USA
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA
| | - Theodoros P Zanos
- Institute of Health System Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA.
- Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, 11030, USA.
- Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY, 11549, USA.
| |
Collapse
|
29
|
Davis SE, Walsh CG, Matheny ME. Open questions and research gaps for monitoring and updating AI-enabled tools in clinical settings. Front Digit Health 2022; 4:958284. [PMID: 36120717 PMCID: PMC9478183 DOI: 10.3389/fdgth.2022.958284] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 08/11/2022] [Indexed: 11/15/2022] Open
Abstract
As the implementation of artificial intelligence (AI)-enabled tools is realized across diverse clinical environments, there is a growing understanding of the need for ongoing monitoring and updating of prediction models. Dataset shift-temporal changes in clinical practice, patient populations, and information systems-is now well-documented as a source of deteriorating model accuracy and a challenge to the sustainability of AI-enabled tools in clinical care. While best practices are well-established for training and validating new models, there has been limited work developing best practices for prospective validation and model maintenance. In this paper, we highlight the need for updating clinical prediction models and discuss open questions regarding this critical aspect of the AI modeling lifecycle in three focus areas: model maintenance policies, performance monitoring perspectives, and model updating strategies. With the increasing adoption of AI-enabled tools, the need for such best practices must be addressed and incorporated into new and existing implementations. This commentary aims to encourage conversation and motivate additional research across clinical and data science stakeholders.
Collapse
Affiliation(s)
- Sharon E. Davis
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States,Correspondence: Sharon E. Davis
| | - Colin G. Walsh
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States,Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, United States,Department of Psychiatry, Vanderbilt University Medical Center, Nashville, TN, United States
| | - Michael E. Matheny
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States,Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, United States,Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, United States,Tennessee Valley Healthcare System VA Medical Center, Veterans Health Administration, Nashville, TN, United States
| |
Collapse
|
30
|
Drayton DJ, Ayres M, Relton SD, Sperrin M, Hall M. Risk scores in anaesthesia: the future is hard to predict. BJA OPEN 2022; 3:100027. [PMID: 37588581 PMCID: PMC10430853 DOI: 10.1016/j.bjao.2022.100027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 06/29/2022] [Indexed: 08/18/2023]
Abstract
External validation helps to assess whether a given risk prediction model will perform well in a target population. Validation is an important step in maintaining the utility of risk prediction models, as their ability to provide reliable risk estimates will deteriorate over time (calibration drift).
Collapse
Affiliation(s)
| | | | - Samuel D. Relton
- Leeds Institute of Health Science, University of Leeds, Leeds, UK
| | - Matthew Sperrin
- Division of Informatics, Imaging & Data Sciences, University of Manchester, Manchester, UK
| | | |
Collapse
|
31
|
Safari A, Adibi A, Sin DD, Lee TY, Ho JK, Sadatsafavi M. ACCEPT 2·0: Recalibrating and externally validating the Acute COPD exacerbation prediction tool (ACCEPT). EClinicalMedicine 2022; 51:101574. [PMID: 35898315 PMCID: PMC9309408 DOI: 10.1016/j.eclinm.2022.101574] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 06/30/2022] [Accepted: 07/01/2022] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND The Acute Chronic Obstructive Pulmonary Disease (COPD) Exacerbation Prediction Tool (ACCEPT) was developed for individualised prediction of COPD exacerbations. ACCEPT was well calibrated overall and had a high discriminatory power, but overestimated risk among individuals without recent exacerbations. The objectives of this study were to 1) fine-tune ACCEPT to make better predictions for individuals with a negative exacerbation history, 2) develop more parsimonious models, and 3) externally validate the models in a new dataset. METHODS We recalibrated ACCEPT using data from the Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points (ECLIPSE, a three-year observational study, 1,803 patients, 2,117 exacerbations) study by applying non-parametric regression splines to the predicted rates. We developed three reduced versions of ACCEPT by removing symptom score and/or baseline medications as predictors. We examined the discrimination, calibration, and net benefit of ACCEPT 2·0 in the placebo arm of the Towards a Revolution in COPD Health (TORCH, a three-year randomised clinical trial of inhaled therapies in COPD, 1,091 patients, 1,064 exacerbations) study. The primary outcome for prediction was the occurrence of ≥2 moderate or ≥1 severe exacerbation in the next 12 months; the secondary outcomes were prediction of the occurrence of any moderate/severe exacerbation or any severe exacerbation. FINDINGS ACCEPT 2·0 had an area-under-the-curve (AUC) of 0·76 for predicting the primary outcome. Exacerbation history alone (current standard of care) had an AUC of 0·68. The model was well calibrated in patients with positive or negative exacerbation histories. Changes in AUC in reduced versions were minimal for the primary outcome as well as for predicting the occurrence of any moderate/severe exacerbations (ΔAUC<0·011), but more substantial for predicting the occurrence of any severe exacerbations (ΔAUC<0·020). All versions of ACCEPT 2·0 provided positive net benefit over the use of exacerbation history alone for some range of thresholds. INTERPRETATION ACCEPT 2·0 showed good calibration regardless of exacerbation history, and predicts exacerbation risk better than current standard of care for a range of thresholds. Future studies need to investigate the utility of exacerbation prediction in various subgroups of patients. FUNDING This study was funded by a team grant from the Canadian Institutes of Health Research (PHT 178432).
Collapse
Affiliation(s)
- Abdollah Safari
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
- Department of Mathematics, Statistics, and Computer Science, University of Tehran, Tehran, Iran
| | - Amin Adibi
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
| | - Don D. Sin
- Centre for Heart Lung Innovation, St. Paul's Hospital and Department of Medicine (Division of Respirology), The University of British Columbia, Vancouver, Canada
| | - Tae Yoon Lee
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
| | - Joseph Khoa Ho
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
| | - Mohsen Sadatsafavi
- Respiratory Evaluation Sciences Program, Collaboration for Outcomes Research and Evaluation, Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, Canada
- Centre for Heart Lung Innovation, St. Paul's Hospital and Department of Medicine (Division of Respirology), The University of British Columbia, Vancouver, Canada
- Corresponding author at: Room 4110, Faculty of Pharmaceutical Sciences, 2405 Wesbrook Mall, Vancouver, BC, V6T1Z3, Canada.
| |
Collapse
|
32
|
Liuzzi P, Magliacano A, De Bellis F, Mannini A, Estraneo A. Predicting outcome of patients with prolonged disorders of consciousness using machine learning models based on medical complexity. Sci Rep 2022; 12:13471. [PMID: 35931703 PMCID: PMC9356130 DOI: 10.1038/s41598-022-17561-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 07/27/2022] [Indexed: 12/25/2022] Open
Abstract
Patients with severe acquired brain injury and prolonged disorders of consciousness (pDoC) are characterized by high clinical complexity and high risk to develop medical complications. The present multi-center longitudinal study aimed at investigating the impact of medical complications on the prediction of clinical outcome by means of machine learning models. Patients with pDoC were consecutively enrolled at admission in 23 intensive neurorehabilitation units (IRU) and followed-up at 6 months from onset via the Glasgow Outcome Scale-Extended (GOSE). Demographic and clinical data at study entry and medical complications developed within 3 months from admission were collected. Machine learning models were developed, targeting neurological outcomes at 6 months from brain injury using data collected at admission. Then, after concatenating predictions of such models to the medical complications collected within 3 months, a cascade model was developed. One hundred seventy six patients with pDoC (M: 123, median age 60.2 years) were included in the analysis. At admission, the best performing solution (k-Nearest Neighbors regression, KNN) resulted in a median validation error of 0.59 points [IQR 0.14] and a classification accuracy of dichotomized GOS-E of 88.6%. Coherently, at 3 months, the best model resulted in a median validation error of 0.49 points [IQR 0.11] and a classification accuracy of 92.6%. Interpreting the admission KNN showed how the negative effect of older age is strengthened when patients' communication levels are high and ameliorated when no communication is present. The model trained at 3 months showed appropriate adaptation of the admission prediction according to the severity of the developed medical complexity in the first 3 months. In this work, we developed and cross-validated an interpretable decision support tool capable of distinguishing patients which will reach sufficient independence levels at 6 months (GOS-E > 4). Furthermore, we provide an updated prediction at 3 months, keeping in consideration the rehabilitative path and the risen medical complexity.
Collapse
Affiliation(s)
- Piergiuseppe Liuzzi
- IRCCS Fondazione Don Carlo Gnocchi ONLUS, Via di Scandicci 269, Florence, Italy.,Scuola Superiore Sant'Anna, Istituto di BioRobotica, Viale Rinaldo Piaggio 34, Pontedera, Italy
| | - Alfonso Magliacano
- Fondazione Don Carlo Gnocchi ONLUS, Scientific Institute for Research and Health Care, Via Quadrivio, Sant'Angelo dei Lombardi, Italy
| | - Francesco De Bellis
- Fondazione Don Carlo Gnocchi ONLUS, Scientific Institute for Research and Health Care, Via Quadrivio, Sant'Angelo dei Lombardi, Italy
| | - Andrea Mannini
- IRCCS Fondazione Don Carlo Gnocchi ONLUS, Via di Scandicci 269, Florence, Italy.
| | - Anna Estraneo
- Fondazione Don Carlo Gnocchi ONLUS, Scientific Institute for Research and Health Care, Via Quadrivio, Sant'Angelo dei Lombardi, Italy.,Unità di Neurologia, Santa Maria della Pietà General Hospital, Via della Repubblica 7, Nola, Italy
| |
Collapse
|
33
|
Reynard C, Jenkins D, Martin GP, Kontopantelis E, Body R. Is your clinical prediction model past its sell by date? Arch Emerg Med 2022; 39:956-958. [DOI: 10.1136/emermed-2021-212224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 06/22/2022] [Indexed: 11/04/2022]
|
34
|
de Jong VMT, Rousset RZ, Antonio-Villa NE, Buenen AG, Van Calster B, Bello-Chavolla OY, Brunskill NJ, Curcin V, Damen JAA, Fermín-Martínez CA, Fernández-Chirino L, Ferrari D, Free RC, Gupta RK, Haldar P, Hedberg P, Korang SK, Kurstjens S, Kusters R, Major RW, Maxwell L, Nair R, Naucler P, Nguyen TL, Noursadeghi M, Rosa R, Soares F, Takada T, van Royen FS, van Smeden M, Wynants L, Modrák M, Asselbergs FW, Linschoten M, Moons KGM, Debray TPA. Clinical prediction models for mortality in patients with covid-19: external validation and individual participant data meta-analysis. BMJ 2022; 378:e069881. [PMID: 35820692 PMCID: PMC9273913 DOI: 10.1136/bmj-2021-069881] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
OBJECTIVE To externally validate various prognostic models and scoring rules for predicting short term mortality in patients admitted to hospital for covid-19. DESIGN Two stage individual participant data meta-analysis. SETTING Secondary and tertiary care. PARTICIPANTS 46 914 patients across 18 countries, admitted to a hospital with polymerase chain reaction confirmed covid-19 from November 2019 to April 2021. DATA SOURCES Multiple (clustered) cohorts in Brazil, Belgium, China, Czech Republic, Egypt, France, Iran, Israel, Italy, Mexico, Netherlands, Portugal, Russia, Saudi Arabia, Spain, Sweden, United Kingdom, and United States previously identified by a living systematic review of covid-19 prediction models published in The BMJ, and through PROSPERO, reference checking, and expert knowledge. MODEL SELECTION AND ELIGIBILITY CRITERIA Prognostic models identified by the living systematic review and through contacting experts. A priori models were excluded that had a high risk of bias in the participant domain of PROBAST (prediction model study risk of bias assessment tool) or for which the applicability was deemed poor. METHODS Eight prognostic models with diverse predictors were identified and validated. A two stage individual participant data meta-analysis was performed of the estimated model concordance (C) statistic, calibration slope, calibration-in-the-large, and observed to expected ratio (O:E) across the included clusters. MAIN OUTCOME MEASURES 30 day mortality or in-hospital mortality. RESULTS Datasets included 27 clusters from 18 different countries and contained data on 46 914patients. The pooled estimates ranged from 0.67 to 0.80 (C statistic), 0.22 to 1.22 (calibration slope), and 0.18 to 2.59 (O:E ratio) and were prone to substantial between study heterogeneity. The 4C Mortality Score by Knight et al (pooled C statistic 0.80, 95% confidence interval 0.75 to 0.84, 95% prediction interval 0.72 to 0.86) and clinical model by Wang et al (0.77, 0.73 to 0.80, 0.63 to 0.87) had the highest discriminative ability. On average, 29% fewer deaths were observed than predicted by the 4C Mortality Score (pooled O:E 0.71, 95% confidence interval 0.45 to 1.11, 95% prediction interval 0.21 to 2.39), 35% fewer than predicted by the Wang clinical model (0.65, 0.52 to 0.82, 0.23 to 1.89), and 4% fewer than predicted by Xie et al's model (0.96, 0.59 to 1.55, 0.21 to 4.28). CONCLUSION The prognostic value of the included models varied greatly between the data sources. Although the Knight 4C Mortality Score and Wang clinical model appeared most promising, recalibration (intercept and slope updates) is needed before implementation in routine care.
Collapse
Affiliation(s)
- Valentijn M T de Jong
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Netherlands
- Data Analytics and Methods Task Force, European Medicines Agency, Amsterdam, Netherlands
| | - Rebecca Z Rousset
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Neftalí Eduardo Antonio-Villa
- Dirección de Investigación, Instituto Nacional de Geriatría, Mexico City, Mexico
- MD/PhD (PECEM) Program, Faculty of Medicine, National Autonomous University of Mexico, Mexico City, Mexico
| | | | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, Netherlands
- EPI-centre, KU Leuven, Leuven, Belgium
| | | | - Nigel J Brunskill
- Department of Cardiovascular Sciences, College of Life Sciences, University of Leicester, Leicester, UK
- John Walls Renal Unit, University Hospitals of Leicester NHS Trust, Leicester, UK
| | - Vasa Curcin
- School of Population Health and Environmental Sciences, King's College London, London, UK
| | - Johanna A A Damen
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Netherlands
| | - Carlos A Fermín-Martínez
- Dirección de Investigación, Instituto Nacional de Geriatría, Mexico City, Mexico
- MD/PhD (PECEM) Program, Faculty of Medicine, National Autonomous University of Mexico, Mexico City, Mexico
| | - Luisa Fernández-Chirino
- Dirección de Investigación, Instituto Nacional de Geriatría, Mexico City, Mexico
- Faculty of Chemistry, Universidad Nacional Autónoma de México, México City, Mexico
| | - Davide Ferrari
- School of Population Health and Environmental Sciences, King's College London, London, UK
- Centre for Clinical Infection and Diagnostics Research, School of Immunology and Microbial Sciences, King's College London, London, UK
| | - Robert C Free
- Department of Respiratory Sciences, College of Life Sciences, University of Leicester, Leicester, UK
- NIHR Leicester Biomedical Research Centre, University of Leicester, Leicester, UK
| | - Rishi K Gupta
- Institute for Global Health, University College London, London, UK
| | - Pranabashis Haldar
- Department of Respiratory Sciences, College of Life Sciences, University of Leicester, Leicester, UK
- NIHR Leicester Biomedical Research Centre, University of Leicester, Leicester, UK
- Department of Respiratory Medicine, University Hospitals of Leicester NHS Trust, Leicester, UK
| | - Pontus Hedberg
- Department of Infectious Diseases, Karolinska University Hospital, Stockholm, Sweden
- Division of Infectious Diseases, Department of Medicine Solna, Karolinska Institute, Stockholm, Sweden
| | - Steven Kwasi Korang
- Copenhagen Trial Unit, Centre for Clinical Intervention Research, Department 7812, Rigshospitalet, Copenhagen University Hospital, Denmark
| | - Steef Kurstjens
- Laboratory of Clinical Chemistry and Haematology, Jeroen Bosch Hospital, Den Bosch, Netherlands
| | - Ron Kusters
- Laboratory of Clinical Chemistry and Haematology, Jeroen Bosch Hospital, Den Bosch, Netherlands
- Department of Health Technology and Services Research, Technical Medical Centre, University of Twente, Enschede, Netherlands
| | - Rupert W Major
- Department of Cardiovascular Sciences, College of Life Sciences, University of Leicester, Leicester, UK
- Department of Cardiovascular Sciences, College of Life Sciences, University of Leicester, Leicester, UK
| | - Lauren Maxwell
- Heidelberger Institut für Global Health, Universitätsklinikum Heidelberg, Germany
| | - Rajeshwari Nair
- University of Iowa Carver College of Medicine, Iowa City, IA, USA
- Centre for Access and Delivery Research Evaluation Iowa City Veterans Affairs Health Care System, Iowa City, IA, USA
| | - Pontus Naucler
- Department of Infectious Diseases, Karolinska University Hospital, Stockholm, Sweden
- Division of Infectious Diseases, Department of Medicine Solna, Karolinska Institute, Stockholm, Sweden
| | - Tri-Long Nguyen
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Section of Epidemiology, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
- Department of Pharmacy, University Hospital Centre of Nîmes, Nîmes, France
| | - Mahdad Noursadeghi
- Division of Infection and Immunity, University College London, London, UK
| | - Rossana Rosa
- Infectious Diseases Service, UnityPoint Health-Des Moines, Des Moines, IA, USA
| | - Felipe Soares
- Industrial Engineering Department, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Toshihiko Takada
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Department of General Medicine, Shirakawa Satellite for Teaching And Research (STAR), Fukushima Medical University, Fukushima, Japan
| | - Florien S van Royen
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Laure Wynants
- Bernhoven, Uden, Netherlands
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Martin Modrák
- Institute of Microbiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Folkert W Asselbergs
- Department of Cardiology, Division of Heart and Lungs, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Health Data Research UK and Institute of Health Informatics, University College London, London, UK
- Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, London, UK
| | - Marijke Linschoten
- Department of Cardiology, Division of Heart and Lungs, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Netherlands
| | - Thomas P A Debray
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Netherlands
| |
Collapse
|
35
|
Faraway J, Boxall-Clasby J, Feil EJ, Gibbon MJ, Hatfield O, Kasprzyk-Hordern B, Smith T. Challenges in realising the potential of wastewater-based epidemiology to quantitatively monitor and predict the spread of disease. JOURNAL OF WATER AND HEALTH 2022; 20:1038-1050. [PMID: 35902986 DOI: 10.2166/wh.2022.020] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Researchers around the world have demonstrated correlations between measurements of SARS-CoV-2 RNA in wastewater (WW) and case rates of COVID-19 derived from direct testing of individuals. This has raised concerns that wastewater-based epidemiology (WBE) methods might be used to quantify the spread of this and other diseases, perhaps faster than direct testing, and with less expense and intrusion. We illustrate, using data from Scotland and the USA, the issues regarding the construction of effective predictive models for disease case rates. We discuss the effects of variation in, and the problem of aligning, public health (PH) reporting and WW measurements. We investigate time-varying effects in PH-reported case rates and their relationship to WW measurements. We show the lack of proportionality of WW measurements to case rates with associated spatial heterogeneity. We illustrate how the precision of predictions is affected by the level of aggregation chosen. We determine whether PH or WW measurements are the leading indicators of disease and how they may be used in conjunction to produce predictive models. The prospects of using WW-based predictive models with or without ongoing PH data are discussed.
Collapse
Affiliation(s)
- Julian Faraway
- Department of Mathematical Sciences, University of Bath, Bath BA2 7AY, UK E-mail:
| | | | - Edward J Feil
- Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, UK
| | - Marjorie J Gibbon
- Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, UK
| | - Oliver Hatfield
- Institute for Mathematical Innovation, University of Bath, Bath BA2 7AY, UK
| | | | - Theresa Smith
- Department of Mathematical Sciences, University of Bath, Bath BA2 7AY, UK E-mail:
| |
Collapse
|
36
|
Mbanu P, Saunders MP, Mistry H, Mercer J, Malcomson L, Yousif S, Price G, Kochhar R, Renehan AG, van Herk M, Osorio EV. Clinical and radiomics prediction of complete response in rectal cancer pre-chemoradiotherapy. Phys Imaging Radiat Oncol 2022; 23:48-53. [PMID: 35800297 PMCID: PMC9253904 DOI: 10.1016/j.phro.2022.06.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 06/18/2022] [Accepted: 06/20/2022] [Indexed: 11/11/2022] Open
|
37
|
van Royen FS, Moons KGM, Geersing GJ, van Smeden M. Developing, validating, updating and judging the impact of prognostic models for respiratory diseases. Eur Respir J 2022; 60:13993003.00250-2022. [PMID: 35728976 DOI: 10.1183/13993003.00250-2022] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 05/27/2022] [Indexed: 11/05/2022]
Affiliation(s)
- Florien S van Royen
- Dept. General Practice, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, Netherlands
| | - Karel G M Moons
- Dept. Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, Netherlands
| | - Geert-Jan Geersing
- Dept. General Practice, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, Netherlands
| | - Maarten van Smeden
- Dept. Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
38
|
Kheirandish M, Catanzaro D, Crudu V, Zhang S. Integrating landmark modeling framework and machine learning algorithms for dynamic prediction of tuberculosis treatment outcomes. J Am Med Inform Assoc 2022; 29:900-908. [PMID: 35139541 PMCID: PMC9006704 DOI: 10.1093/jamia/ocac003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 12/15/2021] [Accepted: 01/27/2022] [Indexed: 11/12/2022] Open
Abstract
OBJECTIVE This study aims to establish an informative dynamic prediction model of treatment outcomes using follow-up records of tuberculosis (TB) patients, which can timely detect cases when the current treatment plan may not be effective. MATERIALS AND METHODS We used 122 267 follow-up records from 17 958 new cases of pulmonary TB in the Republic of Moldova. A dynamic prediction framework integrating landmark modeling and machine learning algorithms was designed to predict patient outcomes during the course of treatment. Sensitivity and positive predictive value (PPV) were calculated to evaluate performance of the model at critical time points. New measures were defined to determine when follow-up laboratory tests should be conducted to obtain most informative results. RESULTS The random-forest algorithm performed better than support vector machine and penalized multinomial logistic regression models for predicting TB treatment outcomes. For all 3 outcome classes (ie, cured, not cured, and died after 24 months following treatment initiation), sensitivity and PPV of prediction models improved as more follow-up information was collected. Specifically, sensitivity and PPV increased from 0.55 to 0.84 and from 0.32 to 0.88, respectively, for the not cured class. CONCLUSION The dynamic prediction framework utilizes longitudinal laboratory test results to predict patient outcomes at various landmarks. Sputum culture and smear results are among the important variables for prediction; however, the most recent sputum result is not always the most informative one. This framework can potentially facilitate a more effective treatment monitoring program and provide insights for policymakers toward improved guidelines on follow-up tests.
Collapse
Affiliation(s)
- Maryam Kheirandish
- Department of Industrial Engineering, University of Arkansas, Fayetteville, Arkansas, USA
| | - Donald Catanzaro
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
| | - Valeriu Crudu
- Institute of Phthisiopneumology “Chrirl Draganiuc,” Chisinau, Moldova
- State University of Medicine and Pharmacy “Nicolae Testemitanu,” Chisinau, Moldova
| | - Shengfan Zhang
- Department of Industrial Engineering, University of Arkansas, Fayetteville, Arkansas, USA
| |
Collapse
|
39
|
Craddock M, Crockett C, McWilliam A, Price G, Sperrin M, van der Veer SN, Faivre-Finn C. Evaluation of Prognostic and Predictive Models in the Oncology Clinic. Clin Oncol (R Coll Radiol) 2022; 34:102-113. [PMID: 34922799 DOI: 10.1016/j.clon.2021.11.022] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 11/19/2021] [Accepted: 11/25/2021] [Indexed: 12/13/2022]
Abstract
Predictive and prognostic models hold great potential to support clinical decision making in oncology and could ultimately facilitate a paradigm shift to a more personalised form of treatment. While a large number of models relevant to the field of oncology have been developed, few have been translated into clinical use and assessment of clinical utility is not currently considered a routine part of model development. In this narrative review of the clinical evaluation of prediction models in oncology, we propose a high-level process diagram for the life cycle of a clinical model, encompassing model commissioning, clinical implementation and ongoing quality assurance, which aims to bridge the gap between model development and clinical implementation.
Collapse
Affiliation(s)
- M Craddock
- University of Manchester, Radiotherapy Related Research Group, Division of Cancer Sciences, School of Medical Sciences, Manchester, UK.
| | - C Crockett
- Department of Clinical Oncology, The Christie NHS Foundation Trust, Manchester, UK
| | - A McWilliam
- University of Manchester, Radiotherapy Related Research Group, Division of Cancer Sciences, School of Medical Sciences, Manchester, UK
| | - G Price
- University of Manchester, Radiotherapy Related Research Group, Division of Cancer Sciences, School of Medical Sciences, Manchester, UK
| | - M Sperrin
- Centre for Health Informatics, Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK
| | - S N van der Veer
- Centre for Health Informatics, Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK
| | - C Faivre-Finn
- University of Manchester, Radiotherapy Related Research Group, Division of Cancer Sciences, School of Medical Sciences, Manchester, UK; Department of Clinical Oncology, The Christie NHS Foundation Trust, Manchester, UK
| |
Collapse
|
40
|
Validation of the Trauma and Injury Severity Score for Prediction of Mortality in a Greek Trauma Population. J Trauma Nurs 2022; 29:34-40. [PMID: 35007249 DOI: 10.1097/jtn.0000000000000629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
BACKGROUND Although the Trauma and Injury Severity Score (TRISS) has been extensively used for mortality risk adjustment in trauma, its applicability in contemporary trauma populations is increasingly questioned. OBJECTIVE The study aimed to evaluate the predictive performance of the TRISS in its original and revised version and compare these with a recalibrated version, including current data from a Greek trauma population. METHODS This is a retrospective cohort study of admitted trauma patients conducted in two tertiary Greek hospitals from January 2016 to December 2018. The model algorithm was calculated based on the Major Trauma Outcome Study coefficients (TRISSMTOS), the National Trauma Data Bank coefficients (TRISSNTDB), and reweighted coefficients of logistic regression obtained from a Greek trauma dataset (TRISSGrTD). The primary endpoint was inhospital mortality. Models' prediction was performed using discrimination and calibration statistics. RESULTS A total of 8,988 trauma patients were included, of whom 854 died (9.5%). The TRISSMTOS displayed excellent discrimination with an area under the curve (AUC) of 0.912 (95% CI 0.902-0.923) and comparable with TRISSNTDB (AUC = 0.908, 95% CI 0.897-0.919, p = .1195). Calibration of both models was poor (Hosmer-Lemeshow test p < .001), tending to underestimate the probability of mortality across almost all risk groups. The TRISSGrTD resulted in statistically significant improvement in discrimination (AUC = 0.927, 95% CI 0.918-0.936, p < .0001) and acceptable calibration (Hosmer-Lemeshow test p = .113). CONCLUSION In this cohort of Greek trauma patients, the performance of the original TRISS was suboptimal, and there was no evidence that it has benefited from its latest revision. By contrast, a strong case exists for supporting a locally recalibrated version to render the TRISS applicable for mortality prediction and performance benchmarking.
Collapse
|
41
|
de Hond AAH, Leeuwenberg AM, Hooft L, Kant IMJ, Nijman SWJ, van Os HJA, Aardoom JJ, Debray TPA, Schuit E, van Smeden M, Reitsma JB, Steyerberg EW, Chavannes NH, Moons KGM. Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review. NPJ Digit Med 2022; 5:2. [PMID: 35013569 PMCID: PMC8748878 DOI: 10.1038/s41746-021-00549-7] [Citation(s) in RCA: 105] [Impact Index Per Article: 52.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 12/13/2021] [Indexed: 12/23/2022] Open
Abstract
While the opportunities of ML and AI in healthcare are promising, the growth of complex data-driven prediction models requires careful quality and applicability assessment before they are applied and disseminated in daily practice. This scoping review aimed to identify actionable guidance for those closely involved in AI-based prediction model (AIPM) development, evaluation and implementation including software engineers, data scientists, and healthcare professionals and to identify potential gaps in this guidance. We performed a scoping review of the relevant literature providing guidance or quality criteria regarding the development, evaluation, and implementation of AIPMs using a comprehensive multi-stage screening strategy. PubMed, Web of Science, and the ACM Digital Library were searched, and AI experts were consulted. Topics were extracted from the identified literature and summarized across the six phases at the core of this review: (1) data preparation, (2) AIPM development, (3) AIPM validation, (4) software development, (5) AIPM impact assessment, and (6) AIPM implementation into daily healthcare practice. From 2683 unique hits, 72 relevant guidance documents were identified. Substantial guidance was found for data preparation, AIPM development and AIPM validation (phases 1-3), while later phases clearly have received less attention (software development, impact assessment and implementation) in the scientific literature. The six phases of the AIPM development, evaluation and implementation cycle provide a framework for responsible introduction of AI-based prediction models in healthcare. Additional domain and technology specific research may be necessary and more practical experience with implementing AIPMs is needed to support further guidance.
Collapse
Affiliation(s)
- Anne A H de Hond
- Department of Information Technology and Digital Innovation, Leiden University Medical Center, Leiden, The Netherlands.
- Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands.
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.
| | - Artuur M Leeuwenberg
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
| | - Lotty Hooft
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Ilse M J Kant
- Department of Information Technology and Digital Innovation, Leiden University Medical Center, Leiden, The Netherlands
- Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - Steven W J Nijman
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Hendrikus J A van Os
- Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands
- National eHealth Living Lab, Leiden, The Netherlands
| | - Jiska J Aardoom
- National eHealth Living Lab, Leiden, The Netherlands
- Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
| | - Thomas P A Debray
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Ewoud Schuit
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Johannes B Reitsma
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Ewout W Steyerberg
- Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - Niels H Chavannes
- National eHealth Living Lab, Leiden, The Netherlands
- Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
42
|
Chen YJ, Duku E, Georgiades S. Rethinking Autism Intervention Science: A Dynamic Perspective. Front Psychiatry 2022; 13:827406. [PMID: 35280173 PMCID: PMC8915252 DOI: 10.3389/fpsyt.2022.827406] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Recent advances in longitudinal methodologies for observational studies have contributed to a better understanding of Autism as a neurodevelopmental condition characterized by within-person and between-person variability over time across behavioral domains. However, this finer-grained approach to the study of developmental variability has yet to be applied to Autism intervention science. The widely adopted experimental designs in the field-randomized control trials and quasi-experimental designs-hold value for inferring treatment effects; at the same time, they are limited in elucidating what works for whom, why, and when, given the idiosyncrasies of neurodevelopmental disorders where predictors and outcomes are often dynamic in nature. This perspective paper aims to serve as a primer for Autism intervention scientists to rethink the way we approach predictors of treatment response and treatment-related change using a dynamic lens. We discuss several empirical gaps, and potential methodological challenges and opportunities pertaining to: (1) capturing finer-grained treatment effects in specific behavioral domains as indexed by micro-level within-person changes during and beyond intervention; and (2) examining and modeling dynamic prediction of treatment response. Addressing these issues can contribute to enhanced study designs and methodologies that generate evidence to inform the development of more personalized interventions and stepped care approaches for individuals on the heterogeneous spectrum of Autism with changing needs across development.
Collapse
Affiliation(s)
- Yun-Ju Chen
- Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, ON, Canada
| | - Eric Duku
- Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, ON, Canada
| | - Stelios Georgiades
- Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
43
|
Reynard C, Martin GP, Kontopantelis E, Jenkins DA, Heagerty A, McMillan B, Jafar A, Garlapati R, Body R. Advanced cardiovascular risk prediction in the emergency department: updating a clinical prediction model - a large database study protocol. Diagn Progn Res 2021; 5:16. [PMID: 34620253 PMCID: PMC8499458 DOI: 10.1186/s41512-021-00105-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 09/27/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Patients presenting with chest pain represent a large proportion of attendances to emergency departments. In these patients clinicians often consider the diagnosis of acute myocardial infarction (AMI), the timely recognition and treatment of which is clinically important. Clinical prediction models (CPMs) have been used to enhance early diagnosis of AMI. The Troponin-only Manchester Acute Coronary Syndromes (T-MACS) decision aid is currently in clinical use across Greater Manchester. CPMs have been shown to deteriorate over time through calibration drift. We aim to assess potential calibration drift with T-MACS and compare methods for updating the model. METHODS We will use routinely collected electronic data from patients who were treated using TMACS at two large NHS hospitals. This is estimated to include approximately 14,000 patient episodes spanning June 2016 to October 2020. The primary outcome of acute myocardial infarction will be sourced from NHS Digital's admitted patient care dataset. We will assess the calibration drift of the existing model and the benefit of updating the CPM by model recalibration, model extension and dynamic updating. These models will be validated by bootstrapping and one step ahead prequential testing. We will evaluate predictive performance using calibrations plots and c-statistics. We will also examine the reclassification of predicted probability with the updated TMACS model. DISCUSSION CPMs are widely used in modern medicine, but are vulnerable to deteriorating calibration over time. Ongoing refinement using routinely collected electronic data will inevitably be more efficient than deriving and validating new models. In this analysis we will seek to exemplify methods for updating CPMs to protect the initial investment of time and effort. If successful, the updating methods could be used to continually refine the algorithm used within TMACS, maintaining or even improving predictive performance over time. TRIAL REGISTRATION ISRCTN number: ISRCTN41008456.
Collapse
Affiliation(s)
- Charles Reynard
- grid.5379.80000000121662407Division of Cardiovascular Sciences, University of Manchester, Manchester, UK
- grid.498924.aEmergency Department, Manchester University NHS Foundation Trust, Manchester, UK
| | - Glen P. Martin
- grid.5379.80000000121662407Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - Evangelos Kontopantelis
- grid.5379.80000000121662407Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - David A. Jenkins
- grid.5379.80000000121662407Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - Anthony Heagerty
- grid.5379.80000000121662407Division of Cardiovascular Sciences, University of Manchester, Manchester, UK
| | - Brian McMillan
- Centre for Primary Care and Health Services Research Division of Population Health, Health Services Research and Primary Care School of Health Sciences Faculty of Biology, Medicine and Health University of Manchestern, Manchester, UK
| | - Anisa Jafar
- grid.5379.80000000121662407Humanitarian and Conflict Response Institute, University of Manchester, Manchester, UK
| | - Rajendar Garlapati
- grid.439642.e0000 0004 0489 3782Emergency Department, Royal Blackburn Hospital, East Lancashire Hospitals NHS Trust, Burnley, UK
| | - Richard Body
- grid.5379.80000000121662407Division of Cardiovascular Sciences, University of Manchester, Manchester, UK
- grid.498924.aEmergency Department, Manchester University NHS Foundation Trust, Manchester, UK
| |
Collapse
|
44
|
Asgari S, Khalili D, Zayeri F, Azizi F, Hadaegh F. Dynamic prediction models improved the risk classification of type 2 diabetes compared with classical static models. J Clin Epidemiol 2021; 140:33-43. [PMID: 34455032 DOI: 10.1016/j.jclinepi.2021.08.026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 07/07/2021] [Accepted: 08/20/2021] [Indexed: 11/27/2022]
Abstract
OBJECTIVE Dynamic prediction models use the repeated measurements of predictors to estimate coefficients that link the longitudinal predictors to a static model (i.e. Cox regression). This study aims to develop and validate a dynamic prediction for incident type 2 diabetes (T2DM) as the outcome. STUDY DESIGN AND SETTING Data from the Tehran lipid and glucose study was used to develop (n = 5291 individuals; phases 1 to 3) and validate (n = 3147 individuals; phases 3 to 6) the dynamic prediction model among individuals aged ≥ 20 years. We used repeated measurements of fasting plasma glucose (FPG) or waist circumference (WC) in the framework of the joint modeling (JM) of longitudinal and time-to-event analysis. RESULTS Compared with the Cox which used just baseline data, JM showed the same discrimination, better calibration, and higher clinical usefulness (i.e. with a net benefit considering both true and false positive decisions); all were shown with repeated measurements of FPG/WC. Additionally, in our study, the dynamic models improve the risk reclassification (net reclassification index 33% for FPG and 24% for WC model). CONCLUSION Dynamic prediction models, compared with the static one could yield significant improvements in the prediction of T2DM. The complexity of the dynamic models could be addressed by using decision support systems.
Collapse
Affiliation(s)
- Samaneh Asgari
- Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Davood Khalili
- Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran; Department of Biostatistics and Epidemiology, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Farid Zayeri
- Proteomics Research Center and Department of Biostatistics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Fereidoun Azizi
- Endocrine Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Farzad Hadaegh
- Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
45
|
Lenselink C, Ties D, Pleijhuis R, van der Harst P. Validation and comparison of 28 risk prediction models for coronary artery disease. Eur J Prev Cardiol 2021; 29:666-674. [PMID: 34329420 DOI: 10.1093/eurjpc/zwab095] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/29/2021] [Accepted: 05/18/2021] [Indexed: 12/16/2022]
Abstract
AIMS Risk prediction models (RPMs) for coronary artery disease (CAD), using variables to calculate CAD risk, are potentially valuable tools in prevention strategies. However, their use in the clinical practice is limited by a lack of poor model description, external validation, and head-to-head comparisons. METHODS AND RESULTS CAD RPMs were identified through Tufts PACE CPM Registry and a systematic PubMed search. Every RPM was externally validated in the three cohorts (the UK Biobank, LifeLines, and PREVEND studies) for the primary endpoint myocardial infarction (MI) and secondary endpoint CAD, consisting of MI, percutaneous coronary intervention, and coronary artery bypass grafting. Model discrimination (C-index), calibration (intercept and regression slope), and accuracy (Brier score) were assessed and compared head-to-head between RPMs. Linear regression analysis was performed to evaluate predictive factors to estimate calibration ability of an RPM. Eleven articles containing 28 CAD RPMs were included. No single best-performing RPM could be identified across all cohorts and outcomes. Most RPMs yielded fair discrimination ability: mean C-index of RPMs was 0.706 ± 0.049, 0.778 ± 0.097, and 0.729 ± 0.074 (P < 0.01) for prediction of MI in UK Biobank, LifeLines, and PREVEND, respectively. Endpoint incidence in the original development cohorts was identified as a significant predictor for external validation performance. CONCLUSION Performance of CAD RPMs was comparable upon validation in three large cohorts, based on which no specific RPM can be recommended for predicting CAD risk.
Collapse
Affiliation(s)
- Chris Lenselink
- Department of Cardiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700 RB Groningen, The Netherlands
| | - Daan Ties
- Department of Cardiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700 RB Groningen, The Netherlands
| | - Rick Pleijhuis
- Department of Internal Medicine, University Medical Center Groningen, Hanzeplein 1, 9700 RB, Groningen, The Netherlands
| | - Pim van der Harst
- Department of Cardiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700 RB Groningen, The Netherlands
| |
Collapse
|
46
|
Bharat C, Hickman M, Barbieri S, Degenhardt L. Big data and predictive modelling for the opioid crisis: existing research and future potential. Lancet Digit Health 2021; 3:e397-e407. [PMID: 34045004 DOI: 10.1016/s2589-7500(21)00058-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 01/21/2021] [Accepted: 03/24/2021] [Indexed: 12/30/2022]
Abstract
A need exists to accurately estimate overdose risk and improve understanding of how to deliver treatments and interventions in people with opioid use disorder in a way that reduces such risk. We consider opportunities for predictive analytics and routinely collected administrative data to evaluate how overdose could be reduced among people with opioid use disorder. Specifically, we summarise global trends in opioid use and overdoses; describe the use of big data in research into opioid overdose; consider the potential for predictive modelling, including machine learning, for prevention and monitoring of opioid overdoses; and outline the challenges and risks relating to the use of big data and machine learning in reducing harms that are related to opioid use. Future research for improving the coverage and provision of existing interventions, treatments, and resources for opioid use disorder requires collaboration of multiple agencies. Predictive modelling could transport the concept of stratified medicine to public health through novel methods, such as predictive modelling and emulated trials for evaluating diagnoses and prognoses of opioid use disorder, predicting treatment response, and providing targeted treatment recommendations.
Collapse
Affiliation(s)
- Chrianna Bharat
- National Drug and Alcohol Research Centre, University of New South Wales, Sydney, NSW, Australia.
| | - Matthew Hickman
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Sebastiano Barbieri
- Centre for Big Data Research in Health, University of New South Wales, Sydney, NSW, Australia
| | - Louisa Degenhardt
- National Drug and Alcohol Research Centre, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|