1
|
Joshi S, Urteaga I, van Amsterdam WAC, Hripcsak G, Elias P, Recht B, Elhadad N, Fackler J, Sendak MP, Wiens J, Deshpande K, Wald Y, Fiterau M, Lipton Z, Malinsky D, Nayan M, Namkoong H, Park S, Vogt JE, Ranganath R. AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation. J Am Med Inform Assoc 2025:ocae301. [PMID: 39775871 DOI: 10.1093/jamia/ocae301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Revised: 11/13/2024] [Accepted: 12/13/2024] [Indexed: 01/11/2025] Open
Abstract
The primary practice of healthcare artificial intelligence (AI) starts with model development, often using state-of-the-art AI, retrospectively evaluated using metrics lifted from the AI literature like AUROC and DICE score. However, good performance on these metrics may not translate to improved clinical outcomes. Instead, we argue for a better development pipeline constructed by working backward from the end goal of positively impacting clinically relevant outcomes using AI, leading to considerations of causality in model development and validation, and subsequently a better development pipeline. Healthcare AI should be "actionable," and the change in actions induced by AI should improve outcomes. Quantifying the effect of changes in actions on outcomes is causal inference. The development, evaluation, and validation of healthcare AI should therefore account for the causal effect of intervening with the AI on clinically relevant outcomes. Using a causal lens, we make recommendations for key stakeholders at various stages of the healthcare AI pipeline. Our recommendations aim to increase the positive impact of AI on clinical outcomes.
Collapse
Affiliation(s)
- Shalmali Joshi
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States
| | - Iñigo Urteaga
- BCAM-Basque Center for Applied Mathematics, Bilbao 48009, Spain
- IKERBASQUE-Basque Foundation for Science, Bilbao 48009, Spain
| | - Wouter A C van Amsterdam
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States
| | - Pierre Elias
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States
- Division of Cardiology, Columbia University, New York, NY 10032, United States
| | - Benjamin Recht
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA 94720, United States
| | - Noémie Elhadad
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States
| | - James Fackler
- Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, United States
| | - Mark P Sendak
- Population Health and Data Science, Duke Institute of Health Innovation, Durham, NC 27701, United States
| | - Jenna Wiens
- Department of Computer Science and Engineering, University of Michigan, Ann Arbor, Ann Arbor, MI 48109, United States
| | - Kaivalya Deshpande
- Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, United States
| | - Yoav Wald
- Center for Data Science, New York University, New York, NY 10011, United States
| | - Madalina Fiterau
- College of Information and Computer Sciences, University of Massachusetts, Amherst, Amherst, MA 01003, United States
| | - Zachary Lipton
- Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA 15213, United States
| | - Daniel Malinsky
- Department of Biostatistics, Columbia University, New York, NY 10032, United States
| | - Madhur Nayan
- Department of Population Health and Urology, NYU Grossman School of Medicine, New York, NY 10016, United States
| | - Hongseok Namkoong
- Division of Decisions, Risk, and Operations, Columbia Business School, New York, NY 10027, United States
| | - Soojin Park
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States
| | - Julia E Vogt
- Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland
| | - Rajesh Ranganath
- Center for Data Science, New York University, New York, NY 10011, United States
- Department of Computer Science, New York University, New York, NY 10012, United States
| |
Collapse
|
2
|
Radecki RP. Let a Million Monkeys With Typewriters Do Your Quality Measure Reporting: January 2025 Annals of Emergency Medicine Journal Club. Ann Emerg Med 2025; 85:92-94. [PMID: 39706610 DOI: 10.1016/j.annemergmed.2024.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2024]
Affiliation(s)
- Ryan P Radecki
- Department of Emergency Medicine, Christchurch Hospital, Christchurch, New Zealand
| |
Collapse
|
3
|
McCradden MD, London AJ, Gichoya JW, Sendak M, Erdman L, Stedman I, Oakden-Rayner L, Akrout I, Anderson JA, Farmer LA, Greer R, Goldenberg A, Ho Y, Joshi S, Louise J, Mamdani M, Mazwi ML, Mohamud A, Palmer LJ, Peperidis A, Pfohl SR, Rickard M, Semmler C, Singh K, Singh D, Soremekun S, Tikhomirov L, van der Vegt AH, Verspoor K, Liu X. CANAIRI: the Collaboration for Translational Artificial Intelligence Trials in healthcare. Nat Med 2025; 31:9-11. [PMID: 39762426 DOI: 10.1038/s41591-024-03364-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2025]
Affiliation(s)
- Melissa D McCradden
- Women's and Children's Health Network, Adelaide, South Australia, Australia.
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia.
- SickKids Research Institute, Toronto, Ontario, Canada.
| | | | | | - Mark Sendak
- Duke Institute for Health Innovation, Durham, NC, USA
| | - Lauren Erdman
- James M. Anderson Center for Health Systems Excellence, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
- University of Cincinnati School of Medicine, Cincinnati, OH, USA
| | - Ian Stedman
- School of Public Policy and Administration at York University, Toronto, Ontario, Canada
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
| | - Ismail Akrout
- Artificial Intelligence in Medicine Initiative, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - James A Anderson
- SickKids Research Institute, Toronto, Ontario, Canada
- Department of Bioethics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | | | - Robert Greer
- Artificial Intelligence in Medicine Initiative, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Anna Goldenberg
- SickKids Research Institute, Toronto, Ontario, Canada
- School of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for AI, Toronto, Ontario, Canada
- Canadian Institute for Advanced Research, Toronto, Ontario, Canada
| | - Yvonne Ho
- Royal Australian and New Zealand College of Radiologists, Sydney, New South Wales, Australia
- Medical Devices and Product Quality Division, Health Products and Regulation Group, Australian Government Department of Health and Aged Care, Canberra, Australian Capital Territory, Australia
| | - Shalmali Joshi
- Biomedical Informatics, Columbia University, New York, NY, USA
- Columbia University Irving Medical Center, New York, NY, USA
| | - Jennie Louise
- Women's and Children's Hospital Research Centre, Adelaide, South Australia, Australia
- Biostatistics Unit, South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia
| | - Muhammad Mamdani
- Vector Institute for AI, Toronto, Ontario, Canada
- Unity Health, Toronto, Ontario, Canada
- Temerty Centre for Artificial Intelligence Research and Education in Medicine (T-CAIREM), University of Toronto, Toronto, Ontario, Canada
| | | | - Abdullahi Mohamud
- SickKids Research Institute, Toronto, Ontario, Canada
- Artificial Intelligence in Medicine Initiative, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Lyle J Palmer
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
- School of Public Health, University of Adelaide, Adelaide, South Australia, Australia
| | | | | | - Mandy Rickard
- Division of Urology, Department of Surgery, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Carolyn Semmler
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
- School of Psychology, University of Adelaide, Adelaide, South Australia, Australia
| | - Karandeep Singh
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, San Diego, CA, USA
| | - Devin Singh
- SickKids Research Institute, Toronto, Ontario, Canada
- Artificial Intelligence in Medicine Initiative, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Seyi Soremekun
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
| | - Lana Tikhomirov
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
- School of Psychology, University of Adelaide, Adelaide, South Australia, Australia
| | - Anton H van der Vegt
- Queensland Digital Health Centre, University of Queensland, Brisbane, Queensland, Australia
| | - Karin Verspoor
- School of Computing Technologies, RMIT University, Melbourne, VIC, Australia
| | - Xiaoxuan Liu
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- National Institute for Health and Care Research, Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
| |
Collapse
|
4
|
Mehta D, Gonzalez XT, Huang G, Abraham J. Machine learning-augmented interventions in perioperative care: a systematic review and meta-analysis. Br J Anaesth 2024; 133:1159-1172. [PMID: 39322472 PMCID: PMC11589382 DOI: 10.1016/j.bja.2024.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 08/01/2024] [Accepted: 08/05/2024] [Indexed: 09/27/2024] Open
Abstract
BACKGROUND We lack evidence on the cumulative effectiveness of machine learning (ML)-driven interventions in perioperative settings. Therefore, we conducted a systematic review to appraise the evidence on the impact of ML-driven interventions on perioperative outcomes. METHODS Ovid MEDLINE, CINAHL, Embase, Scopus, PubMed, and ClinicalTrials.gov were searched to identify randomised controlled trials (RCTs) evaluating the effectiveness of ML-driven interventions in surgical inpatient populations. The review was registered with PROSPERO (CRD42023433163) and conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Meta-analysis was conducted for outcomes with two or more studies using a random-effects model, and vote counting was conducted for other outcomes. RESULTS Among 13 included RCTs, three types of ML-driven interventions were evaluated: Hypotension Prediction Index (HPI) (n=5), Nociception Level Index (NoL) (n=7), and a scheduling system (n=1). Compared with the standard care, HPI led to a significant decrease in absolute hypotension (n=421, P=0.003, I2=75%) and relative hypotension (n=208, P<0.0001, I2=0%); NoL led to significantly lower mean pain scores in the post-anaesthesia care unit (PACU) (n=191, P=0.004, I2=19%). NoL showed no significant impact on intraoperative opioid consumption (n=339, P=0.31, I2=92%) or PACU opioid consumption (n=339, P=0.11, I2=0%). No significant difference in hospital length of stay (n=361, P=0.81, I2=0%) and PACU stay (n=267, P=0.44, I2=0) was found between HPI and NoL. CONCLUSIONS HPI decreased the duration of intraoperative hypotension, and NoL decreased postoperative pain scores, but no significant impact on other clinical outcomes was found. We highlight the need to address both methodological and clinical practice gaps to ensure the successful future implementation of ML-driven interventions. SYSTEMATIC REVIEW PROTOCOL CRD42023433163 (PROSPERO).
Collapse
Affiliation(s)
- Divya Mehta
- Department of Anesthesiology, Washington University School of Medicine, St. Louis, MO, USA
| | - Xiomara T Gonzalez
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, TX, USA
| | - Grace Huang
- Medical Education, Washington University School of Medicine, St. Louis, MO, USA
| | - Joanna Abraham
- Department of Anesthesiology, Washington University School of Medicine, St. Louis, MO, USA; Institute for Informatics, Data Science and Biostatistics (I2DB), Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
5
|
Spies NC, Farnsworth CW, Wheeler S, McCudden CR. Validating, Implementing, and Monitoring Machine Learning Solutions in the Clinical Laboratory Safely and Effectively. Clin Chem 2024; 70:1334-1343. [PMID: 39255250 DOI: 10.1093/clinchem/hvae126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Accepted: 07/30/2024] [Indexed: 09/12/2024]
Abstract
BACKGROUND Machine learning solutions offer tremendous promise for improving clinical and laboratory operations in pathology. Proof-of-concept descriptions of these approaches have become commonplace in laboratory medicine literature, but only a scant few of these have been implemented within clinical laboratories, owing to the often substantial barriers in validating, implementing, and monitoring these applications in practice. This mini-review aims to highlight the key considerations in each of these steps. CONTENT Effective and responsible applications of machine learning in clinical laboratories require robust validation prior to implementation. A comprehensive validation study involves a critical evaluation of study design, data engineering and interoperability, target label definition, metric selection, generalizability and applicability assessment, algorithmic fairness, and explainability. While the main text highlights these concepts in broad strokes, a supplementary code walk-through is also provided to facilitate a more practical understanding of these topics using a real-world classification task example, the detection of saline-contaminated chemistry panels.Following validation, the laboratorian's role is far from over. Implementing machine learning solutions requires an interdisciplinary effort across several roles in an organization. We highlight the key roles, responsibilities, and terminologies for successfully deploying a validated solution into a live production environment. Finally, the implemented solution must be routinely monitored for signs of performance degradation and updated if necessary. SUMMARY This mini-review aims to bridge the gap between theory and practice by highlighting key concepts in validation, implementation, and monitoring machine learning solutions effectively and responsibly in the clinical laboratory.
Collapse
Affiliation(s)
- Nicholas C Spies
- Department of Pathology, University of Utah School of Medicine/ARUP Laboratories, Salt Lake City, UT, United States
| | - Christopher W Farnsworth
- Division of Laboratory and Genomic Medicine, Department of Pathology, Washington University in St. Louis School of Medicine, St. Louis, MO, United States
| | - Sarah Wheeler
- Department of Pathology, University of Pittsburgh School of Medicine and UPMC, Pittsburgh, PA, United States
| | - Christopher R McCudden
- Division of Biochemistry, Department of Pathology and Laboratory Medicine, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
6
|
Mureșanu S, Hedeșiu M, Iacob L, Eftimie R, Olariu E, Dinu C, Jacobs R. Automating Dental Condition Detection on Panoramic Radiographs: Challenges, Pitfalls, and Opportunities. Diagnostics (Basel) 2024; 14:2336. [PMID: 39451659 PMCID: PMC11507083 DOI: 10.3390/diagnostics14202336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2024] [Revised: 10/13/2024] [Accepted: 10/14/2024] [Indexed: 10/26/2024] Open
Abstract
Background/Objectives: The integration of AI into dentistry holds promise for improving diagnostic workflows, particularly in the detection of dental pathologies and pre-radiotherapy screening for head and neck cancer patients. This study aimed to develop and validate an AI model for detecting various dental conditions, with a focus on identifying teeth at risk prior to radiotherapy. Methods: A YOLOv8 model was trained on a dataset of 1628 annotated panoramic radiographs and externally validated on 180 radiographs from multiple centers. The model was designed to detect a variety of dental conditions, including periapical lesions, impacted teeth, root fragments, prosthetic restorations, and orthodontic devices. Results: The model showed strong performance in detecting implants, endodontic treatments, and surgical devices, with precision and recall values exceeding 0.8 for several conditions. However, performance declined during external validation, highlighting the need for improvements in generalizability. Conclusions: YOLOv8 demonstrated robust detection capabilities for several dental conditions, especially in training data. However, further refinement is needed to enhance generalizability in external datasets and improve performance for conditions like periapical lesions and bone loss.
Collapse
Affiliation(s)
- Sorana Mureșanu
- Department of Oral and Maxillofacial Surgery and Radiology, Iuliu Hațieganu University of Medicine and Pharmacy, 32 Clinicilor Street, 400006 Cluj-Napoca, Romania
| | - Mihaela Hedeșiu
- Department of Oral and Maxillofacial Surgery and Radiology, Iuliu Hațieganu University of Medicine and Pharmacy, 32 Clinicilor Street, 400006 Cluj-Napoca, Romania
| | - Liviu Iacob
- Department of Computer Science, Technical University of Cluj-Napoca, 400114 Cluj-Napoca, Romania
| | - Radu Eftimie
- Iuliu Hațieganu University of Medicine and Pharmacy, 32 Clinicilor Street, 400006 Cluj-Napoca, Romania
| | - Eliza Olariu
- Department of Electrical Engineering, Technical University of Cluj-Napoca, 400114 Cluj-Napoca, Romania
| | - Cristian Dinu
- Department of Oral and Maxillofacial Surgery and Radiology, Iuliu Hațieganu University of Medicine and Pharmacy, 32 Clinicilor Street, 400006 Cluj-Napoca, Romania
| | - Reinhilde Jacobs
- OMFS IMPATH Research Group, Department of Imaging and Pathology, Faculty of Medicine, Katholieke Universiteit Leuven, 3000 Louvain, Belgium
- Department of Oral and Maxillofacial Surgery, University Hospitals Leuven, 3000 Louvain, Belgium
- Department of Dental Medicine, Karolinska Institute, 171 77 Stockholm, Sweden
| | | |
Collapse
|
7
|
Martinson AK, Chin AT, Butte MJ, Rider NL. Artificial Intelligence and Machine Learning for Inborn Errors of Immunity: Current State and Future Promise. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2024; 12:2695-2704. [PMID: 39127104 DOI: 10.1016/j.jaip.2024.08.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 07/10/2024] [Accepted: 08/01/2024] [Indexed: 08/12/2024]
Abstract
Artificial intelligence (AI) and machine learning (ML) research within medicine has exponentially increased over the last decade, with studies showcasing the potential of AI/ML algorithms to improve clinical practice and outcomes. Ongoing research and efforts to develop AI-based models have expanded to aid in the identification of inborn errors of immunity (IEI). The use of larger electronic health record data sets, coupled with advances in phenotyping precision and enhancements in ML techniques, has the potential to significantly improve the early recognition of IEI, thereby increasing access to equitable care. In this review, we provide a comprehensive examination of AI/ML for IEI, covering the spectrum from data preprocessing for AI/ML analysis to current applications within immunology, and address the challenges associated with implementing clinical decision support systems to refine the diagnosis and management of IEI.
Collapse
Affiliation(s)
| | - Aaron T Chin
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, Calif
| | - Manish J Butte
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, Calif
| | - Nicholas L Rider
- Department of Health Systems & Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va; Department of Medicine, Division of Allergy-Immunology, Carilion Clinic, Roanoke, Va.
| |
Collapse
|
8
|
Scott IA, van der Vegt A, Lane P, McPhail S, Magrabi F. Achieving large-scale clinician adoption of AI-enabled decision support. BMJ Health Care Inform 2024; 31:e100971. [PMID: 38816209 PMCID: PMC11141172 DOI: 10.1136/bmjhci-2023-100971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Accepted: 05/15/2024] [Indexed: 06/01/2024] Open
Abstract
Computerised decision support (CDS) tools enabled by artificial intelligence (AI) seek to enhance accuracy and efficiency of clinician decision-making at the point of care. Statistical models developed using machine learning (ML) underpin most current tools. However, despite thousands of models and hundreds of regulator-approved tools internationally, large-scale uptake into routine clinical practice has proved elusive. While underdeveloped system readiness and investment in AI/ML within Australia and perhaps other countries are impediments, clinician ambivalence towards adopting these tools at scale could be a major inhibitor. We propose a set of principles and several strategic enablers for obtaining broad clinician acceptance of AI/ML-enabled CDS tools.
Collapse
Affiliation(s)
- Ian A Scott
- Internal Medicine and Clinical Epidemiology, Princess Alexandra Hospital, Brisbane, Queensland, Australia
- Centre for Health Services Research, The University of Queensland Faculty of Medicine and Biomedical Sciences, Brisbane, Queensland, Australia
| | - Anton van der Vegt
- Digital Health Centre, The University of Queensland Faculty of Medicine and Biomedical Sciences, Herston, Queensland, Australia
| | - Paul Lane
- Safety, Quality and Innovation, The Prince Charles Hospital, Brisbane, Queensland, Australia
| | - Steven McPhail
- Australian Centre for Health Services Innovation, Queensland University of Technology Faculty of Health, Brisbane, Queensland, Australia
| | - Farah Magrabi
- Macquarie University, Sydney, New South Wales, Australia
| |
Collapse
|
9
|
van der Vegt A, Campbell V, Zuccon G. Why clinical artificial intelligence is (almost) non-existent in Australian hospitals and how to fix it. Med J Aust 2024; 220:172-175. [PMID: 38146620 DOI: 10.5694/mja2.52195] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 10/27/2023] [Indexed: 12/27/2023]
Affiliation(s)
- Anton van der Vegt
- Centre for Health Services Research, University of Queensland, Brisbane, QLD
| | | | | |
Collapse
|
10
|
van der Vegt AH, Campbell V, Mitchell I, Malycha J, Simpson J, Flenady T, Flabouris A, Lane PJ, Mehta N, Kalke VR, Decoyna JA, Es’haghi N, Liu CH, Scott IA. Systematic review and longitudinal analysis of implementing Artificial Intelligence to predict clinical deterioration in adult hospitals: what is known and what remains uncertain. J Am Med Inform Assoc 2024; 31:509-524. [PMID: 37964688 PMCID: PMC10797271 DOI: 10.1093/jamia/ocad220] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/27/2023] [Accepted: 10/31/2023] [Indexed: 11/16/2023] Open
Abstract
OBJECTIVE To identify factors influencing implementation of machine learning algorithms (MLAs) that predict clinical deterioration in hospitalized adult patients and relate these to a validated implementation framework. MATERIALS AND METHODS A systematic review of studies of implemented or trialed real-time clinical deterioration prediction MLAs was undertaken, which identified: how MLA implementation was measured; impact of MLAs on clinical processes and patient outcomes; and barriers, enablers and uncertainties within the implementation process. Review findings were then mapped to the SALIENT end-to-end implementation framework to identify the implementation stages at which these factors applied. RESULTS Thirty-seven articles relating to 14 groups of MLAs were identified, each trialing or implementing a bespoke algorithm. One hundred and seven distinct implementation evaluation metrics were identified. Four groups reported decreased hospital mortality, 1 significantly. We identified 24 barriers, 40 enablers, and 14 uncertainties and mapped these to the 5 stages of the SALIENT implementation framework. DISCUSSION Algorithm performance across implementation stages decreased between in silico and trial stages. Silent plus pilot trial inclusion was associated with decreased mortality, as was the use of logistic regression algorithms that used less than 39 variables. Mitigation of alert fatigue via alert suppression and threshold configuration was commonly employed across groups. CONCLUSIONS : There is evidence that real-world implementation of clinical deterioration prediction MLAs may improve clinical outcomes. Various factors identified as influencing success or failure of implementation can be mapped to different stages of implementation, thereby providing useful and practical guidance for implementers.
Collapse
Affiliation(s)
- Anton H van der Vegt
- Centre for Health Services Research, The University of Queensland, Brisbane, QLD 4102, Australia
| | - Victoria Campbell
- Intensive Care Unit, Sunshine Coast Hospital and Health Service, Birtynia, QLD 4575, Australia
- School of Medicine and Dentistry, Griffith University, Gold Coast, QLD 4222, Australia
| | - Imogen Mitchell
- Office of Research and Education, Canberra Health Services, Canberra, ACT 2601, Australia
| | - James Malycha
- Department of Critical Care Medicine, The Queen Elizabeth Hospital, Woodville, SA 5011, Australia
| | - Joanna Simpson
- Eastern Health Intensive Care Services, Eastern Health, Box Hill, VIC 3128, Australia
| | - Tracy Flenady
- School of Nursing, Midwifery & Social Sciences, Central Queensland University, Rockhampton, QLD 4701, Australia
| | - Arthas Flabouris
- Intensive Care Department, Royal Adelaide Hospital, Adelaide, SA 5000, Australia
- Adelaide Medical School, University of Adelaide, Adelaide, SA 5005, Australia
| | - Paul J Lane
- Safety Quality & Innovation, The Prince Charles Hospital, Chermside, QLD 4032, Australia
| | - Naitik Mehta
- Patient Safety and Quality, Clinical Excellence Queensland, Brisbane, QLD 4001, Australia
| | - Vikrant R Kalke
- Patient Safety and Quality, Clinical Excellence Queensland, Brisbane, QLD 4001, Australia
| | - Jovie A Decoyna
- School of Medicine and Dentistry, Griffith University, Gold Coast, QLD 4222, Australia
| | - Nicholas Es’haghi
- School of Medicine and Dentistry, Griffith University, Gold Coast, QLD 4222, Australia
| | - Chun-Huei Liu
- School of Medicine and Dentistry, Griffith University, Gold Coast, QLD 4222, Australia
| | - Ian A Scott
- Centre for Health Services Research, The University of Queensland, Brisbane, QLD 4102, Australia
- Department of Internal Medicine and Clinical Epidemiology, Princess Alexandra Hospital, Brisbane, QLD 4102, Australia
| |
Collapse
|