1
|
Jeanson F, Farkouh ME, Godoy LC, Minha S, Tzuman O, Marcus G. Medical calculators derived synthetic cohorts: a novel method for generating synthetic patient data. Sci Rep 2024; 14:11437. [PMID: 38763934 PMCID: PMC11102910 DOI: 10.1038/s41598-024-61721-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 05/08/2024] [Indexed: 05/21/2024] Open
Abstract
This study shows that we can use synthetic cohorts created from medical risk calculators to gain insights into how risk estimations, clinical reasoning, data-driven subgrouping, and the confidence in risk calculator scores are connected. When prediction variables aren't evenly distributed in these synthetic cohorts, they can be used to group similar cases together, revealing new insights about how cohorts behave. We also found that the confidence in predictions made by these calculators can vary depending on patient characteristics. This suggests that it might be beneficial to include a "normalized confidence" score in future versions of these calculators for healthcare professionals. We plan to explore this idea further in our upcoming research.
Collapse
Affiliation(s)
| | - Michael E Farkouh
- Peter Munk Cardiac Centre and Heart and Stroke Richard Lewar Centre, University of Toronto, Toronto, Canada
| | - Lucas C Godoy
- Peter Munk Cardiac Centre and Heart and Stroke Richard Lewar Centre, University of Toronto, Toronto, Canada
| | - Sa'ar Minha
- Department of Cardiology, Shamir Medical Center, Zeriffin, Israel
- Tel Aviv University Faculty of Medicine, Tel Aviv, Israel
| | - Oran Tzuman
- Department of Cardiology, Shamir Medical Center, Zeriffin, Israel
- Tel Aviv University Faculty of Medicine, Tel Aviv, Israel
| | - Gil Marcus
- Department of Cardiology, Shamir Medical Center, Zeriffin, Israel
- Tel Aviv University Faculty of Medicine, Tel Aviv, Israel
| |
Collapse
|
2
|
Backmund T, Bohlender T, Gaik C, Koch T, Kranke P, Nardi-Hiebl S, Vojnar B, Eberhart LHJ. [Comparison of different prediction models for the occurrence of nausea and vomiting in the postoperative phase : A systematic qualitative comparison based on prospectively defined quality indicators]. DIE ANAESTHESIOLOGIE 2024; 73:251-262. [PMID: 38319326 DOI: 10.1007/s00101-024-01386-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 11/23/2023] [Indexed: 02/07/2024]
Abstract
BACKGROUND Various prognostic prediction models exist for evaluating the risk of nausea and vomiting in the postoperative period (PONV). So far, no systematic comparison of these prognostic scores is available. METHOD A systematic literature search was carried out in seven medical databases to find publications on prognostic PONV models. Identified scores were assessed against prospectively defined quality criteria, including generalizability, validation and clinical relevance of the models. RESULTS The literature search revealed 62 relevant publications with a total of 81,834 patients which could be assigned to 8 prognostic models. The simplified Apfel score performed best, primarily because it was extensively validated. The Van den Bosch score and Sinclair score tied for second place. The simplified Koivuranta score was in third place. CONCLUSION The qualitative analysis highlights the strengths and weaknesses of each prediction system based on predetermined standardized quality criteria.
Collapse
Affiliation(s)
- T Backmund
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland.
| | - T Bohlender
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland
| | - C Gaik
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland
| | - T Koch
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland
| | - P Kranke
- Klinik und Poliklinik für Anästhesiologie, Intensivmedizin, Notfallmedizin und Schmerztherapie, Universitätsklinikum Würzburg, Oberdürrbacher Straße 6, 97080 Würzburg, Deutschland
| | - S Nardi-Hiebl
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland
| | - B Vojnar
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland
| | - L H J Eberhart
- Klinik für Anästhesie und Intensivtherapie, Philipps Universität Marburg, Baldinger Straße, 35043 Marburg, Deutschland
| |
Collapse
|
3
|
Penzias RE, Bohne C, Gicheha E, Molyneux EM, Gathara D, Ngwala SK, Zimba E, Rashid E, Odedere O, Dosunmu O, Tillya R, Shabani J, Cross JH, Ochieng C, Webster HH, Chiume M, Dube Q, Wainaina J, Kassim I, Irimu G, Adudans S, James F, Tongo O, Ezeaka VC, Salim N, Masanja H, Oden M, Richards-Kortum R, Hailegabriel T, Gupta G, Cousens S, Lawn JE, Ohuma EO. Quantifying health facility service readiness for small and sick newborn care: comparing standards-based and WHO level-2 + scoring for 64 hospitals implementing with NEST360 in Kenya, Malawi, Nigeria, and Tanzania. BMC Pediatr 2024; 23:656. [PMID: 38475761 DOI: 10.1186/s12887-024-04578-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 01/18/2024] [Indexed: 03/14/2024] Open
Abstract
BACKGROUND Service readiness tools are important for assessing hospital capacity to provide quality small and sick newborn care (SSNC). Lack of summary scoring approaches for SSNC service readiness means we are unable to track national targets such as the Every Newborn Action Plan targets. METHODS A health facility assessment (HFA) tool was co-designed by Newborn Essential Solutions and Technologies (NEST360) and UNICEF with four African governments. Data were collected in 68 NEST360-implementing neonatal units in Kenya, Malawi, Nigeria, and Tanzania (September 2019-March 2021). Two summary scoring approaches were developed: a) standards-based, including items for SSNC service readiness by health system building block (HSBB), and scored on availability and functionality, and b) level-2 + , scoring items on readiness to provide WHO level-2 + clinical interventions. For each scoring approach, scores were aggregated and summarised as a percentage and equally weighted to obtain an overall score by hospital, HSBB, and clinical intervention. RESULTS Of 1508 HFA items, 1043 (69%) were included in standards-based and 309 (20%) in level-2 + scoring. Sixty-eight neonatal units across four countries had median standards-based scores of 51% [IQR 48-57%] at baseline, with variation by country: 62% [IQR 59-66%] in Kenya, 49% [IQR 46-51%] in Malawi, 50% [IQR 42-58%] in Nigeria, and 55% [IQR 53-62%] in Tanzania. The lowest scoring was family-centred care [27%, IQR 18-40%] with governance highest scoring [76%, IQR 71-82%]. For level-2 + scores, the overall median score was 41% [IQR 35-51%] with variation by country: 50% [IQR 44-53%] in Kenya, 41% [IQR 35-50%] in Malawi, 33% [IQR 27-37%] in Nigeria, and 41% [IQR 32-52%] in Tanzania. Readiness to provide antibiotics by culture report was the highest-scoring intervention [58%, IQR 50-75%] and neonatal encephalopathy management was the lowest-scoring [21%, IQR 8-42%]. In both methods, overall scores were low (< 50%) for 27 neonatal units in standards-based scoring and 48 neonatal units in level-2 + scoring. No neonatal unit achieved high scores of > 75%. DISCUSSION Two scoring approaches reveal gaps in SSNC readiness with no neonatal units achieving high scores (> 75%). Government-led quality improvement teams can use these summary scores to identify areas for health systems change. Future analyses could determine which items are most directly linked with quality SSNC and newborn outcomes.
Collapse
Affiliation(s)
- Rebecca E Penzias
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK.
| | - Christine Bohne
- Rice360 Institute for Global Health Technologies, Rice University, Texas, USA
- Ifakara Health Institute, Ifakara, Tanzania
| | - Edith Gicheha
- Rice360 Institute for Global Health Technologies, Rice University, Texas, USA
| | - Elizabeth M Molyneux
- Kamuzu University of Health Sciences (Formerly College of Medicine, University of Malawi), Blantyre, Malawi
| | - David Gathara
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| | - Samuel K Ngwala
- Kamuzu University of Health Sciences (Formerly College of Medicine, University of Malawi), Blantyre, Malawi
- School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi
| | - Evelyn Zimba
- Rice360 Institute for Global Health Technologies, Rice University, Texas, USA
| | - Ekran Rashid
- Rice360 Institute for Global Health Technologies, Rice University, Texas, USA
- Aga Khan University Hospital, Nairobi, Kenya
| | - Opeyemi Odedere
- Rice360 Institute for Global Health Technologies, Rice University, Texas, USA
- APIN Public Health Initiatives, Abuja, Nigeria
| | | | | | | | - James H Cross
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| | - Christian Ochieng
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| | - Harriet H Webster
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| | - Msandeni Chiume
- Kamuzu University of Health Sciences (Formerly College of Medicine, University of Malawi), Blantyre, Malawi
- Department of Paediatrics, Kamuzu Central Hospital, Lilongwe, Malawi
| | | | - John Wainaina
- Kenya Medical Research Institute (KEMRI)-Wellcome Trust, Nairobi, Kenya
| | | | - Grace Irimu
- Department of Paediatrics and Child Health, University of Nairobi, Nairobi, Kenya
| | - Steve Adudans
- Academy for Novel Channels in Health and Operations Research (ACANOVA) Africa, Nairobi, Kenya
| | - Femi James
- Newborn Branch, Federal Ministry of Health, Abuja, Nigeria
| | - Olukemi Tongo
- Department of Paediatrics, College of Medicine, University of Ibadan, Ibadan, Nigeria
| | | | - Nahya Salim
- Department of Paediatrics and Child Health, Muhimbili University of Health and Allied Sciences, Dar Es Salaam, Tanzania
| | | | - Maria Oden
- Rice360 Institute for Global Health Technologies, Rice University, Texas, USA
| | | | | | - Gagan Gupta
- Program Group, Health Programme UNICEF Headquarters, New York, NY, USA
| | - Simon Cousens
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| | - Joy E Lawn
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| | - Eric O Ohuma
- Maternal, Adolescent, Reproductive, & Child Health Centre, London School of Hygiene & Tropical Medicine, London, UK
| |
Collapse
|
4
|
Krüger W. Diagnostic algorithm allows for a scientifically robust and reliable retrospective diagnosis using textual evidence from mid-19th century Basel, Switzerland. INTERNATIONAL JOURNAL OF PALEOPATHOLOGY 2024; 44:105-111. [PMID: 38218023 DOI: 10.1016/j.ijpp.2024.01.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 12/17/2023] [Accepted: 01/01/2024] [Indexed: 01/15/2024]
Abstract
OBJECTIVE Diagnosing disease from the past using historic textual sources can be controversial as to its accuracy. To overcome these objections, an empirical approach to the historical clinical data was developed. The approach follows a standardised, objective, and systematic evaluation, satisfying the requirements of the philosophy of science. MATERIAL Physician-managed medical records of mid-19th century patients reported to have suffered from tuberculosis. METHOD A diagnostic algorithm, quantifying clinical data into a scoring system, was developed based on criteria recorded in the medical sources. The findings were compared to the autopsy results using the Receiver Operating Characteristics method. RESULTS The generated scoring system correctly predicted the diagnosis of tuberculosis in 86% of patients in the study. 6% false negatives and 8% false positives were predicted. CONCLUSIONS It is possible to retrospectively diagnose in a reliable and scientifically robust manner under certain conditions. It is important to embed the clinical data into the historical context. A general rejection of retrospective diagnosis is unsubstantiated. Well-designed, disease-specific, and source adapted medical scoring systems are new approaches and overcome criticism raised against retrospective diagnosis. SIGNIFICANCE This new approach utilises diverse historic sources and potentially leads to reliable retrospective diagnosis of most common diseases of the past. LIMITATIONS Selection bias of the records allocated. Quality of the historic sources utilized. Restricted statistical assessment potential of historic sources. SUGGESTIONS FOR FURTHER RESEARCH Development of disease- and epoch-specific medical score systems.
Collapse
Affiliation(s)
- Wolfgang Krüger
- Biological Anthropology, Faculty of Medicine, University of Freiburg, Hebelstrasse 29, D-79104 Freiburg im Breisgau, Germany.
| |
Collapse
|
5
|
Kweh BTS, Lee HQ, Tan T, Liew S, Hunn M, Wee Tee J. Posterior Instrumented Spinal Surgery Outcomes in the Elderly: A Comparison of the 5-Item and 11-Item Modified Frailty Indices. Global Spine J 2024; 14:593-602. [PMID: 35969642 PMCID: PMC10802518 DOI: 10.1177/21925682221117139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
STUDY DESIGN Retrospective Cohort. OBJECTIVES To validate the most concise risk stratification system to date, the 5-item modified frailty index (mFI-5), and compare its effectiveness with the established 11-item modified frailty index (mFI-11) in the elderly population undergoing posterior instrumented spine surgery. METHODS A single centre retrospective review of posterior instrumented spine surgeries in patients aged 65 years and older was conducted. The primary outcome was rate of post-operative major complications (Clavien-Dindo Classification ≥ 4). Secondary outcome measures included rate of all complications, 6-month mortality and surgical site infection. Multi-variate analysis was performed and adjusted receiver operating characteristic curves were generated and compared by DeLong's test. The indices were correlated with Spearman's rho. RESULTS 272 cases were identified. The risk of major complications was independently associated with both the mFI-5 (OR 1.89, 95% CI 1.01-3.55, P = .047) and mFI-11 (OR 3.73, 95% CI 1.90-7.30, P = .000). Both the mFI-5 and mFI-11 were statistically significant predictors of risk of all complications (P = .007 and P = .003), surgical site infection (P = .011 and P = .003) and 6-month mortality (P = .031 and P = .000). Adjusted ROC curves determined statistically similar c-statistics for major complications (.68 vs .68, P = .64), all complications (.66 vs .64, P = .10), surgical site infection (.75 vs .75, P = .76) and 6-month mortality (.83 vs .81, P = .21). The 2 indices correlated very well with a Spearman's rho of .944. CONCLUSIONS The mFI-5 and mFI-11 are equally effective predictors of postoperative morbidity and mortality in this population. The brevity of the mFI-5 is advantageous in facilitating its daily clinical use.
Collapse
Affiliation(s)
- Barry T. S. Kweh
- National Trauma Research Institute, Melbourne, VIC, Australia
- Department of Neurosurgery, Royal Melbourne Hospital, Melbourne, VIC, Australia
- Department of Neurosurgery, The Alfred Hospital, Melbourne, VIC, Australia
| | - Hui Qing Lee
- National Trauma Research Institute, Melbourne, VIC, Australia
- Department of Neurosurgery, The Alfred Hospital, Melbourne, VIC, Australia
| | - Terence Tan
- National Trauma Research Institute, Melbourne, VIC, Australia
- Department of Neurosurgery, The Alfred Hospital, Melbourne, VIC, Australia
| | - Susan Liew
- Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, VIC, Australia
- Department of Orthopaedics, The Alfred Hospital, Melbourne, VIC, Australia
| | - Martin Hunn
- Department of Neurosurgery, The Alfred Hospital, Melbourne, VIC, Australia
- Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, VIC, Australia
| | - Jin Wee Tee
- National Trauma Research Institute, Melbourne, VIC, Australia
- Department of Neurosurgery, The Alfred Hospital, Melbourne, VIC, Australia
- Department of Orthopaedics, The Alfred Hospital, Melbourne, VIC, Australia
| |
Collapse
|
6
|
Corbin CK, Maclay R, Acharya A, Mony S, Punnathanam S, Thapa R, Kotecha N, Shah NH, Chen JH. DEPLOYR: a technical framework for deploying custom real-time machine learning models into the electronic medical record. J Am Med Inform Assoc 2023; 30:1532-1542. [PMID: 37369008 PMCID: PMC10436147 DOI: 10.1093/jamia/ocad114] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 05/16/2023] [Accepted: 06/13/2023] [Indexed: 06/29/2023] Open
Abstract
OBJECTIVE Heatlhcare institutions are establishing frameworks to govern and promote the implementation of accurate, actionable, and reliable machine learning models that integrate with clinical workflow. Such governance frameworks require an accompanying technical framework to deploy models in a resource efficient, safe and high-quality manner. Here we present DEPLOYR, a technical framework for enabling real-time deployment and monitoring of researcher-created models into a widely used electronic medical record system. MATERIALS AND METHODS We discuss core functionality and design decisions, including mechanisms to trigger inference based on actions within electronic medical record software, modules that collect real-time data to make inferences, mechanisms that close-the-loop by displaying inferences back to end-users within their workflow, monitoring modules that track performance of deployed models over time, silent deployment capabilities, and mechanisms to prospectively evaluate a deployed model's impact. RESULTS We demonstrate the use of DEPLOYR by silently deploying and prospectively evaluating 12 machine learning models trained using electronic medical record data that predict laboratory diagnostic results, triggered by clinician button-clicks in Stanford Health Care's electronic medical record. DISCUSSION Our study highlights the need and feasibility for such silent deployment, because prospectively measured performance varies from retrospective estimates. When possible, we recommend using prospectively estimated performance measures during silent trials to make final go decisions for model deployment. CONCLUSION Machine learning applications in healthcare are extensively researched, but successful translations to the bedside are rare. By describing DEPLOYR, we aim to inform machine learning deployment best practices and help bridge the model implementation gap.
Collapse
Affiliation(s)
- Conor K Corbin
- Department of Biomedical Data Science, Stanford, California, USA
| | - Rob Maclay
- Stanford Children’s Health, Palo Alto, California, USA
| | | | | | | | - Rahul Thapa
- Stanford Health Care, Palo Alto, California, USA
| | | | - Nigam H Shah
- Center for Biomedical Informatics Research, Division of Hospital Medicine, Department of Medicine, Stanford University, School of Medicine, Stanford, California, USA
| | - Jonathan H Chen
- Center for Biomedical Informatics Research, Division of Hospital Medicine, Department of Medicine, Stanford University, School of Medicine, Stanford, California, USA
| |
Collapse
|
7
|
Zisberg A. Defensive Nursing and Patient Mobility: Balancing Safety and Autonomy. Res Gerontol Nurs 2023; 16:162-164. [PMID: 37526631 DOI: 10.3928/19404921-20230629-01] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/02/2023]
Affiliation(s)
- Anna Zisberg
- The Cheryl Spencer Department of Nursing, Chair of the Center of Research & Study of Aging, University of Haifa, Mount Carmel, Israel
| |
Collapse
|
8
|
Chari S, Acharya P, Gruen DM, Zhang O, Eyigoz EK, Ghalwash M, Seneviratne O, Saiz FS, Meyer P, Chakraborty P, McGuinness DL. Informing clinical assessment by contextualizing post-hoc explanations of risk prediction models in type-2 diabetes. Artif Intell Med 2023; 137:102498. [PMID: 36868690 DOI: 10.1016/j.artmed.2023.102498] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 11/21/2022] [Accepted: 01/18/2023] [Indexed: 02/05/2023]
Abstract
Medical experts may use Artificial Intelligence (AI) systems with greater trust if these are supported by 'contextual explanations' that let the practitioner connect system inferences to their context of use. However, their importance in improving model usage and understanding has not been extensively studied. Hence, we consider a comorbidity risk prediction scenario and focus on contexts regarding the patients' clinical state, AI predictions about their risk of complications, and algorithmic explanations supporting the predictions. We explore how relevant information for such dimensions can be extracted from Medical guidelines to answer typical questions from clinical practitioners. We identify this as a question answering (QA) task and employ several state-of-the-art Large Language Models (LLM) to present contexts around risk prediction model inferences and evaluate their acceptability. Finally, we study the benefits of contextual explanations by building an end-to-end AI pipeline including data cohorting, AI risk modeling, post-hoc model explanations, and prototyped a visual dashboard to present the combined insights from different context dimensions and data sources, while predicting and identifying the drivers of risk of Chronic Kidney Disease (CKD) - a common type-2 diabetes (T2DM) comorbidity. All of these steps were performed in deep engagement with medical experts, including a final evaluation of the dashboard results by an expert medical panel. We show that LLMs, in particular BERT and SciBERT, can be readily deployed to extract some relevant explanations to support clinical usage. To understand the value-add of the contextual explanations, the expert panel evaluated these regarding actionable insights in the relevant clinical setting. Overall, our paper is one of the first end-to-end analyses identifying the feasibility and benefits of contextual explanations in a real-world clinical use case. Our findings can help improve clinicians' usage of AI models.
Collapse
Affiliation(s)
- Shruthi Chari
- Rensselaer Polytechnic Institute, 110 8th St, Troy, 12180, NY, USA.
| | - Prasant Acharya
- Rensselaer Polytechnic Institute, 110 8th St, Troy, 12180, NY, USA
| | - Daniel M Gruen
- Rensselaer Polytechnic Institute, 110 8th St, Troy, 12180, NY, USA
| | - Olivia Zhang
- Center for Computational Health, IBM Research, 1101 Kitchawan Rd, Yorktown Heights, 10598, NY, USA
| | - Elif K Eyigoz
- Center for Computational Health, IBM Research, 1101 Kitchawan Rd, Yorktown Heights, 10598, NY, USA
| | - Mohamed Ghalwash
- Center for Computational Health, IBM Research, 1101 Kitchawan Rd, Yorktown Heights, 10598, NY, USA
| | | | | | - Pablo Meyer
- Center for Computational Health, IBM Research, 1101 Kitchawan Rd, Yorktown Heights, 10598, NY, USA
| | - Prithwish Chakraborty
- Center for Computational Health, IBM Research, 1101 Kitchawan Rd, Yorktown Heights, 10598, NY, USA
| | | |
Collapse
|
9
|
APLUS: A Python library for usefulness simulations of machine learning models in healthcare. J Biomed Inform 2023; 139:104319. [PMID: 36791900 DOI: 10.1016/j.jbi.2023.104319] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/16/2023]
Abstract
Despite the creation of thousands of machine learning (ML) models, the promise of improving patient care with ML remains largely unrealized. Adoption into clinical practice is lagging, in large part due to disconnects between how ML practitioners evaluate models and what is required for their successful integration into care delivery. Models are just one component of care delivery workflows whose constraints determine clinicians' abilities to act on models' outputs. However, methods to evaluate the usefulness of models in the context of their corresponding workflows are currently limited. To bridge this gap we developed APLUS, a reusable framework for quantitatively assessing via simulation the utility gained from integrating a model into a clinical workflow. We describe the APLUS simulation engine and workflow specification language, and apply it to evaluate a novel ML-based screening pathway for detecting peripheral artery disease at Stanford Health Care.
Collapse
|
10
|
Soleimanpour N, Bann M. Clinical risk calculators informing the decision to admit: A methodologic evaluation and assessment of applicability. PLoS One 2022; 17:e0279294. [PMID: 36534692 PMCID: PMC9762565 DOI: 10.1371/journal.pone.0279294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 12/04/2022] [Indexed: 12/24/2022] Open
Abstract
INTRODUCTION Clinical prediction and decision tools that generate outcome-based risk stratification and/or intervention recommendations are prevalent. Appropriate use and validity of these tools, especially those that inform complex clinical decisions, remains unclear. The objective of this study was to assess the methodologic quality and applicability of clinical risk scoring tools used to guide hospitalization decision-making. METHODS In February 2021, a comprehensive search was performed of a clinical calculator online database (mdcalc.com) that is publicly available and well-known to clinicians. The primary reference for any calculator tool informing outpatient versus inpatient disposition was considered for inclusion. Studies were restricted to the adult, acute care population. Those focused on obstetrics/gynecology or critical care admission were excluded. The Wasson-Laupacis framework of methodologic standards for clinical prediction rules was applied to each study. RESULTS A total of 22 calculators provided hospital admission recommendations for 9 discrete medical conditions using adverse events (14/22), mortality (6/22), or confirmatory diagnosis (2/22) as outcomes of interest. The most commonly met methodologic standards included mathematical technique description (22/22) and clinical sensibility (22/22) and least commonly met included reproducibility of the rule (1/22) and measurement of effect on clinical use (1/22). Description of the studied population was often lacking, especially patient race/ethnicity (2/22) and mental or behavioral health (0/22). Only one study reported any item related to social determinants of health. CONCLUSION Studies commonly do not meet rigorous methodologic standards and often fail to report pertinent details that would guide applicability. These clinical tools focus primarily on specific disease entities and clinical variables, missing the breadth of information necessary to make a disposition determination and raise significant validation and generalizability concerns.
Collapse
Affiliation(s)
| | - Maralyssa Bann
- Department of Medicine, University of Washington School of Medicine, Seattle, Washington, United States of America,Department of Medicine, Harborview Medical Center, Seattle, Washington, United States of America,* E-mail:
| |
Collapse
|
11
|
Loftus TJ, Shickel B, Ozrazgat-Baslanti T, Ren Y, Glicksberg BS, Cao J, Singh K, Chan L, Nadkarni GN, Bihorac A. Artificial intelligence-enabled decision support in nephrology. Nat Rev Nephrol 2022; 18:452-465. [PMID: 35459850 PMCID: PMC9379375 DOI: 10.1038/s41581-022-00562-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/16/2022] [Indexed: 12/12/2022]
Abstract
Kidney pathophysiology is often complex, nonlinear and heterogeneous, which limits the utility of hypothetical-deductive reasoning and linear, statistical approaches to diagnosis and treatment. Emerging evidence suggests that artificial intelligence (AI)-enabled decision support systems - which use algorithms based on learned examples - may have an important role in nephrology. Contemporary AI applications can accurately predict the onset of acute kidney injury before notable biochemical changes occur; can identify modifiable risk factors for chronic kidney disease onset and progression; can match or exceed human accuracy in recognizing renal tumours on imaging studies; and may augment prognostication and decision-making following renal transplantation. Future AI applications have the potential to make real-time, continuous recommendations for discrete actions and yield the greatest probability of achieving optimal kidney health outcomes. Realizing the clinical integration of AI applications will require cooperative, multidisciplinary commitment to ensure algorithm fairness, overcome barriers to clinical implementation, and build an AI-competent workforce. AI-enabled decision support should preserve the pre-eminence of wisdom and augment rather than replace human decision-making. By anchoring intuition with objective predictions and classifications, this approach should favour clinician intuition when it is honed by experience.
Collapse
Affiliation(s)
- Tyler J Loftus
- Department of Surgery, University of Florida Health, Gainesville, FL, USA
| | - Benjamin Shickel
- Department of Medicine, University of Florida Health, Gainesville, FL, USA
| | | | - Yuanfang Ren
- Department of Medicine, University of Florida Health, Gainesville, FL, USA
| | - Benjamin S Glicksberg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jie Cao
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Karandeep Singh
- Department of Learning Health Sciences and Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Lili Chan
- The Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Division of Nephrology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Girish N Nadkarni
- The Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- The Division of Data-Driven and Digital Medicine (D3M), Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Azra Bihorac
- Department of Medicine, University of Florida Health, Gainesville, FL, USA.
| |
Collapse
|
12
|
Atkins D, Makridis CA, Alterovitz G, Ramoni R, Clancy C. Developing and Implementing Predictive Models in a Learning Healthcare System: Traditional and Artificial Intelligence Approaches in the Veterans Health Administration. Annu Rev Biomed Data Sci 2022; 5:393-413. [PMID: 35609894 DOI: 10.1146/annurev-biodatasci-122220-110053] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Predicting clinical risk is an important part of healthcare and can inform decisions about treatments, preventive interventions, and provision of extra services. The field of predictive models has been revolutionized over the past two decades by electronic health record data; the ability to link such data with other demographic, socioeconomic, and geographic information; the availability of high-capacity computing; and new machine learning and artificial intelligence methods for extracting insights from complex datasets. These advances have produced a new generation of computerized predictive models, but debate continues about their development, reporting, validation, evaluation, and implementation. In this review we reflect on more than 10 years of experience at the Veterans Health Administration, the largest integrated healthcare system in the United States, in developing, testing, and implementing such models at scale. We report lessons from the implementation of national risk prediction models and suggest an agenda for research. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- David Atkins
- Office of Research and Development, Department of Veterans Affairs, Washington, DC, USA;
| | - Christos A Makridis
- National Artificial Intelligence Institute, Department of Veterans Affairs, Washington, DC, USA
| | - Gil Alterovitz
- National Artificial Intelligence Institute, Department of Veterans Affairs, Washington, DC, USA
| | - Rachel Ramoni
- Office of Research and Development, Department of Veterans Affairs, Washington, DC, USA;
| | - Carolyn Clancy
- Office of Discovery, Education and Affiliate Networks, Department of Veterans Affairs, Washington, DC, USA
| |
Collapse
|
13
|
Gundle KR. CORR Insights®: International Validation of the SORG Machine-learning Algorithm for Predicting the Survival of Patients with Extremity Metastases Undergoing Surgical Treatment. Clin Orthop Relat Res 2022; 480:379-381. [PMID: 34846306 PMCID: PMC8747605 DOI: 10.1097/corr.0000000000002078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 11/09/2021] [Indexed: 02/03/2023]
Affiliation(s)
- Kenneth R Gundle
- Oregon Health and Science University, Department of Orthopaedics and Rehabilitation Portland VA Medical Center, Operative Care Division, Portland, OR, USA
| |
Collapse
|
14
|
Wollborn J, Hassenzahl LO, Reker D, Staehle HF, Omlor AM, Baar W, Kaufmann KB, Ulbrich F, Wunder C, Utzolino S, Buerkle H, Kalbhenn J, Heinrich S, Goebel U. Diagnosing capillary leak in critically ill patients: development of an innovative scoring instrument for non-invasive detection. Ann Intensive Care 2021; 11:175. [PMID: 34910264 PMCID: PMC8674404 DOI: 10.1186/s13613-021-00965-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 12/02/2021] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The concomitant occurrence of the symptoms intravascular hypovolemia, peripheral edema and hemodynamic instability is typically named Capillary Leak Syndrome (CLS) and often occurs in surgical critical ill patients. However, neither a unitary definition nor standardized diagnostic criteria exist so far. We aimed to investigate common characteristics of this phenomenon with a subsequent scoring system, determining whether CLS contributes to mortality. METHODS We conducted this single-center, observational, multidisciplinary, prospective trial in two separately run surgical ICUs of a tertiary academic medical center. 200 surgical patients admitted to the ICU and 30 healthy volunteers were included. Patients were clinically diagnosed as CLS or No-CLS group (each N = 100) according to the grade of edema, intravascular hypovolemia, hemodynamic instability, and positive fluid balance by two independent attending physicians with > 10 years of experience in ICU. We performed daily measurements with non-invasive body impedance electrical analysis, ultrasound and analysis of serum biomarkers to generate objective diagnostic criteria. Receiver operating characteristics were used, while we developed machine learning models to increase diagnostic specifications for our scoring model. RESULTS The 30-day mortility was increased among CLS patients (12 vs. 1%, P = 0.002), while showing higher SOFA-scores. Extracellular water was increased in patients with CLS with higher echogenicity of subcutaneous tissue [29(24-31) vs. 19(16-21), P < 0.001]. Biomarkers showed characteristic alterations, especially with an increased angiopoietin-2 concentration in CLS [9.9(6.2-17.3) vs. 3.7(2.6-5.6)ng/mL, P < 0.001]. We developed a score using seven parameters (echogenicity, SOFA-score, angiopoietin-2, syndecan-1, ICAM-1, lactate and interleukin-6). A Random Forest prediction model boosted its diagnostic characteristics (AUC 0.963, P < 0.001), while a two-parameter decision tree model showed good specifications (AUC 0.865). CONCLUSIONS Diagnosis of CLS in critically ill patients is feasible by objective, non-invasive parameters using the CLS-Score. A simplified two-parameter diagnostic approach can enhance clinical utility. CLS contributes to mortality and should, therefore, classified as an independent entity. TRIAL REGISTRATION German Clinical Trials Registry (DRKS No. 00012713), Date of registration 10/05/2017, www.drks.de.
Collapse
Affiliation(s)
- Jakob Wollborn
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany. .,Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA. .,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.
| | - Lars O Hassenzahl
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Daniel Reker
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Hans Felix Staehle
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Anne Marie Omlor
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Wolfgang Baar
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Kai B Kaufmann
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Felix Ulbrich
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Christian Wunder
- Department of Anesthesiology and Critical Care, Robert-Bosch-Krankenhaus, Stuttgart, Germany
| | - Stefan Utzolino
- Department of General and Visceral Surgery, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Hartmut Buerkle
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Johannes Kalbhenn
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Sebastian Heinrich
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Ulrich Goebel
- Department of Anesthesiology and Critical Care, Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Department of Anesthesiology and Critical Care, St. Franziskus-Hospital, Muenster, Germany
| |
Collapse
|
15
|
Ostberg NP, Zafar MA, Elefteriades JA. Machine learning: principles and applications for thoracic surgery. Eur J Cardiothorac Surg 2021; 60:213-221. [PMID: 33748840 DOI: 10.1093/ejcts/ezab095] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 01/25/2021] [Accepted: 01/27/2021] [Indexed: 12/20/2022] Open
Abstract
OBJECTIVES Machine learning (ML) has experienced a revolutionary decade with advances across many disciplines. We seek to understand how recent advances in ML are going to specifically influence the practice of surgery in the future with a particular focus on thoracic surgery. METHODS Review of relevant literature in both technical and clinical domains. RESULTS ML is a revolutionary technology that promises to change the way that surgery is practiced in the near future. Spurred by an advance in computing power and the volume of data produced in healthcare, ML has shown remarkable ability to master tasks that had once been reserved for physicians. Supervised learning, unsupervised learning and reinforcement learning are all important techniques that can be leveraged to improve care. Five key applications of ML to cardiac surgery include diagnostics, surgical skill assessment, postoperative prognostication, augmenting intraoperative performance and accelerating translational research. Some key limitations of ML include lack of interpretability, low quality and volumes of relevant clinical data, ethical limitations and difficulties with clinical implementation. CONCLUSIONS In the future, the practice of cardiac surgery will be greatly augmented by ML technologies, ultimately leading to improved surgical performance and better patient outcomes.
Collapse
Affiliation(s)
- Nicolai P Ostberg
- Aortic Institute at Yale-New Haven Hospital, Yale University School of Medicine, New Haven, CT, USA.,New York University Grossman School of Medicine, New York, NY, USA
| | - Mohammad A Zafar
- Aortic Institute at Yale-New Haven Hospital, Yale University School of Medicine, New Haven, CT, USA
| | - John A Elefteriades
- Aortic Institute at Yale-New Haven Hospital, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
16
|
Welsh P, Welsh CE, Jhund PS, Woodward M, Brown R, Lewsey J, Celis-Morales CA, Ho FK, MacKay DF, Gill JM, Gray SR, Katikireddi SV, Pell JP, Forbes J, Sattar N. Derivation and Validation of a 10-Year Risk Score for Symptomatic Abdominal Aortic Aneurysm: Cohort Study of Nearly 500 000 Individuals. Circulation 2021; 144:604-614. [PMID: 34167317 PMCID: PMC8378547 DOI: 10.1161/circulationaha.120.053022] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
BACKGROUND Abdominal aortic aneurysm (AAA) can occur in patients who are ineligible for routine ultrasound screening. A simple AAA risk score was derived and compared with current guidelines used for ultrasound screening of AAA. METHODS United Kingdom Biobank participants without previous AAA were split into a derivation cohort (n=401 820, 54.6% women, mean age 56.4 years, 95.5% White race) and validation cohort (n=83 816). Incident AAA was defined as first hospital inpatient diagnosis of AAA, death from AAA, or an AAA-related surgical procedure. A multivariable Cox model was developed in the derivation cohort into an AAA risk score that did not require blood biomarkers. To illustrate the sensitivity and specificity of the risk score for AAA, a theoretical threshold to refer patients for ultrasound at 0.25% 10-year risk was modeled. Discrimination of the risk score was compared with a model of US Preventive Services Task Force (USPSTF) AAA screening guidelines. RESULTS In the derivation cohort, there were 1570 (0.40%) cases of AAA over a median 11.3 years of follow-up. Components of the AAA risk score were age (stratified by smoking status), weight (stratified by smoking status), antihypertensive and cholesterol-lowering medication use, height, diastolic blood pressure, baseline cardiovascular disease, and diabetes. In the validation cohort, over 10 years of follow-up, the C-index for the model of the USPSTF guidelines was 0.705 (95% CI, 0.678-0.733). The C-index of the risk score as a continuous variable was 0.856 (95% CI, 0.837-0.878). In the validation cohort, the USPSTF model yielded sensitivity 63.9% and specificity 71.3%. At the 0.25% 10-year risk threshold, the risk score yielded sensitivity 82.1% and specificity 70.7% while also improving the net reclassification index compared with the USPSTF model +0.176 (95% CI, 0.120-0.232). A combined model, whereby risk scoring was combined with the USPSTF model, also improved prediction compared with USPSTF alone (net reclassification index +0.101 [95% CI, 0.055-0.147]). CONCLUSIONS In an asymptomatic general population, a risk score based on patient age, height, weight, and medical history may improve identification of asymptomatic patients at risk for clinical events from AAA. Further development and validation of risk scores to detect asymptomatic AAA are needed.
Collapse
Affiliation(s)
- Paul Welsh
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| | - Claire E. Welsh
- Population and Health Sciences Institute, Newcastle University, United Kingdom (C.E.W.)
| | - Pardeep S. Jhund
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| | - Mark Woodward
- The George Institute for Global Health, School of Public Health, Imperial College London, United Kingdom (M.W.).,The George Institute for Global Health, University of New South Wales, Sydney, Australia (M.W.).,Department of Epidemiology, Johns Hopkins University, Baltimore, Maryland (M.W.)
| | - Rosemary Brown
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| | - Jim Lewsey
- Institute of Health & Wellbeing (J.L., F.K.H., D.F.M., S.V.K., J.P.P.), University of Glasgow, United Kingdom
| | - Carlos A. Celis-Morales
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| | - Frederick K. Ho
- Institute of Health & Wellbeing (J.L., F.K.H., D.F.M., S.V.K., J.P.P.), University of Glasgow, United Kingdom
| | | | - Jason M.R. Gill
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| | - Stuart R. Gray
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| | - S. Vittal Katikireddi
- Institute of Health & Wellbeing (J.L., F.K.H., D.F.M., S.V.K., J.P.P.), University of Glasgow, United Kingdom
| | - Jill P. Pell
- Institute of Health & Wellbeing (J.L., F.K.H., D.F.M., S.V.K., J.P.P.), University of Glasgow, United Kingdom
| | | | - Naveed Sattar
- Institute of Cardiovascular and Medical Sciences (P.W., P.S.J., R.B., C.A.C.-M., J.M.R.G., S.R.G., N.S.), University of Glasgow, United Kingdom
| |
Collapse
|
17
|
Patel BS, Steinberg E, Pfohl SR, Shah NH. Learning decision thresholds for risk stratification models from aggregate clinician behavior. J Am Med Inform Assoc 2021; 28:2258-2264. [PMID: 34350942 PMCID: PMC8449610 DOI: 10.1093/jamia/ocab159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 06/26/2021] [Accepted: 07/13/2021] [Indexed: 11/22/2022] Open
Abstract
Using a risk stratification model to guide clinical practice often requires the choice of a cutoff—called the decision threshold—on the model’s output to trigger a subsequent action such as an electronic alert. Choosing this cutoff is not always straightforward. We propose a flexible approach that leverages the collective information in treatment decisions made in real life to learn reference decision thresholds from physician practice. Using the example of prescribing a statin for primary prevention of cardiovascular disease based on 10-year risk calculated by the 2013 pooled cohort equations, we demonstrate the feasibility of using real-world data to learn the implicit decision threshold that reflects existing physician behavior. Learning a decision threshold in this manner allows for evaluation of a proposed operating point against the threshold reflective of the community standard of care. Furthermore, this approach can be used to monitor and audit model-guided clinical decision making following model deployment.
Collapse
Affiliation(s)
- Birju S Patel
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA
| | - Ethan Steinberg
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA
| | - Stephen R Pfohl
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA
| | - Nigam H Shah
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA
- Corresponding Author: Nigam H. Shah, MBBS, PhD, Stanford Center for Biomedical Informatics Research, Stanford University, 1265 Welch Road, Stanford, CA 94305, USA;
| |
Collapse
|
18
|
Guo LL, Pfohl SR, Fries J, Posada J, Fleming SL, Aftandilian C, Shah N, Sung L. Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine. Appl Clin Inform 2021; 12:808-815. [PMID: 34470057 PMCID: PMC8410238 DOI: 10.1055/s-0041-1735184] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 07/12/2021] [Indexed: 10/20/2022] Open
Abstract
OBJECTIVE The change in performance of machine learning models over time as a result of temporal dataset shift is a barrier to machine learning-derived models facilitating decision-making in clinical practice. Our aim was to describe technical procedures used to preserve the performance of machine learning models in the presence of temporal dataset shifts. METHODS Studies were included if they were fully published articles that used machine learning and implemented a procedure to mitigate the effects of temporal dataset shift in a clinical setting. We described how dataset shift was measured, the procedures used to preserve model performance, and their effects. RESULTS Of 4,457 potentially relevant publications identified, 15 were included. The impact of temporal dataset shift was primarily quantified using changes, usually deterioration, in calibration or discrimination. Calibration deterioration was more common (n = 11) than discrimination deterioration (n = 3). Mitigation strategies were categorized as model level or feature level. Model-level approaches (n = 15) were more common than feature-level approaches (n = 2), with the most common approaches being model refitting (n = 12), probability calibration (n = 7), model updating (n = 6), and model selection (n = 6). In general, all mitigation strategies were successful at preserving calibration but not uniformly successful in preserving discrimination. CONCLUSION There was limited research in preserving the performance of machine learning models in the presence of temporal dataset shift in clinical medicine. Future research could focus on the impact of dataset shift on clinical decision making, benchmark the mitigation strategies on a wider range of datasets and tasks, and identify optimal strategies for specific settings.
Collapse
Affiliation(s)
- Lin Lawrence Guo
- Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Canada
| | - Stephen R. Pfohl
- Biomedical Informatics Research, Stanford University, Palo Alto, California, United States
| | - Jason Fries
- Biomedical Informatics Research, Stanford University, Palo Alto, California, United States
| | - Jose Posada
- Biomedical Informatics Research, Stanford University, Palo Alto, California, United States
| | - Scott Lanyon Fleming
- Biomedical Informatics Research, Stanford University, Palo Alto, California, United States
| | - Catherine Aftandilian
- Division of Pediatric Hematology/Oncology, Stanford University, Palo Alto, United States
| | - Nigam Shah
- Biomedical Informatics Research, Stanford University, Palo Alto, California, United States
| | - Lillian Sung
- Program in Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Canada
- Division of Haematology/Oncology, The Hospital for Sick Children, Toronto, Canada
| |
Collapse
|
19
|
Sebban S, Evenou D, Jung C, Fausser C, Durand S, Bibal M, Geninasca V, Saux M, Jeulin J. Bronchial Clearance Physiotherapy in Pediatrics. A Controlled, Randomized, Multicenter Study of the Short-Term Effects on Respiration during Outpatient Care for Infants with Acute Bronchiolitis. JOURNAL OF CHILD SCIENCE 2021. [DOI: 10.1055/s-0041-1731304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- S. Sebban
- Department of Physiotherapy, Association des Réseaux Bronchiolite. Teaching Hospital (CHU) Robert-Debré-APHP, Paris, France
| | - D. Evenou
- Department of Physiotherapy, Association des Réseaux Bronchiolite, Teaching Hospital (CHU) Robert-Debré- APHP, Paris, France
| | - C. Jung
- Department of Paediatrics, Clinical Research Centre, Centre Hospitalier Intercommunal de Créteil, Créteil, France
- Department of Massage Therapy/Physiotherapy, Paris Pubic Hospitals Group (APHP), Paris, France
| | - C. Fausser
- Department of Physiotherapy, Association des Réseaux Bronchiolite. Teaching Hospital (CHU) Robert-Debré-APHP, Paris, France
| | - S. Durand
- Department of Massage Therapy/Physiotherapy, Réseau bronchiolite Ile de France
| | - M. Bibal
- Department of Massage Therapy/Physiotherapy, Réseau bronchiolite Ile de France
| | - V. Geninasca
- Department of Massage Therapy/Physiotherapy, Réseau Bronchiolite Ile de France
| | - M. Saux
- Department of Massage Therapy/Physiotherapy, Réseau Bronchiolite Ile de France
| | - J.C. Jeulin
- Department of Massage Therapy/Physiotherapy, Réseau Bronchiolite, France
| |
Collapse
|
20
|
Jung K, Kashyap S, Avati A, Harman S, Shaw H, Li R, Smith M, Shum K, Javitz J, Vetteth Y, Seto T, Bagley SC, Shah NH. A framework for making predictive models useful in practice. J Am Med Inform Assoc 2021; 28:1149-1158. [PMID: 33355350 PMCID: PMC8200271 DOI: 10.1093/jamia/ocaa318] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 11/27/2020] [Indexed: 01/03/2023] Open
Abstract
OBJECTIVE To analyze the impact of factors in healthcare delivery on the net benefit of triggering an Advanced Care Planning (ACP) workflow based on predictions of 12-month mortality. MATERIALS AND METHODS We built a predictive model of 12-month mortality using electronic health record data and evaluated the impact of healthcare delivery factors on the net benefit of triggering an ACP workflow based on the models' predictions. Factors included nonclinical reasons that make ACP inappropriate: limited capacity for ACP, inability to follow up due to patient discharge, and availability of an outpatient workflow to follow up on missed cases. We also quantified the relative benefits of increasing capacity for inpatient ACP versus outpatient ACP. RESULTS Work capacity constraints and discharge timing can significantly reduce the net benefit of triggering the ACP workflow based on a model's predictions. However, the reduction can be mitigated by creating an outpatient ACP workflow. Given limited resources to either add capacity for inpatient ACP versus developing outpatient ACP capability, the latter is likely to provide more benefit to patient care. DISCUSSION The benefit of using a predictive model for identifying patients for interventions is highly dependent on the capacity to execute the workflow triggered by the model. We provide a framework for quantifying the impact of healthcare delivery factors and work capacity constraints on achieved benefit. CONCLUSION An analysis of the sensitivity of the net benefit realized by a predictive model triggered clinical workflow to various healthcare delivery factors is necessary for making predictive models useful in practice.
Collapse
Affiliation(s)
- Kenneth Jung
- Stanford Center for Biomedical Informatics, School of Medicine, Stanford University, Stanford, California, USA
| | - Sehj Kashyap
- Stanford Center for Biomedical Informatics, School of Medicine, Stanford University, Stanford, California, USA
| | - Anand Avati
- Department of Computer Science, School of Engineering, Stanford University, Stanford, California, USA
| | - Stephanie Harman
- Department of Medicine, School of Medicine, Stanford University, Stanford, California, USA
| | | | - Ron Li
- Department of Medicine, School of Medicine, Stanford University, Stanford, California, USA
| | - Margaret Smith
- Department of Medicine, School of Medicine, Stanford University, Stanford, California, USA
| | - Kenny Shum
- Department of Technology and Digital Solutions, Stanford Medicine, Stanford, California, USA
| | - Jacob Javitz
- Department of Technology and Digital Solutions, Stanford Medicine, Stanford, California, USA
| | - Yohan Vetteth
- Department of Technology and Digital Solutions, Stanford Medicine, Stanford, California, USA
| | - Tina Seto
- Department of Technology and Digital Solutions, Stanford Medicine, Stanford, California, USA
| | - Steven C Bagley
- Stanford Center for Biomedical Informatics, School of Medicine, Stanford University, Stanford, California, USA
| | - Nigam H Shah
- Stanford Center for Biomedical Informatics, School of Medicine, Stanford University, Stanford, California, USA
| |
Collapse
|
21
|
McIntosh C, Conroy L, Tjong MC, Craig T, Bayley A, Catton C, Gospodarowicz M, Helou J, Isfahanian N, Kong V, Lam T, Raman S, Warde P, Chung P, Berlin A, Purdie TG. Clinical integration of machine learning for curative-intent radiation treatment of patients with prostate cancer. Nat Med 2021; 27:999-1005. [PMID: 34083812 DOI: 10.1038/s41591-021-01359-w] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Accepted: 04/20/2021] [Indexed: 12/20/2022]
Abstract
Machine learning (ML) holds great promise for impacting healthcare delivery; however, to date most methods are tested in 'simulated' environments that cannot recapitulate factors influencing real-world clinical practice. We prospectively deployed and evaluated a random forest algorithm for therapeutic curative-intent radiation therapy (RT) treatment planning for prostate cancer in a blinded, head-to-head study with full integration into the clinical workflow. ML- and human-generated RT treatment plans were directly compared in a retrospective simulation with retesting (n = 50) and a prospective clinical deployment (n = 50) phase. Consistently throughout the study phases, treating physicians assessed ML- and human-generated RT treatment plans in a blinded manner following a priori defined standardized criteria and peer review processes, with the selected RT plan in the prospective phase delivered for patient treatment. Overall, 89% of ML-generated RT plans were considered clinically acceptable and 72% were selected over human-generated RT plans in head-to-head comparisons. RT planning using ML reduced the median time required for the entire RT planning process by 60.1% (118 to 47 h). While ML RT plan acceptability remained stable between the simulation and deployment phases (92 versus 86%), the number of ML RT plans selected for treatment was significantly reduced (83 versus 61%, respectively). These findings highlight that retrospective or simulated evaluation of ML methods, even under expert blinded review, may not be representative of algorithm acceptance in a real-world clinical setting when patient care is at stake.
Collapse
Affiliation(s)
- Chris McIntosh
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Techna Institute, University Health Network, Toronto, Ontario, Canada.,Peter Munk Cardiac Centre, University Health Network, Toronto, Ontario, Canada.,Joint Department of Medical Imaging, University Health Network, Toronto, Ontario, Canada.,Vector Institute, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Leigh Conroy
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Techna Institute, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Michael C Tjong
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Tim Craig
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Techna Institute, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Andrew Bayley
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Charles Catton
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Mary Gospodarowicz
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Joelle Helou
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Naghmeh Isfahanian
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Vickie Kong
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Tony Lam
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Srinivas Raman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Padraig Warde
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Peter Chung
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada
| | - Alejandro Berlin
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada. .,Techna Institute, University Health Network, Toronto, Ontario, Canada. .,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada.
| | - Thomas G Purdie
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada. .,Techna Institute, University Health Network, Toronto, Ontario, Canada. .,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada. .,Department of Radiation Oncology, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
22
|
Ko M, Chen E, Agrawal A, Rajpurkar P, Avati A, Ng A, Basu S, Shah NH. Improving hospital readmission prediction using individualized utility analysis. J Biomed Inform 2021; 119:103826. [PMID: 34087428 DOI: 10.1016/j.jbi.2021.103826] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 05/23/2021] [Accepted: 05/28/2021] [Indexed: 11/24/2022]
Abstract
OBJECTIVE Machine learning (ML) models for allocating readmission-mitigating interventions are typically selected according to their discriminative ability, which may not necessarily translate into utility in allocation of resources. Our objective was to determine whether ML models for allocating readmission-mitigating interventions have different usefulness based on their overall utility and discriminative ability. MATERIALS AND METHODS We conducted a retrospective utility analysis of ML models using claims data acquired from the Optum Clinformatics Data Mart, including 513,495 commercially-insured inpatients (mean [SD] age 69 [19] years; 294,895 [57%] Female) over the period January 2016 through January 2017 from all 50 states with mean 90 day cost of $11,552. Utility analysis estimates the cost, in dollars, of allocating interventions for lowering readmission risk based on the reduction in the 90-day cost. RESULTS Allocating readmission-mitigating interventions based on a GBDT model trained to predict readmissions achieved an estimated utility gain of $104 per patient, and an AUC of 0.76 (95% CI 0.76, 0.77); allocating interventions based on a model trained to predict cost as a proxy achieved a higher utility of $175.94 per patient, and an AUC of 0.62 (95% CI 0.61, 0.62). A hybrid model combining both intervention strategies is comparable with the best models on either metric. Estimated utility varies by intervention cost and efficacy, with each model performing the best under different intervention settings. CONCLUSION We demonstrate that machine learning models may be ranked differently based on overall utility and discriminative ability. Machine learning models for allocation of limited health resources should consider directly optimizing for utility.
Collapse
Affiliation(s)
- Michael Ko
- Department of Computer Science, Stanford University, CA, USA
| | - Emma Chen
- Department of Computer Science, Stanford University, CA, USA
| | - Ashwin Agrawal
- Department of Computer Science, Stanford University, CA, USA
| | | | - Anand Avati
- Department of Computer Science, Stanford University, CA, USA
| | - Andrew Ng
- Department of Computer Science, Stanford University, CA, USA
| | - Sanjay Basu
- Center for Primary Care, Harvard Medical School, MA, USA
| | - Nigam H Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA
| |
Collapse
|
23
|
Hernandez-Boussard T, Bozkurt S, Ioannidis JPA, Shah NH. MINIMAR (MINimum Information for Medical AI Reporting): Developing reporting standards for artificial intelligence in health care. J Am Med Inform Assoc 2020; 27:2011-2015. [PMID: 32594179 PMCID: PMC7727333 DOI: 10.1093/jamia/ocaa088] [Citation(s) in RCA: 138] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 04/24/2020] [Accepted: 04/29/2020] [Indexed: 12/23/2022] Open
Abstract
The rise of digital data and computing power have contributed to significant advancements in artificial intelligence (AI), leading to the use of classification and prediction models in health care to enhance clinical decision-making for diagnosis, treatment and prognosis. However, such advances are limited by the lack of reporting standards for the data used to develop those models, the model architecture, and the model evaluation and validation processes. Here, we present MINIMAR (MINimum Information for Medical AI Reporting), a proposal describing the minimum information necessary to understand intended predictions, target populations, and hidden biases, and the ability to generalize these emerging technologies. We call for a standard to accurately and responsibly report on AI in health care. This will facilitate the design and implementation of these models and promote the development and use of associated clinical decision support tools, as well as manage concerns regarding accuracy and bias.
Collapse
Affiliation(s)
- Tina Hernandez-Boussard
- Department of Medicine, Stanford University, Stanford, California, USA
- Department of Biomedical Data Science, Stanford University, Stanford, California, USA
- Department of Surgery, Stanford University, Stanford, California, USA
| | - Selen Bozkurt
- Department of Medicine, Stanford University, Stanford, California, USA
| | - John P A Ioannidis
- Department of Medicine, Stanford University, Stanford, California, USA
- Department of Statistics, Stanford University, Stanford, California, USA
- Meta-Research Innovation Center at Stanford, Stanford University, Stanford, California, USA
| | - Nigam H Shah
- Department of Medicine, Stanford University, Stanford, California, USA
- Department of Biomedical Data Science, Stanford University, Stanford, California, USA
| |
Collapse
|
24
|
O'Shaughnessy F, O'Reilly D, Ní Áinle F. Current opinion and emerging trends on the treatment, diagnosis, and prevention of pregnancy-associated venous thromboembolic disease: a review. Transl Res 2020; 225:20-32. [PMID: 32554071 DOI: 10.1016/j.trsl.2020.06.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/01/2020] [Revised: 05/10/2020] [Accepted: 06/09/2020] [Indexed: 12/23/2022]
Abstract
Pregnancy associated venous thromboembolism (PA-VTE) is a leading cause of maternal morbidity and mortality worldwide. Despite the availability of international guidance on the prevention, diagnosis and treatment, practice differs between countries and clinical institutions. The evidence base in this area is limited due to the vulnerable population who are affected, with the majority of guidelines deriving their recommendations from experience in surgical and medical venous thromboembolic disease. This review includes best evidence in PA-VTE management, highlighting specific literature which supports current diagnosis, prevention, and treatment strategies. Additionally, we hope to demonstrate emerging trends in the field through discussion of ongoing trials designed to progress towards evidence-based practice in the context of PA-VTE.
Collapse
Affiliation(s)
- Fergal O'Shaughnessy
- Pharmacy Department, Rotunda Hospital, Dublin 1, Ireland; Division of Population Health Sciences, Royal College of Surgeons in Ireland, Dublin 2, Ireland; Department of Haematology, Mater University Hospital, Dublin 7, Ireland
| | - Daniel O'Reilly
- Department of Paediatrics, Children's Health Ireland at Tallaght, Dublin 24, Ireland; SPHERE research group, Conway Institute, University College Dublin, Dublin 4, Ireland; Department of Haematology, Mater University Hospital, Dublin 7, Ireland.
| | - Fionnuala Ní Áinle
- SPHERE research group, Conway Institute, University College Dublin, Dublin 4, Ireland; School of Medicine, University College Dublin, Dublin 4, Ireland; Department of Haematology, Rotunda Hospital, Dublin 1, Ireland; Department of Haematology, Mater University Hospital, Dublin 7, Ireland
| |
Collapse
|
25
|
Li RC, Asch SM, Shah NH. Developing a delivery science for artificial intelligence in healthcare. NPJ Digit Med 2020; 3:107. [PMID: 32885053 PMCID: PMC7443141 DOI: 10.1038/s41746-020-00318-y] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 07/06/2020] [Indexed: 11/09/2022] Open
Abstract
Artificial Intelligence (AI) has generated a large amount of excitement in healthcare, mostly driven by the emergence of increasingly accurate machine learning models. However, the promise of AI delivering scalable and sustained value for patient care in the real world setting has yet to be realized. In order to safely and effectively bring AI into use in healthcare, there needs to be a concerted effort around not just the creation, but also the delivery of AI. This AI "delivery science" will require a broader set of tools, such as design thinking, process improvement, and implementation science, as well as a broader definition of what AI will look like in practice, which includes not just machine learning models and their predictions, but also the new systems for care delivery that they enable. The careful design, implementation, and evaluation of these AI enabled systems will be important in the effort to understand how AI can improve healthcare.
Collapse
Affiliation(s)
- Ron C. Li
- Division of Hospital Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
- Center for Biomedical Informatics Research, Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Steven M. Asch
- Division of Primary Care and Population Health, Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
- Center for Innovation to Implementation, Department of Veterans Affairs, Palo Alto, CA USA
| | - Nigam H. Shah
- Center for Biomedical Informatics Research, Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| |
Collapse
|
26
|
Seveso A, Campagner A, Ciucci D, Cabitza F. Ordinal labels in machine learning: a user-centered approach to improve data validity in medical settings. BMC Med Inform Decis Mak 2020; 20:142. [PMID: 32819345 PMCID: PMC7439656 DOI: 10.1186/s12911-020-01152-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Accepted: 06/08/2020] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Despite the vagueness and uncertainty that is intrinsic in any medical act, interpretation and decision (including acts of data reporting and representation of relevant medical conditions), still little research has focused on how to explicitly take this uncertainty into account. In this paper, we focus on the representation of a general and wide-spread medical terminology, which is grounded on a traditional and well-established convention, to represent severity of health conditions (for instance, pain, visible signs), ranging from Absent to Extreme. Specifically, we will study how both potential patients and doctors perceive the different levels of the terminology in both quantitative and qualitative terms, and if the embedded user knowledge could improve the representation of ordinal values in the construction of machine learning models. METHODS To this aim, we conducted a questionnaire-based research study involving a relatively large sample of 1,152 potential patients and 31 clinicians to represent numerically the perceived meaning of standard and widely-applied labels to describe health conditions. Using these collected values, we then present and discuss different possible fuzzy-set based representations that address the vagueness of medical interpretation by taking into account the perceptions of domain experts. We also apply the findings of this user study to evaluate the impact of different encodings on the predictive performance of common machine learning models in regard to a real-world medical prognostic task. RESULTS We found significant differences in the perception of pain levels between the two user groups. We also show that the proposed encodings can improve the performances of specific classes of models, and discuss when this is the case. CONCLUSIONS In perspective, our hope is that the proposed techniques for ordinal scale representation and ordinal encoding may be useful to the research community, and also that our methodology will be applied to other widely used ordinal scales for improving validity of datasets and bettering the results of machine learning tasks.
Collapse
Affiliation(s)
- Andrea Seveso
- Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, Viale Sarca 336, Milan, 20126, Italy
| | - Andrea Campagner
- IRCCS Istituto Ortopedico Galeazzi, Via Riccardo Galeazzi 4, Milan, 20161, Italy
| | - Davide Ciucci
- Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, Viale Sarca 336, Milan, 20126, Italy
| | - Federico Cabitza
- Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, Viale Sarca 336, Milan, 20126, Italy
| |
Collapse
|
27
|
Robert R, Kentish-Barnes N, Boyer A, Laurent A, Azoulay E, Reignier J. Ethical dilemmas due to the Covid-19 pandemic. Ann Intensive Care 2020; 10:84. [PMID: 32556826 PMCID: PMC7298921 DOI: 10.1186/s13613-020-00702-7] [Citation(s) in RCA: 158] [Impact Index Per Article: 39.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 06/11/2020] [Indexed: 01/04/2023] Open
Abstract
The devastating pandemic that has stricken the worldwide population induced an unprecedented influx of patients in ICUs, raising ethical concerns not only surrounding triage and withdrawal of life support decisions, but also regarding family visits and quality of end-of-life support. These ingredients are liable to shake up our ethical principles, sharpen our ethical dilemmas, and lead to situations of major caregiver sufferings. Proposals have been made to rationalize triage policies in conjunction with ethical justifications. However, whatever the angle of approach, imbalance between utilitarian and individual ethics leads to unsolvable discomforts that caregivers will need to overcome. With this in mind, we aimed to point out some critical ethical choices with which ICU caregivers have been confronted during the Covid-19 pandemic and to underline their limits. The formalized strategies integrating the relevant tools of ethical reflection were disseminated without deviating from usual practices, leaving to intensivists the ultimate choice of decision.
Collapse
Affiliation(s)
- René Robert
- Université de Poitiers, Poitiers, France.
- Inserm CIC 1402, Axe Alive, Poitiers, France.
- Service de Médecine Intensive Réanimation, CHU Poitiers, Poitiers, France.
| | - Nancy Kentish-Barnes
- Service de Réanimation Médicale, APHP, CHU Saint-Louis, Paris, France
- Groupe de Recherche Famiréa, Paris, France
| | - Alexandre Boyer
- Université de Bordeaux, Bordeaux, France
- Service de Médecine Intensive Réanimation, CHU Bordeaux, Bordeaux, France
| | - Alexandra Laurent
- Laboratoire psy-DREPI, Université de Bourgogne Franche-Comté, 7458, Dijon, France
- Service de Réanimation Chirurgicale, Dijon, France
| | - Elie Azoulay
- Service de Réanimation Médicale, APHP, CHU Saint-Louis, Paris, France
- Groupe de Recherche Famiréa, Paris, France
| | - Jean Reignier
- Université de Nantes, Nantes, France
- Service de Médecine Intensive Réanimation, CHU de Nantes, Nantes, France
| |
Collapse
|
28
|
Carayon P, Hoonakker P, Hundt AS, Salwei M, Wiegmann D, Brown RL, Kleinschmidt P, Novak C, Pulia M, Wang Y, Wirkus E, Patterson B. Application of human factors to improve usability of clinical decision support for diagnostic decision-making: a scenario-based simulation study. BMJ Qual Saf 2020; 29:329-340. [PMID: 31776197 PMCID: PMC7490974 DOI: 10.1136/bmjqs-2019-009857] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 10/11/2019] [Accepted: 11/05/2019] [Indexed: 12/14/2022]
Abstract
OBJECTIVE In this study, we used human factors (HF) methods and principles to design a clinical decision support (CDS) that provides cognitive support to the pulmonary embolism (PE) diagnostic decision-making process in the emergency department. We hypothesised that the application of HF methods and principles will produce a more usable CDS that improves PE diagnostic decision-making, in particular decision about appropriate clinical pathway. MATERIALS AND METHODS We conducted a scenario-based simulation study to compare a HF-based CDS (the so-called CDS for PE diagnosis (PE-Dx CDS)) with a web-based CDS (MDCalc); 32 emergency physicians performed various tasks using both CDS. PE-Dx integrated HF design principles such as automating information acquisition and analysis, and minimising workload. We assessed all three dimensions of usability using both objective and subjective measures: effectiveness (eg, appropriate decision regarding the PE diagnostic pathway), efficiency (eg, time spent, perceived workload) and satisfaction (perceived usability of CDS). RESULTS Emergency physicians made more appropriate diagnostic decisions (94% with PE-Dx; 84% with web-based CDS; p<0.01) and performed experimental tasks faster with the PE-Dx CDS (on average 96 s per scenario with PE-Dx; 117 s with web-based CDS; p<0.001). They also reported lower workload (p<0.001) and higher satisfaction (p<0.001) with PE-Dx. CONCLUSIONS This simulation study shows that HF methods and principles can improve usability of CDS and diagnostic decision-making. Aspects of the HF-based CDS that provided cognitive support to emergency physicians and improved diagnostic performance included automation of information acquisition (eg, auto-populating risk scoring algorithms), minimisation of workload and support of decision selection (eg, recommending a clinical pathway). These HF design principles can be applied to the design of other CDS technologies to improve diagnostic safety.
Collapse
Affiliation(s)
- Pascale Carayon
- Department of Industrial and Systems Engineering, Wisconsin Institute for Healthcare Systems Engineering, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Peter Hoonakker
- Center for Quality and Productivity Improvement, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Ann Schoofs Hundt
- Center for Quality and Productivity Improvement, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Megan Salwei
- Department of Industrial and Systems Engineering, Wisconsin Institute for Healthcare Systems Engineering, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Douglas Wiegmann
- Department of Industrial and Systems Engineering, Wisconsin Institute for Healthcare Systems Engineering, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Roger L Brown
- School of Nursing, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Peter Kleinschmidt
- Department of Medicine, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | | | - Michael Pulia
- Department of Emergency Medicine, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Yudi Wang
- Department of Medicine, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Emily Wirkus
- Department of Population Health Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Brian Patterson
- Department of Emergency Medicine, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
Collapse
|
29
|
|