1
|
Matos J, Gallifant J, Chowdhury A, Economou-Zavlanos N, Charpignon ML, Gichoya J, Celi LA, Nazer L, King H, Wong AKI. A Clinician's Guide to Understanding Bias in Critical Clinical Prediction Models. Crit Care Clin 2024; 40:827-857. [PMID: 39218488 DOI: 10.1016/j.ccc.2024.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
This narrative review focuses on the role of clinical prediction models in supporting informed decision-making in critical care, emphasizing their 2 forms: traditional scores and artificial intelligence (AI)-based models. Acknowledging the potential for both types to embed biases, the authors underscore the importance of critical appraisal to increase our trust in models. The authors outline recommendations and critical care examples to manage risk of bias in AI models. The authors advocate for enhanced interdisciplinary training for clinicians, who are encouraged to explore various resources (books, journals, news Web sites, and social media) and events (Datathons) to deepen their understanding of risk of bias.
Collapse
Affiliation(s)
- João Matos
- University of Porto (FEUP), Porto, Portugal; Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal; Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jack Gallifant
- Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Critical Care, Guy's and St Thomas' NHS Trust, London, UK
| | - Anand Chowdhury
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, Duke University, Durham, NC, USA
| | | | - Marie-Laure Charpignon
- Institute for Data Systems and Society, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Judy Gichoya
- Department of Radiology, Emory University, Atlanta, GA, USA
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Lama Nazer
- Department of Pharmacy, King Hussein Cancer Center, Amman, Jordan
| | - Heather King
- Durham VA Health Care System, Health Services Research and Development, Center of Innovation to Accelerate Discovery and Practice Transformation (ADAPT), Durham, NC, USA; Department of Population Health Sciences, Duke University, Durham, NC, USA; Division of General Internal Medicine, Duke University, Duke University School of Medicine, Durham, NC, USA
| | - An-Kwok Ian Wong
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, Duke University, Durham, NC, USA; Department of Biostatistics and Bioinformatics, Duke University, Division of Translational Biomedical Informatics, Durham, NC, USA.
| |
Collapse
|
2
|
Schouten JS, Kalden MACM, van Twist E, Reiss IKM, Gommers DAMPJ, van Genderen ME, Taal HR. From bytes to bedside: a systematic review on the use and readiness of artificial intelligence in the neonatal and pediatric intensive care unit. Intensive Care Med 2024:10.1007/s00134-024-07629-8. [PMID: 39264415 DOI: 10.1007/s00134-024-07629-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 08/10/2024] [Indexed: 09/13/2024]
Abstract
PURPOSE Despite its promise to enhance patient outcomes and support clinical decision making, clinical use of artificial intelligence (AI) models at the bedside remains limited. Translation of advancements in AI research into tangible clinical benefits is necessary to improve neonatal and pediatric care for critically ill patients. This systematic review seeks to assess the maturity of AI models in neonatal and pediatric intensive care unit (NICU and PICU) treatment, and their risk of bias and objectives. METHODS We conducted a systematic search in Medline ALL, Embase, Web of Science Core Collection, Cochrane Central Register of Controlled Trials, and Google Scholar. Studies using AI models during NICU or PICU stay were eligible for inclusion. Study design, objective, dataset size, level of validation, risk of bias, and technological readiness of the models were extracted. RESULTS Out of the 1257 identified studies 262 were included. The majority of studies was conducted in the NICU (66%) and most had a high risk of bias (77%). An insufficient sample size was the main cause for this high risk of bias. No studies were identified that integrated an AI model in routine clinical practice and the majority of the studies remained in the prototyping and model development phase. CONCLUSION The majority of AI models remain within the testing and prototyping phase and have a high risk of bias. Bridging the gap between designing and clinical implementation of AI models is needed to warrant safe and trustworthy AI models. Specific guidelines and approaches can help improve clinical outcome with usage of AI.
Collapse
Affiliation(s)
- Janno S Schouten
- Department of Neonatal and Pediatric Intensive Care, Division of Neonatology, Erasmus MC, University Medical Center-Sophia Children's Hospital, Rotterdam, The Netherlands
- Datahub, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Melissa A C M Kalden
- Department of Neonatal and Pediatric Intensive Care, Division of Neonatology, Erasmus MC, University Medical Center-Sophia Children's Hospital, Rotterdam, The Netherlands
- Datahub, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Information and (Medical) Technology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Eris van Twist
- Department of Neonatal and Pediatric Intensive Care, Division of Pediatrics, Erasmus MC, University Medical Center-Sophia Children's Hospital, Rotterdam, The Netherlands
| | - Irwin K M Reiss
- Department of Neonatal and Pediatric Intensive Care, Division of Neonatology, Erasmus MC, University Medical Center-Sophia Children's Hospital, Rotterdam, The Netherlands
| | - Diederik A M P J Gommers
- Datahub, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Adult Intensive Care, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Michel E van Genderen
- Datahub, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Adult Intensive Care, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - H Rob Taal
- Department of Neonatal and Pediatric Intensive Care, Division of Neonatology, Erasmus MC, University Medical Center-Sophia Children's Hospital, Rotterdam, The Netherlands.
- Datahub, Erasmus MC, University Medical Center, Rotterdam, The Netherlands.
| |
Collapse
|
3
|
Mooghali M, Stroud AM, Yoo DW, Barry BA, Grimshaw AA, Ross JS, Zhu X, Miller JE. Trustworthy and ethical AI-enabled cardiovascular care: a rapid review. BMC Med Inform Decis Mak 2024; 24:247. [PMID: 39232725 PMCID: PMC11373417 DOI: 10.1186/s12911-024-02653-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 08/26/2024] [Indexed: 09/06/2024] Open
Abstract
BACKGROUND Artificial intelligence (AI) is increasingly used for prevention, diagnosis, monitoring, and treatment of cardiovascular diseases. Despite the potential for AI to improve care, ethical concerns and mistrust in AI-enabled healthcare exist among the public and medical community. Given the rapid and transformative recent growth of AI in cardiovascular care, to inform practice guidelines and regulatory policies that facilitate ethical and trustworthy use of AI in medicine, we conducted a literature review to identify key ethical and trust barriers and facilitators from patients' and healthcare providers' perspectives when using AI in cardiovascular care. METHODS In this rapid literature review, we searched six bibliographic databases to identify publications discussing transparency, trust, or ethical concerns (outcomes of interest) associated with AI-based medical devices (interventions of interest) in the context of cardiovascular care from patients', caregivers', or healthcare providers' perspectives. The search was completed on May 24, 2022 and was not limited by date or study design. RESULTS After reviewing 7,925 papers from six databases and 3,603 papers identified through citation chasing, 145 articles were included. Key ethical concerns included privacy, security, or confidentiality issues (n = 59, 40.7%); risk of healthcare inequity or disparity (n = 36, 24.8%); risk of patient harm (n = 24, 16.6%); accountability and responsibility concerns (n = 19, 13.1%); problematic informed consent and potential loss of patient autonomy (n = 17, 11.7%); and issues related to data ownership (n = 11, 7.6%). Major trust barriers included data privacy and security concerns, potential risk of patient harm, perceived lack of transparency about AI-enabled medical devices, concerns about AI replacing human aspects of care, concerns about prioritizing profits over patients' interests, and lack of robust evidence related to the accuracy and limitations of AI-based medical devices. Ethical and trust facilitators included ensuring data privacy and data validation, conducting clinical trials in diverse cohorts, providing appropriate training and resources to patients and healthcare providers and improving their engagement in different phases of AI implementation, and establishing further regulatory oversights. CONCLUSION This review revealed key ethical concerns and barriers and facilitators of trust in AI-enabled medical devices from patients' and healthcare providers' perspectives. Successful integration of AI into cardiovascular care necessitates implementation of mitigation strategies. These strategies should focus on enhanced regulatory oversight on the use of patient data and promoting transparency around the use of AI in patient care.
Collapse
Affiliation(s)
- Maryam Mooghali
- Section of General Internal Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA.
- Yale Center for Outcomes Research and Evaluation (CORE), 195 Church Street, New Haven, CT, 06510, USA.
| | - Austin M Stroud
- Biomedical Ethics Research Program, Mayo Clinic, Rochester, MN, USA
| | - Dong Whi Yoo
- School of Information, Kent State University, Kent, OH, USA
| | - Barbara A Barry
- Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA
- Division of Health Care Delivery Research, Mayo Clinic, Rochester, MN, USA
| | - Alyssa A Grimshaw
- Harvey Cushing/John Hay Whitney Medical Library, Yale University, New Haven, CT, USA
| | - Joseph S Ross
- Section of General Internal Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
- Department of Health Policy and Management, Yale School of Public Health, New Haven, CT, USA
| | - Xuan Zhu
- Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA
| | - Jennifer E Miller
- Section of General Internal Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
| |
Collapse
|
4
|
Isavand P, Aghamiri SS, Amin R. Applications of Multimodal Artificial Intelligence in Non-Hodgkin Lymphoma B Cells. Biomedicines 2024; 12:1753. [PMID: 39200217 PMCID: PMC11351272 DOI: 10.3390/biomedicines12081753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Revised: 07/22/2024] [Accepted: 08/01/2024] [Indexed: 09/02/2024] Open
Abstract
Given advancements in large-scale data and AI, integrating multimodal artificial intelligence into cancer research can enhance our understanding of tumor behavior by simultaneously processing diverse biomedical data types. In this review, we explore the potential of multimodal AI in comprehending B-cell non-Hodgkin lymphomas (B-NHLs). B-cell non-Hodgkin lymphomas (B-NHLs) represent a particular challenge in oncology due to tumor heterogeneity and the intricate ecosystem in which tumors develop. These complexities complicate diagnosis, prognosis, and therapy response, emphasizing the need to use sophisticated approaches to enhance personalized treatment strategies for better patient outcomes. Therefore, multimodal AI can be leveraged to synthesize critical information from available biomedical data such as clinical record, imaging, pathology and omics data, to picture the whole tumor. In this review, we first define various types of modalities, multimodal AI frameworks, and several applications in precision medicine. Then, we provide several examples of its usage in B-NHLs, for analyzing the complexity of the ecosystem, identifying immune biomarkers, optimizing therapy strategy, and its clinical applications. Lastly, we address the limitations and future directions of multimodal AI, highlighting the need to overcome these challenges for better clinical practice and application in healthcare.
Collapse
Affiliation(s)
- Pouria Isavand
- Department of Radiology, School of Medicine, Zanjan University of Medical Sciences, Zanjan 4513956184, Iran
| | | | - Rada Amin
- Department of Biochemistry, University of Nebraska, Lincoln, NE 68503, USA
| |
Collapse
|
5
|
Bienefeld N, Keller E, Grote G. Human-AI Teaming in Critical Care: A Comparative Analysis of Data Scientists' and Clinicians' Perspectives on AI Augmentation and Automation. J Med Internet Res 2024; 26:e50130. [PMID: 39038285 DOI: 10.2196/50130] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 05/03/2024] [Accepted: 05/23/2024] [Indexed: 07/24/2024] Open
Abstract
BACKGROUND Artificial intelligence (AI) holds immense potential for enhancing clinical and administrative health care tasks. However, slow adoption and implementation challenges highlight the need to consider how humans can effectively collaborate with AI within broader socio-technical systems in health care. OBJECTIVE In the example of intensive care units (ICUs), we compare data scientists' and clinicians' assessments of the optimal utilization of human and AI capabilities by determining suitable levels of human-AI teaming for safely and meaningfully augmenting or automating 6 core tasks. The goal is to provide actionable recommendations for policy makers and health care practitioners regarding AI design and implementation. METHODS In this multimethod study, we combine a systematic task analysis across 6 ICUs with an international Delphi survey involving 19 health data scientists from the industry and academia and 61 ICU clinicians (25 physicians and 36 nurses) to define and assess optimal levels of human-AI teaming (level 1=no performance benefits; level 2=AI augments human performance; level 3=humans augment AI performance; level 4=AI performs without human input). Stakeholder groups also considered ethical and social implications. RESULTS Both stakeholder groups chose level 2 and 3 human-AI teaming for 4 out of 6 core tasks in the ICU. For one task (monitoring), level 4 was the preferred design choice. For the task of patient interactions, both data scientists and clinicians agreed that AI should not be used regardless of technological feasibility due to the importance of the physician-patient and nurse-patient relationship and ethical concerns. Human-AI design choices rely on interpretability, predictability, and control over AI systems. If these conditions are not met and AI performs below human-level reliability, a reduction to level 1 or shifting accountability away from human end users is advised. If AI performs at or beyond human-level reliability and these conditions are not met, shifting to level 4 automation should be considered to ensure safe and efficient human-AI teaming. CONCLUSIONS By considering the sociotechnical system and determining appropriate levels of human-AI teaming, our study showcases the potential for improving the safety and effectiveness of AI usage in ICUs and broader health care settings. Regulatory measures should prioritize interpretability, predictability, and control if clinicians hold full accountability. Ethical and social implications must be carefully evaluated to ensure effective collaboration between humans and AI, particularly considering the most recent advancements in generative AI.
Collapse
Affiliation(s)
- Nadine Bienefeld
- Department of Management, Technology, and Economics, ETH Zurich, Zurich, Switzerland
| | - Emanuela Keller
- Institute of Intensive Care Medicine, Department of Neurosurgery, University Hospital and University of Zurich, Zurich, Switzerland
| | - Gudela Grote
- Department of Management, Technology, and Economics, ETH Zurich, Zurich, Switzerland
| |
Collapse
|
6
|
Patel DJ, Chaudhari K, Acharya N, Shrivastava D, Muneeba S. Artificial Intelligence in Obstetrics and Gynecology: Transforming Care and Outcomes. Cureus 2024; 16:e64725. [PMID: 39156405 PMCID: PMC11329325 DOI: 10.7759/cureus.64725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Accepted: 07/17/2024] [Indexed: 08/20/2024] Open
Abstract
The integration of artificial intelligence (AI) in obstetrics and gynecology (OB/GYN) is revolutionizing the landscape of women's healthcare. This review article explores the transformative impact of AI technologies on the diagnosis, treatment, and management of obstetric and gynecological conditions. We examine key advancements in AI-driven imaging techniques, predictive analytics, and personalized medicine, highlighting their roles in enhancing prenatal care, improving maternal and fetal outcomes, and optimizing gynecological interventions. The article also addresses the challenges and ethical considerations associated with the implementation of AI in clinical practice. This paper highlights the potential of AI to greatly improve the standard of care in OB/GYN, ultimately leading to better health outcomes for women, by offering a thorough overview of present AI uses and future prospects.
Collapse
Affiliation(s)
- Dharmesh J Patel
- Department of Obstetrics and Gynecology, Jawaharlal Nehru Medical College, Datta Meghe Institute of Higher Education & Research, Wardha, IND
| | - Kamlesh Chaudhari
- Department of Obstetrics and Gynecology, Jawaharlal Nehru Medical College, Datta Meghe Institute of Higher Education & Research, Wardha, IND
| | - Neema Acharya
- Department of Obstetrics and Gynecology, Jawaharlal Nehru Medical College, Datta Meghe Institute of Higher Education & Research, Wardha, IND
| | - Deepti Shrivastava
- Department of Obstetrics and Gynecology, Jawaharlal Nehru Medical College, Datta Meghe Institute of Higher Education & Research, Wardha, IND
| | - Shaikh Muneeba
- Department of Obstetrics and Gynecology, Jawaharlal Nehru Medical College, Datta Meghe Institute of Higher Education & Research, Wardha, IND
| |
Collapse
|
7
|
Johns E, Alkanj A, Beck M, Dal Mas L, Gourieux B, Sauleau EA, Michel B. Using machine learning or deep learning models in a hospital setting to detect inappropriate prescriptions: a systematic review. Eur J Hosp Pharm 2024; 31:289-294. [PMID: 38050067 PMCID: PMC11265547 DOI: 10.1136/ejhpharm-2023-003857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 11/07/2023] [Indexed: 12/06/2023] Open
Abstract
OBJECTIVES The emergence of artificial intelligence (AI) is catching the interest of hospital pharmacists. A massive collection of health data is now available to train AI models and hold the promise of disrupting codes and practices. The objective of this systematic review was to examine the state of the art of machine learning or deep learning models that detect inappropriate hospital medication orders. METHODS A systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. MEDLINE and Embase databases were searched from inception to May 2023. Studies were included if they reported and described an AI model intended for use by clinical pharmacists in hospitals. Risk of bias was assessed using the Prediction model Risk Of Bias ASsessment Tool (PROBAST). RESULTS 13 articles were selected after review: 12 studies were judged to have high risk of bias; 11 studies were published between 2020 and 2023; 8 were conducted in North America and Asia; 6 analysed orders and detected inappropriate prescriptions according to patient profiles and medication orders; and 7 detected specific inappropriate prescriptions, such as detecting antibiotic resistance, dosage abnormality in prescriptions, high alert drugs errors from prescriptions or predicting the risk of adverse drug events. Various AI models were used, mainly supervised learning techniques. The training datasets used were very heterogeneous; the length of study varied from 2 weeks to 7 years and the number of prescription orders analysed went from 31 to 5 804 192. CONCLUSIONS This systematic review points out that, to date, few original research studies report AI tools based on machine or deep learning in the field of hospital clinical pharmacy. However, these original articles, while preliminary, highlighted the potential value of integrating AI into clinical hospital pharmacy practice.
Collapse
Affiliation(s)
- Erin Johns
- Direction de la Qualité, de la Performance et de l'Innovation, Agence Régionale de Santé Grand Est Site de Strasbourg, Strasbourg, Grand Est, France
- IMAGeS, Laboratoire des Sciences de l'Ingénieur de l'Informatique et de l'Imagerie, Illkirch, Grand Est, France
| | - Ahmad Alkanj
- Laboratoire de Pharmacologie et de Toxicologie Neurocardiovasculaire, Université de Strasbourg, Strasbourg, Grand Est, France
| | - Morgane Beck
- Direction de la Qualité, de la Performance et de l'Innovation, Agence Régionale de Santé Grand Est Site de Strasbourg, Strasbourg, Grand Est, France
| | - Laurent Dal Mas
- Direction de la Qualité, de la Performance et de l'Innovation, Agence Régionale de Santé Grand Est Site de Strasbourg, Strasbourg, Grand Est, France
| | - Benedicte Gourieux
- Laboratoire de Pharmacologie et de Toxicologie Neurocardiovasculaire, Université de Strasbourg, Strasbourg, Grand Est, France
- Service Pharmacie - Stérilisation, Les Hopitaux Universitaires de Strasbourg, Strasbourg, Grand Est, France
| | - Erik-André Sauleau
- IMAGeS, Laboratoire des Sciences de l'Ingénieur de l'Informatique et de l'Imagerie, Illkirch, Grand Est, France
- Département de Santé Publique - Groupe Méthodes Recherche Clinique, Les Hopitaux Universitaires de Strasbourg, Strasbourg, Grand Est, France
| | - Bruno Michel
- Laboratoire de Pharmacologie et de Toxicologie Neurocardiovasculaire, Université de Strasbourg, Strasbourg, Grand Est, France
- Service Pharmacie - Stérilisation, Les Hopitaux Universitaires de Strasbourg, Strasbourg, Grand Est, France
| |
Collapse
|
8
|
Agius R, Riis-Jensen AC, Wimmer B, da Cunha-Bang C, Murray DD, Poulsen CB, Bertelsen MB, Schwartz B, Lundgren JD, Langberg H, Niemann CU. Deployment and validation of the CLL treatment infection model adjoined to an EHR system. NPJ Digit Med 2024; 7:147. [PMID: 38839920 PMCID: PMC11153589 DOI: 10.1038/s41746-024-01132-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 05/08/2024] [Indexed: 06/07/2024] Open
Abstract
Research algorithms are seldom externally validated or integrated into clinical practice, leaving unknown challenges in deployment. In such efforts, one needs to address challenges related to data harmonization, the performance of an algorithm in unforeseen missingness, automation and monitoring of predictions, and legal frameworks. We here describe the deployment of a high-dimensional data-driven decision support model into an EHR and derive practical guidelines informed by this deployment that includes the necessary processes, stakeholders and design requirements for a successful deployment. For this, we describe our deployment of the chronic lymphocytic leukemia (CLL) treatment infection model (CLL-TIM) as a stand-alone platform adjoined to an EPIC-based Danish Electronic Health Record (EHR), with the presentation of personalized predictions in a clinical context. CLL-TIM is an 84-variable data-driven prognostic model utilizing 7-year medical patient records and predicts the 2-year risk composite outcome of infection and/or treatment post-CLL diagnosis. As an independent validation cohort for this deployment, we used a retrospective population-based cohort of patients diagnosed with CLL from 2018 onwards (n = 1480). Unexpectedly high levels of missingness for key CLL-TIM variables were exhibited upon deployment. High dimensionality, with the handling of missingness, and predictive confidence were critical design elements that enabled trustworthy predictions and thus serves as a priority for prognostic models seeking deployment in new EHRs. Our setup for deployment, including automation and monitoring into EHR that meets Medical Device Regulations, may be used as step-by-step guidelines for others aiming at designing and deploying research algorithms into clinical practice.
Collapse
Affiliation(s)
- Rudi Agius
- Department of Hematology, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | | | - Bettina Wimmer
- SP Sundhedsdata, The Data Unit, Capital Region of Denmark, Copenhagen, Denmark
| | - Caspar da Cunha-Bang
- Department of Hematology, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | - Daniel Dawson Murray
- Center of Excellence for Health, Immunity, and Infections (CHIP), Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | | | | | - Berit Schwartz
- Rigshospitalets Innoovationscenter, Copenhagen University Hospital Rigshopsitalet, Copenhagen, Denmark
| | - Jens Dilling Lundgren
- Center of Excellence for Health, Immunity, and Infections (CHIP), Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | - Henning Langberg
- Rigshospitalets Innoovationscenter, Copenhagen University Hospital Rigshopsitalet, Copenhagen, Denmark
| | - Carsten Utoft Niemann
- Department of Hematology, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark.
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
9
|
Agafonov O, Babic A, Sousa S, Alagaratnam S. Editorial: Trustworthy AI for healthcare. Front Digit Health 2024; 6:1427233. [PMID: 38887594 PMCID: PMC11180822 DOI: 10.3389/fdgth.2024.1427233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 05/06/2024] [Indexed: 06/20/2024] Open
Affiliation(s)
- Oleg Agafonov
- Healthcare Programme, Group Research and Development, DNV, Høvik, Norway
| | - Aleksandar Babic
- Healthcare Programme, Group Research and Development, DNV, Høvik, Norway
| | - Sonia Sousa
- School of Digital Technologies, Tallinn University, Tallinn, Estonia
| | | |
Collapse
|
10
|
Kuziemsky CE, Chrimes D, Minshall S, Mannerow M, Lau F. AI Quality Standards in Health Care: Rapid Umbrella Review. J Med Internet Res 2024; 26:e54705. [PMID: 38776538 PMCID: PMC11153979 DOI: 10.2196/54705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Revised: 04/03/2024] [Accepted: 04/04/2024] [Indexed: 05/25/2024] Open
Abstract
BACKGROUND In recent years, there has been an upwelling of artificial intelligence (AI) studies in the health care literature. During this period, there has been an increasing number of proposed standards to evaluate the quality of health care AI studies. OBJECTIVE This rapid umbrella review examines the use of AI quality standards in a sample of health care AI systematic review articles published over a 36-month period. METHODS We used a modified version of the Joanna Briggs Institute umbrella review method. Our rapid approach was informed by the practical guide by Tricco and colleagues for conducting rapid reviews. Our search was focused on the MEDLINE database supplemented with Google Scholar. The inclusion criteria were English-language systematic reviews regardless of review type, with mention of AI and health in the abstract, published during a 36-month period. For the synthesis, we summarized the AI quality standards used and issues noted in these reviews drawing on a set of published health care AI standards, harmonized the terms used, and offered guidance to improve the quality of future health care AI studies. RESULTS We selected 33 review articles published between 2020 and 2022 in our synthesis. The reviews covered a wide range of objectives, topics, settings, designs, and results. Over 60 AI approaches across different domains were identified with varying levels of detail spanning different AI life cycle stages, making comparisons difficult. Health care AI quality standards were applied in only 39% (13/33) of the reviews and in 14% (25/178) of the original studies from the reviews examined, mostly to appraise their methodological or reporting quality. Only a handful mentioned the transparency, explainability, trustworthiness, ethics, and privacy aspects. A total of 23 AI quality standard-related issues were identified in the reviews. There was a recognized need to standardize the planning, conduct, and reporting of health care AI studies and address their broader societal, ethical, and regulatory implications. CONCLUSIONS Despite the growing number of AI standards to assess the quality of health care AI studies, they are seldom applied in practice. With increasing desire to adopt AI in different health topics, domains, and settings, practitioners and researchers must stay abreast of and adapt to the evolving landscape of health care AI quality standards and apply these standards to improve the quality of their AI studies.
Collapse
Affiliation(s)
| | - Dillon Chrimes
- School of Health Information Science, University of Victoria, Victoria, BC, Canada
| | - Simon Minshall
- School of Health Information Science, University of Victoria, Victoria, BC, Canada
| | | | - Francis Lau
- School of Health Information Science, University of Victoria, Victoria, BC, Canada
| |
Collapse
|
11
|
Ruksakulpiwat S, Thorngthip S, Niyomyart A, Benjasirisan C, Phianhasin L, Aldossary H, Ahmed BH, Samai T. A Systematic Review of the Application of Artificial Intelligence in Nursing Care: Where are We, and What's Next? J Multidiscip Healthc 2024; 17:1603-1616. [PMID: 38628616 PMCID: PMC11020344 DOI: 10.2147/jmdh.s459946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 03/05/2024] [Indexed: 04/19/2024] Open
Abstract
Background Integrating Artificial Intelligence (AI) into healthcare has transformed the landscape of patient care and healthcare delivery. Despite this, there remains a notable gap in the existing literature synthesizing the comprehensive understanding of AI's utilization in nursing care. Objective This systematic review aims to synthesize the available evidence to comprehensively understand the application of AI in nursing care. Methods Studies published between January 2019 and December 2023, identified through CINAHL Plus with Full Text, Web of Science, PubMed, and Medline, were included in this review. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines guided the identification, screening, exclusion, and inclusion of articles. The convergent integrated analysis framework, as proposed by the Joanna Briggs Institute, was employed to synthesize data from the included studies for theme generation. Results A total of 337 records were identified from databases. Among them, 35 duplicates were removed, and 302 records underwent eligibility screening. After applying inclusion and exclusion criteria, eleven studies were deemed eligible and included in this review. Through data synthesis of these studies, six themes pertaining to the use of AI in nursing care were identified: 1) Risk Identification, 2) Health Assessment, 3) Patient Classification, 4) Research Development, 5) Improved Care Delivery and Medical Records, and 6) Developing a Nursing Care Plan. Conclusion This systematic review contributes valuable insights into the multifaceted applications of AI in nursing care. Through the synthesis of data from the included studies, six distinct themes emerged. These findings not only consolidate the current knowledge base but also underscore the diverse ways in which AI is shaping and improving nursing care practices.
Collapse
Affiliation(s)
- Suebsarn Ruksakulpiwat
- Department of Medical Nursing, Faculty of Nursing, Mahidol University, Bangkok, Thailand
| | - Sutthinee Thorngthip
- Department of Nursing Siriraj Hospital, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Atsadaporn Niyomyart
- Ramathibodi School of Nursing, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | | | - Lalipat Phianhasin
- Department of Medical Nursing, Faculty of Nursing, Mahidol University, Bangkok, Thailand
| | - Heba Aldossary
- Department of Nursing, Prince Sultan Military College of Health Sciences, Dammam, Saudi Arabia
| | - Bootan Hasan Ahmed
- Frances Payne Bolton School of Nursing, Case Western Reserve University, Cleveland, OH, USA
| | - Thanistha Samai
- Department of Public Health Nursing, Faculty of Nursing, Mahidol University, Nakhon Pathom, Thailand
| |
Collapse
|
12
|
Boag W, Hasan A, Kim JY, Revoir M, Nichols M, Ratliff W, Gao M, Zilberstein S, Samad Z, Hoodbhoy Z, Ali M, Khan NS, Patel M, Balu S, Sendak M. The algorithm journey map: a tangible approach to implementing AI solutions in healthcare. NPJ Digit Med 2024; 7:87. [PMID: 38594344 PMCID: PMC11003994 DOI: 10.1038/s41746-024-01061-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 02/19/2024] [Indexed: 04/11/2024] Open
Abstract
When integrating AI tools in healthcare settings, complex interactions between technologies and primary users are not always fully understood or visible. This deficient and ambiguous understanding hampers attempts by healthcare organizations to adopt AI/ML, and it also creates new challenges for researchers to identify opportunities for simplifying adoption and developing best practices for the use of AI-based solutions. Our study fills this gap by documenting the process of designing, building, and maintaining an AI solution called SepsisWatch at Duke University Health System. We conducted 20 interviews with the team of engineers and scientists that led the multi-year effort to build the tool, integrate it into practice, and maintain the solution. This "Algorithm Journey Map" enumerates all social and technical activities throughout the AI solution's procurement, development, integration, and full lifecycle management. In addition to mapping the "who?" and "what?" of the adoption of the AI tool, we also show several 'lessons learned' throughout the algorithm journey maps including modeling assumptions, stakeholder inclusion, and organizational structure. In doing so, we identify generalizable insights about how to recognize and navigate barriers to AI/ML adoption in healthcare settings. We expect that this effort will further the development of best practices for operationalizing and sustaining ethical principles-in algorithmic systems.
Collapse
Affiliation(s)
- William Boag
- Duke Institute for Health Innovation, Durham, NC, USA
| | - Alifia Hasan
- Duke Institute for Health Innovation, Durham, NC, USA
| | - Jee Young Kim
- Duke Institute for Health Innovation, Durham, NC, USA
| | - Mike Revoir
- Duke Institute for Health Innovation, Durham, NC, USA
| | | | | | - Michael Gao
- Duke Institute for Health Innovation, Durham, NC, USA
| | - Shira Zilberstein
- Duke Institute for Health Innovation, Durham, NC, USA
- Harvard University, Cambridge, MA, USA
| | | | | | | | | | - Manesh Patel
- Duke University School of Medicine, Durham, NC, USA
| | - Suresh Balu
- Duke Institute for Health Innovation, Durham, NC, USA
| | - Mark Sendak
- Duke Institute for Health Innovation, Durham, NC, USA.
| |
Collapse
|
13
|
Campion JR, O'Connor DB, Lahiff C. Human-artificial intelligence interaction in gastrointestinal endoscopy. World J Gastrointest Endosc 2024; 16:126-135. [PMID: 38577646 PMCID: PMC10989254 DOI: 10.4253/wjge.v16.i3.126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/31/2023] [Revised: 01/18/2024] [Accepted: 02/23/2024] [Indexed: 03/14/2024] Open
Abstract
The number and variety of applications of artificial intelligence (AI) in gastrointestinal (GI) endoscopy is growing rapidly. New technologies based on machine learning (ML) and convolutional neural networks (CNNs) are at various stages of development and deployment to assist patients and endoscopists in preparing for endoscopic procedures, in detection, diagnosis and classification of pathology during endoscopy and in confirmation of key performance indicators. Platforms based on ML and CNNs require regulatory approval as medical devices. Interactions between humans and the technologies we use are complex and are influenced by design, behavioural and psychological elements. Due to the substantial differences between AI and prior technologies, important differences may be expected in how we interact with advice from AI technologies. Human–AI interaction (HAII) may be optimised by developing AI algorithms to minimise false positives and designing platform interfaces to maximise usability. Human factors influencing HAII may include automation bias, alarm fatigue, algorithm aversion, learning effect and deskilling. Each of these areas merits further study in the specific setting of AI applications in GI endoscopy and professional societies should engage to ensure that sufficient emphasis is placed on human-centred design in development of new AI technologies.
Collapse
Affiliation(s)
- John R Campion
- Department of Gastroenterology, Mater Misericordiae University Hospital, Dublin D07 AX57, Ireland
- School of Medicine, University College Dublin, Dublin D04 C7X2, Ireland
| | - Donal B O'Connor
- Department of Surgery, Trinity College Dublin, Dublin D02 R590, Ireland
| | - Conor Lahiff
- Department of Gastroenterology, Mater Misericordiae University Hospital, Dublin D07 AX57, Ireland
- School of Medicine, University College Dublin, Dublin D04 C7X2, Ireland
| |
Collapse
|
14
|
Reddy S. Generative AI in healthcare: an implementation science informed translational path on application, integration and governance. Implement Sci 2024; 19:27. [PMID: 38491544 PMCID: PMC10941464 DOI: 10.1186/s13012-024-01357-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 03/06/2024] [Indexed: 03/18/2024] Open
Abstract
BACKGROUND Artificial intelligence (AI), particularly generative AI, has emerged as a transformative tool in healthcare, with the potential to revolutionize clinical decision-making and improve health outcomes. Generative AI, capable of generating new data such as text and images, holds promise in enhancing patient care, revolutionizing disease diagnosis and expanding treatment options. However, the utility and impact of generative AI in healthcare remain poorly understood, with concerns around ethical and medico-legal implications, integration into healthcare service delivery and workforce utilisation. Also, there is not a clear pathway to implement and integrate generative AI in healthcare delivery. METHODS This article aims to provide a comprehensive overview of the use of generative AI in healthcare, focusing on the utility of the technology in healthcare and its translational application highlighting the need for careful planning, execution and management of expectations in adopting generative AI in clinical medicine. Key considerations include factors such as data privacy, security and the irreplaceable role of clinicians' expertise. Frameworks like the technology acceptance model (TAM) and the Non-Adoption, Abandonment, Scale-up, Spread and Sustainability (NASSS) model are considered to promote responsible integration. These frameworks allow anticipating and proactively addressing barriers to adoption, facilitating stakeholder participation and responsibly transitioning care systems to harness generative AI's potential. RESULTS Generative AI has the potential to transform healthcare through automated systems, enhanced clinical decision-making and democratization of expertise with diagnostic support tools providing timely, personalized suggestions. Generative AI applications across billing, diagnosis, treatment and research can also make healthcare delivery more efficient, equitable and effective. However, integration of generative AI necessitates meticulous change management and risk mitigation strategies. Technological capabilities alone cannot shift complex care ecosystems overnight; rather, structured adoption programs grounded in implementation science are imperative. CONCLUSIONS It is strongly argued in this article that generative AI can usher in tremendous healthcare progress, if introduced responsibly. Strategic adoption based on implementation science, incremental deployment and balanced messaging around opportunities versus limitations helps promote safe, ethical generative AI integration. Extensive real-world piloting and iteration aligned to clinical priorities should drive development. With conscientious governance centred on human wellbeing over technological novelty, generative AI can enhance accessibility, affordability and quality of care. As these models continue advancing rapidly, ongoing reassessment and transparent communication around their strengths and weaknesses remain vital to restoring trust, realizing positive potential and, most importantly, improving patient outcomes.
Collapse
Affiliation(s)
- Sandeep Reddy
- Deakin School of Medicine, Waurn Ponds, Geelong, VIC, 3215, Australia.
| |
Collapse
|
15
|
Hashimoto DA, Sambasastry SK, Singh V, Kurada S, Altieri M, Yoshida T, Madani A, Jogan M. A foundation for evaluating the surgical artificial intelligence literature. EUROPEAN JOURNAL OF SURGICAL ONCOLOGY 2024:108014. [PMID: 38360498 DOI: 10.1016/j.ejso.2024.108014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Revised: 01/06/2024] [Accepted: 02/09/2024] [Indexed: 02/17/2024]
Abstract
With increasing growth in applications of artificial intelligence (AI) in surgery, it has become essential for surgeons to gain a foundation of knowledge to critically appraise the scientific literature, commercial claims regarding products, and regulatory and legal frameworks that govern the development and use of AI. This guide offers surgeons a framework with which to evaluate manuscripts that incorporate the use of AI. It provides a glossary of common terms, an overview of prerequisite knowledge to maximize understanding of methodology, and recommendations on how to carefully consider each element of a manuscript to assess the quality of the data on which an algorithm was trained, the appropriateness of the methodological approach, the potential for reproducibility of the experiment, and the applicability to surgical practice, including considerations on generalizability and scalability.
Collapse
Affiliation(s)
- Daniel A Hashimoto
- Penn Computer Assisted Surgery and Outcomes Laboratory, Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA; Global Surgical AI Collaborative, Toronto, ON, USA.
| | - Sai Koushik Sambasastry
- Penn Computer Assisted Surgery and Outcomes Laboratory, Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Vivek Singh
- Penn Computer Assisted Surgery and Outcomes Laboratory, Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Sruthi Kurada
- Penn Computer Assisted Surgery and Outcomes Laboratory, Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Maria Altieri
- Penn Computer Assisted Surgery and Outcomes Laboratory, Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Global Surgical AI Collaborative, Toronto, ON, USA
| | - Takuto Yoshida
- Surgical AI Research Academy, Department of Surgery, University Health Network, Toronto, ON, USA
| | - Amin Madani
- Global Surgical AI Collaborative, Toronto, ON, USA; Surgical AI Research Academy, Department of Surgery, University Health Network, Toronto, ON, USA
| | - Matjaz Jogan
- Penn Computer Assisted Surgery and Outcomes Laboratory, Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
16
|
Dijkstra H, van de Kuit A, de Groot T, Canta O, Groot OQ, Oosterhoff JH, Doornberg JN. Systematic review of machine-learning models in orthopaedic trauma. Bone Jt Open 2024; 5:9-19. [PMID: 38226447 PMCID: PMC10790183 DOI: 10.1302/2633-1462.51.bjo-2023-0095.r1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/17/2024] Open
Abstract
Aims Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and critical appraisal to peer-reviewed guidelines is lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST) tool. Methods A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. The TRIPOD statement assessed transparent reporting and the PROBAST tool the risk of bias. Results A total of 40 studies reported on training and internal validation; four studies performed both development and external validation, and one study performed only external validation. The most commonly reported outcomes were mortality (33%, 15/45) and length of hospital stay (9%, 4/45), and the majority of prediction models were developed in the hip fracture population (60%, 27/45). The overall median completeness for the TRIPOD statement was 62% (interquartile range 30 to 81%). The overall risk of bias in the PROBAST tool was low in 24% (11/45), high in 69% (31/45), and unclear in 7% (3/45) of the studies. High risk of bias was mainly due to analysis domain concerns including small datasets with low number of outcomes, complete-case analysis in case of missing data, and no reporting of performance measures. Conclusion The results of this study showed that despite a myriad of potential clinically useful applications, a substantial part of ML studies in orthopaedic trauma lack transparent reporting, and are at high risk of bias. These problems must be resolved by following established guidelines to instil confidence in ML models among patients and clinicians. Otherwise, there will remain a sizeable gap between the development of ML prediction models and their clinical application in our day-to-day orthopaedic trauma practice.
Collapse
Affiliation(s)
- Hidde Dijkstra
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
- University Center for Geriatric Medicine, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Anouk van de Kuit
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
| | - Tom de Groot
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
- Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Olga Canta
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
| | - Olivier Q. Groot
- Department of Orthopaedic Surgery, University Medical Centre Utrecht, University of Utrecht, Utrecht, Netherlands
| | - Jacobien H. Oosterhoff
- Department of Engineering Systems & Services, Faculty Technology Policy and Management, Delft University of Technology, Delft, Netherlands
| | - Job N. Doornberg
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
- Department of Orthopaedic Trauma Surgery, Flinders Medical Center, Flinders University, Adelaide, Australia
| | | |
Collapse
|
17
|
Derbal Y. Adaptive Cancer Therapy in the Age of Generative Artificial Intelligence. Cancer Control 2024; 31:10732748241264704. [PMID: 38897721 PMCID: PMC11189021 DOI: 10.1177/10732748241264704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 05/17/2024] [Accepted: 06/06/2024] [Indexed: 06/21/2024] Open
Abstract
Therapeutic resistance is a major challenge facing the design of effective cancer treatments. Adaptive cancer therapy is in principle the most viable approach to manage cancer's adaptive dynamics through drug combinations with dose timing and modulation. However, there are numerous open issues facing the clinical success of adaptive therapy. Chief among these issues is the feasibility of real-time predictions of treatment response which represent a bedrock requirement of adaptive therapy. Generative artificial intelligence has the potential to learn prediction models of treatment response from clinical, molecular, and radiomics data about patients and their treatments. The article explores this potential through a proposed integration model of Generative Pre-Trained Transformers (GPTs) in a closed loop with adaptive treatments to predict the trajectories of disease progression. The conceptual model and the challenges facing its realization are discussed in the broader context of artificial intelligence integration in oncology.
Collapse
Affiliation(s)
- Youcef Derbal
- Ted Rogers School of Information Technology Management, Toronto Metropolitan University, Toronto, ON, Canada
| |
Collapse
|
18
|
Bélisle-Pipon JC, Powell M, English R, Malo MF, Ravitsky V, Bensoussan Y. Stakeholder perspectives on ethical and trustworthy voice AI in health care. Digit Health 2024; 10:20552076241260407. [PMID: 39055787 PMCID: PMC11271113 DOI: 10.1177/20552076241260407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Accepted: 05/21/2024] [Indexed: 07/27/2024] Open
Abstract
Objective Voice as a health biomarker using artificial intelligence (AI) is gaining momentum in research. The noninvasiveness of voice data collection through accessible technology (such as smartphones, telehealth, and ambient recordings) or within clinical contexts means voice AI may help address health disparities and promote the inclusion of marginalized communities. However, the development of AI-ready voice datasets free from bias and discrimination is a complex task. The objective of this study is to better understand the perspectives of engaged and interested stakeholders regarding ethical and trustworthy voice AI, to inform both further ethical inquiry and technology innovation. Methods A questionnaire was administered to voice AI experts, clinicians, scholars, patients, trainees, and policy-makers who participated at the 2023 Voice AI Symposium organized by the Bridge2AI-Voice AI Consortium. The survey used a mix of Likert scale, ranking and open-ended questions. A total of 27 stakeholders participated in the study. Results The main results of the study are the identification of priorities in terms of ethical issues, an initial definition of ethically sourced data for voice AI, insights into the use of synthetic voice data, and proposals for acting on the trustworthiness of voice AI. The study shows a diversity of perspectives and adds nuance to the planning and development of ethical and trustworthy voice AI. Conclusions This study represents the first stakeholder survey related to voice as a biomarker of health published to date. This study sheds light on the critical importance of ethics and trustworthiness in the development of voice AI technologies for health applications.
Collapse
Affiliation(s)
| | - Maria Powell
- Vanderbilt University Medical Center, Department of Otolaryngology-Head & Neck Surgery, Nashville, TN, Canada
| | - Renee English
- Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | | | - Vardit Ravitsky
- Hastings Center, Garrison, NY, USA
- Department of Global Health and Social Medicine, Harvard University, Cambridge, MA, USA
| | | | - Yael Bensoussan
- Department of Otolaryngology-Head & Neck Surgery, University of South Florida, Tampa, FL, USA
| |
Collapse
|
19
|
de Groot TM, Ramsey D, Groot OQ, Fourman M, Karhade AV, Twining PK, Berner EA, Fenn BP, Collins AK, Raskin K, Lozano S, Newman E, Ferrone M, Doornberg JN, Schwab JH. Does the SORG Machine-learning Algorithm for Extremity Metastases Generalize to a Contemporary Cohort of Patients? Temporal Validation From 2016 to 2020. Clin Orthop Relat Res 2023; 481:2419-2430. [PMID: 37229565 PMCID: PMC10642892 DOI: 10.1097/corr.0000000000002698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/15/2023] [Accepted: 04/21/2023] [Indexed: 05/27/2023]
Abstract
BACKGROUND The ability to predict survival accurately in patients with osseous metastatic disease of the extremities is vital for patient counseling and guiding surgical intervention. We, the Skeletal Oncology Research Group (SORG), previously developed a machine-learning algorithm (MLA) based on data from 1999 to 2016 to predict 90-day and 1-year survival of surgically treated patients with extremity bone metastasis. As treatment regimens for oncology patients continue to evolve, this SORG MLA-driven probability calculator requires temporal reassessment of its accuracy. QUESTION/PURPOSE Does the SORG-MLA accurately predict 90-day and 1-year survival in patients who receive surgical treatment for a metastatic long-bone lesion in a more recent cohort of patients treated between 2016 and 2020? METHODS Between 2017 and 2021, we identified 674 patients 18 years and older through the ICD codes for secondary malignant neoplasm of bone and bone marrow and CPT codes for completed pathologic fractures or prophylactic treatment of an impending fracture. We excluded 40% (268 of 674) of patients, including 18% (118) who did not receive surgery; 11% (72) who had metastases in places other than the long bones of the extremities; 3% (23) who received treatment other than intramedullary nailing, endoprosthetic reconstruction, or dynamic hip screw; 3% (23) who underwent revision surgery, 3% (17) in whom there was no tumor, and 2% (15) who were lost to follow-up within 1 year. Temporal validation was performed using data on 406 patients treated surgically for bony metastatic disease of the extremities from 2016 to 2020 at the same two institutions where the MLA was developed. Variables used to predict survival in the SORG algorithm included perioperative laboratory values, tumor characteristics, and general demographics. To assess the models' discrimination, we computed the c-statistic, commonly referred to as the area under the receiver operating characteristic (AUC) curve for binary classification. This value ranged from 0.5 (representing chance-level performance) to 1.0 (indicating excellent discrimination) Generally, an AUC of 0.75 is considered high enough for use in clinical practice. To evaluate the agreement between predicted and observed outcomes, a calibration plot was used, and the calibration slope and intercept were calculated. Perfect calibration would result in a slope of 1 and intercept of 0. For overall performance, the Brier score and null-model Brier score were determined. The Brier score can range from 0 (representing perfect prediction) to 1 (indicating the poorest prediction). Proper interpretation of the Brier score necessitates a comparison with the null-model Brier score, which represents the score for an algorithm that predicts a probability equal to the population prevalence of the outcome for each patient. Finally, a decision curve analysis was conducted to compare the potential net benefit of the algorithm with other decision-support methods, such as treating all or none of the patients. Overall, 90-day and 1-year mortality were lower in the temporal validation cohort than in the development cohort (90 day: 23% versus 28%; p < 0.001, and 1 year: 51% versus 59%; p<0.001). RESULTS Overall survival of the patients in the validation cohort improved from 28% mortality at the 90-day timepoint in the cohort on which the model was trained to 23%, and 59% mortality at the 1-year timepoint to 51%. The AUC was 0.78 (95% CI 0.72 to 0.82) for 90-day survival and 0.75 (95% CI 0.70 to 0.79) for 1-year survival, indicating the model could distinguish the two outcomes reasonably. For the 90-day model, the calibration slope was 0.71 (95% CI 0.53 to 0.89), and the intercept was -0.66 (95% CI -0.94 to -0.39), suggesting the predicted risks were overly extreme, and that in general, the risk of the observed outcome was overestimated. For the 1-year model, the calibration slope was 0.73 (95% CI 0.56 to 0.91) and the intercept was -0.67 (95% CI -0.90 to -0.43). With respect to overall performance, the model's Brier scores for the 90-day and 1-year models were 0.16 and 0.22. These scores were higher than the Brier scores of internal validation of the development study (0.13 and 0.14) models, indicating the models' performance has declined over time. CONCLUSION The SORG MLA to predict survival after surgical treatment of extremity metastatic disease showed decreased performance on temporal validation. Moreover, in patients undergoing innovative immunotherapy, the possibility of mortality risk was overestimated in varying severity. Clinicians should be aware of this overestimation and discount the prediction of the SORG MLA according to their own experience with this patient population. Generally, these results show that temporal reassessment of these MLA-driven probability calculators is of paramount importance because the predictive performance may decline over time as treatment regimens evolve. The SORG-MLA is available as a freely accessible internet application at https://sorg-apps.shinyapps.io/extremitymetssurvival/ .Level of Evidence Level III, prognostic study.
Collapse
Affiliation(s)
- Tom M. de Groot
- Massachusetts General Hospital, Boston, MA, USA
- University Medical Center Groningen, Groningen, the Netherlands
| | - Duncan Ramsey
- University of Texas RGV School of Medicine, Edinburg, TX, USA
| | | | | | | | | | | | | | | | | | | | - Eric Newman
- Massachusetts General Hospital, Boston, MA, USA
| | | | | | | |
Collapse
|
20
|
Wesdorp NJ, Zeeuw JM, Postma SCJ, Roor J, van Waesberghe JHTM, van den Bergh JE, Nota IM, Moos S, Kemna R, Vadakkumpadan F, Ambrozic C, van Dieren S, van Amerongen MJ, Chapelle T, Engelbrecht MRW, Gerhards MF, Grunhagen D, van Gulik TM, Hermans JJ, de Jong KP, Klaase JM, Liem MSL, van Lienden KP, Molenaar IQ, Patijn GA, Rijken AM, Ruers TM, Verhoef C, de Wilt JHW, Marquering HA, Stoker J, Swijnenburg RJ, Punt CJA, Huiskens J, Kazemier G. Deep learning models for automatic tumor segmentation and total tumor volume assessment in patients with colorectal liver metastases. Eur Radiol Exp 2023; 7:75. [PMID: 38038829 PMCID: PMC10692044 DOI: 10.1186/s41747-023-00383-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 09/08/2023] [Indexed: 12/02/2023] Open
Abstract
BACKGROUND We developed models for tumor segmentation to automate the assessment of total tumor volume (TTV) in patients with colorectal liver metastases (CRLM). METHODS In this prospective cohort study, pre- and post-systemic treatment computed tomography (CT) scans of 259 patients with initially unresectable CRLM of the CAIRO5 trial (NCT02162563) were included. In total, 595 CT scans comprising 8,959 CRLM were divided into training (73%), validation (6.5%), and test sets (21%). Deep learning models were trained with ground truth segmentations of the liver and CRLM. TTV was calculated based on the CRLM segmentations. An external validation cohort was included, comprising 72 preoperative CT scans of patients with 112 resectable CRLM. Image segmentation evaluation metrics and intraclass correlation coefficient (ICC) were calculated. RESULTS In the test set (122 CT scans), the autosegmentation models showed a global Dice similarity coefficient (DSC) of 0.96 (liver) and 0.86 (CRLM). The corresponding median per-case DSC was 0.96 (interquartile range [IQR] 0.95-0.96) and 0.80 (IQR 0.67-0.87). For tumor segmentation, the intersection-over-union, precision, and recall were 0.75, 0.89, and 0.84, respectively. An excellent agreement was observed between the reference and automatically computed TTV for the test set (ICC 0.98) and external validation cohort (ICC 0.98). In the external validation, the global DSC was 0.82 and the median per-case DSC was 0.60 (IQR 0.29-0.76) for tumor segmentation. CONCLUSIONS Deep learning autosegmentation models were able to segment the liver and CRLM automatically and accurately in patients with initially unresectable CRLM, enabling automatic TTV assessment in such patients. RELEVANCE STATEMENT Automatic segmentation enables the assessment of total tumor volume in patients with colorectal liver metastases, with a high potential of decreasing radiologist's workload and increasing accuracy and consistency. KEY POINTS • Tumor response evaluation is time-consuming, manually performed, and ignores total tumor volume. • Automatic models can accurately segment tumors in patients with colorectal liver metastases. • Total tumor volume can be accurately calculated based on automatic segmentations.
Collapse
Affiliation(s)
- Nina J Wesdorp
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands.
| | - J Michiel Zeeuw
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands.
| | - Sam C J Postma
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | - Joran Roor
- Department of Health, SAS Institute B.V, Huizen, the Netherlands
| | - Jan Hein T M van Waesberghe
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Janneke E van den Bergh
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Irene M Nota
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Shira Moos
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Ruby Kemna
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | - Fijoy Vadakkumpadan
- Department of Computer Vision and Machine Learning, SAS Institute Inc, Cary, NC, USA
| | - Courtney Ambrozic
- Department of Computer Vision and Machine Learning, SAS Institute Inc, Cary, NC, USA
| | - Susan van Dieren
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | | | - Thiery Chapelle
- Department of Hepatobiliary, Transplantation, and Endocrine Surgery, Antwerp University Hospital, Antwerp, Belgium
| | - Marc R W Engelbrecht
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | | | - Dirk Grunhagen
- Department of Surgical Oncology and Gastrointestinal Surgery, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
| | - Thomas M van Gulik
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | - John J Hermans
- Department of Medical Imaging, Radboud University Medical Center, Radboud University Nijmegen, Nijmegen, the Netherlands
| | - Koert P de Jong
- Department of HPB Surgery and Liver Transplantation, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Joost M Klaase
- Department of HPB Surgery and Liver Transplantation, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Mike S L Liem
- Department of Surgery, Medical Spectrum Twente, Enschede, the Netherlands
| | - Krijn P van Lienden
- Department of Interventional Radiology, St Antonius Hospital, Nieuwegein, the Netherlands
| | - I Quintus Molenaar
- Department of Surgery, Regional Academic Cancer Center Utrecht, University Medical Center Utrecht, Utrecht, the Netherlands
- Department of Surgery, St Antonius Hospital, Nieuwegein, the Netherlands
| | - Gijs A Patijn
- Department of Surgery, Isala Hospital, Zwolle, the Netherlands
| | - Arjen M Rijken
- Department of Surgery, Amphia Hospital, Breda, the Netherlands
| | - Theo M Ruers
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | - Cornelis Verhoef
- Department of Surgical Oncology and Gastrointestinal Surgery, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
| | - Johannes H W de Wilt
- Department of Surgery, Radboud University Medical Center, Radboud University Nijmegen, Nijmegen, the Netherlands
| | - Henk A Marquering
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Department of Biomedical Engineering and Physics, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Jaap Stoker
- Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Rutger-Jan Swijnenburg
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | - Cornelis J A Punt
- Department of Medical Oncology, Cancer Center Amsterdam, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
- Department of Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Joost Huiskens
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| | - Geert Kazemier
- Department of Surgery, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV, Amsterdam, the Netherlands
| |
Collapse
|
21
|
Wang X, Sanders HM, Liu Y, Seang K, Tran BX, Atanasov AG, Qiu Y, Tang S, Car J, Wang YX, Wong TY, Tham YC, Chung KC. ChatGPT: promise and challenges for deployment in low- and middle-income countries. THE LANCET REGIONAL HEALTH. WESTERN PACIFIC 2023; 41:100905. [PMID: 37731897 PMCID: PMC10507635 DOI: 10.1016/j.lanwpc.2023.100905] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 08/14/2023] [Accepted: 09/03/2023] [Indexed: 09/22/2023]
Abstract
In low- and middle-income countries (LMICs), the fields of medicine and public health grapple with numerous challenges that continue to hinder patients' access to healthcare services. ChatGPT, a publicly accessible chatbot, has emerged as a potential tool in aiding public health efforts in LMICs. This viewpoint details the potential benefits of employing ChatGPT in LMICs to improve medicine and public health encompassing a broad spectrum of domains ranging from health literacy, screening, triaging, remote healthcare support, mental health support, multilingual capabilities, healthcare communication and documentation, medical training and education, and support for healthcare professionals. Additionally, we also share potential concerns and limitations associated with the use of ChatGPT and provide a balanced discussion on the opportunities and challenges of using ChatGPT in LMICs.
Collapse
Affiliation(s)
- Xiaofei Wang
- Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, China
| | - Hayley M. Sanders
- Section of Plastic Surgery, Department of Surgery, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Yuchen Liu
- Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, China
| | - Kennarey Seang
- Grant Management Office, University of Health Sciences, Phnom Penh, Cambodia
| | - Bach Xuan Tran
- Department of Health Economics, Institute for Preventive Medicine and Public Health, Hanoi Medical University, Hanoi, Vietnam
- Institute of Health Economics and Technology, Hanoi, Vietnam
| | - Atanas G. Atanasov
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Medical University of Vienna, Spitalgasse 23, 1090, Vienna, Austria
- Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Jastrzebiec, 05-552, Magdalenka, Poland
| | - Yue Qiu
- Institute for Hospital Management, Tsinghua University, Beijing, China
| | - Shenglan Tang
- Duke Global Health Institute, Duke University, Durham, NC, USA
| | - Josip Car
- Centre for Population Health Sciences, Lee Kong Chian School of Medicine, Nanyang Technological University Singapore, Singapore
- Department of Primary Care and Public Health, School of Public Health, Imperial College London, London, United Kingdom
| | - Ya Xing Wang
- Beijing Institute of Ophthalmology, Beijing Ophthalmology and Visual Science Key Lab, Beijing Tongren Eye Center, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Tien Yin Wong
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
- Tsinghua Medicine, Tsinghua University, Beijing, China
- School of Clinical Medicine, Beijing Tsinghua Changgung Hospital, Beijing, China
| | - Yih-Chung Tham
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
- Centre for Innovation and Precision Eye Health, Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Ophthalmology and Visual Science Academic Clinical Program, Duke-NUS Medical School, Singapore
| | - Kevin C. Chung
- Section of Plastic Surgery, Department of Surgery, University of Michigan Medical School, Ann Arbor, MI, USA
| |
Collapse
|
22
|
McCradden MD, Joshi S, Anderson JA, London AJ. A normative framework for artificial intelligence as a sociotechnical system in healthcare. PATTERNS (NEW YORK, N.Y.) 2023; 4:100864. [PMID: 38035190 PMCID: PMC10682751 DOI: 10.1016/j.patter.2023.100864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
Artificial intelligence (AI) tools are of great interest to healthcare organizations for their potential to improve patient care, yet their translation into clinical settings remains inconsistent. One of the reasons for this gap is that good technical performance does not inevitably result in patient benefit. We advocate for a conceptual shift wherein AI tools are seen as components of an intervention ensemble. The intervention ensemble describes the constellation of practices that, together, bring about benefit to patients or health systems. Shifting from a narrow focus on the tool itself toward the intervention ensemble prioritizes a "sociotechnical" vision for translation of AI that values all components of use that support beneficial patient outcomes. The intervention ensemble approach can be used for regulation, institutional oversight, and for AI adopters to responsibly and ethically appraise, evaluate, and use AI tools.
Collapse
Affiliation(s)
- Melissa D. McCradden
- Department of Bioethics, The Hospital for Sick Children, Toronto, ON, Canada
- Genetics & Genome Biology Research Program, Peter Gilgan Center for Research & Learning, Toronto, ON, Canada
- Division of Clinical & Public Health, Dalla Lana School of Public Health, Toronto, ON, Canada
| | - Shalmali Joshi
- Department of Biomedical Informatics, Department of Computer Science (Affliate), Data Science Institute, Columbia University, New York, NY, USA
| | - James A. Anderson
- Department of Bioethics, The Hospital for Sick Children, Toronto, ON, Canada
- Institute for Health Policy, Management, and Evaluation, University of Toronto, Toronto, ON, Canada
| | - Alex John London
- Department of Philosophy and Center for Ethics and Policy, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
23
|
|
24
|
Fischer A, Rietveld A, Teunissen P, Hoogendoorn M, Bakker P. What is the future of artificial intelligence in obstetrics? A qualitative study among healthcare professionals. BMJ Open 2023; 13:e076017. [PMID: 37879682 PMCID: PMC10603416 DOI: 10.1136/bmjopen-2023-076017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/27/2023] Open
Abstract
OBJECTIVE This work explores the perceptions of obstetrical clinicians about artificial intelligence (AI) in order to bridge the gap in uptake of AI between research and medical practice. Identifying potential areas where AI can contribute to clinical practice, enables AI research to align with the needs of clinicians and ultimately patients. DESIGN Qualitative interview study. SETTING A national study conducted in the Netherlands between November 2022 and February 2023. PARTICIPANTS Dutch clinicians working in obstetrics with varying relevant work experience, gender and age. ANALYSIS Thematic analysis of qualitative interview transcripts. RESULTS Thirteen gynaecologists were interviewed about hypothetical scenarios of an implemented AI model. Thematic analysis identified two major themes: perceived usefulness and trust. Usefulness involved AI extending human brain capacity in complex pattern recognition and information processing, reducing contextual influence and saving time. Trust required validation, explainability and successful personal experience. This result shows two paradoxes: first, AI is expected to provide added value by surpassing human capabilities, yet also a need to understand the parameters and their influence on predictions for trust and adoption was expressed. Second, participants recognised the value of incorporating numerous parameters into a model, but they also believed that certain contextual factors should only be considered by humans, as it would be undesirable for AI models to use that information. CONCLUSIONS Obstetricians' opinions on the potential value of AI highlight the need for clinician-AI researcher collaboration. Trust can be built through conventional means like randomised controlled trials and guidelines. Holistic impact metrics, such as changes in workflow, not just clinical outcomes, should guide AI model development. Further research is needed for evaluating evolving AI systems beyond traditional validation methods.
Collapse
Affiliation(s)
- Anne Fischer
- Department of Obstetrics and Gynecology, Amsterdam UMC Location VUmc, Amsterdam, The Netherlands
- Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development Research Institute, Amsterdam, The Netherlands
| | - Anna Rietveld
- Department of Obstetrics and Gynecology, Amsterdam UMC Location VUmc, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development Research Institute, Amsterdam, The Netherlands
| | - Pim Teunissen
- School of Health Professions Education, Faculty of Health Medicine and Life Sciences, Maastricht University, Maastricht, The Netherlands
- Department of Gynaecology & Obstetrics, Maastricht UMC, Maastricht, The Netherlands
| | - Mark Hoogendoorn
- Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Petra Bakker
- Department of Obstetrics and Gynecology, Amsterdam UMC Location VUmc, Amsterdam, The Netherlands
- Amsterdam Reproduction and Development Research Institute, Amsterdam, The Netherlands
| |
Collapse
|
25
|
Ramaekers M, Viviers CGA, Janssen BV, Hellström TAE, Ewals L, van der Wulp K, Nederend J, Jacobs I, Pluyter JR, Mavroeidis D, van der Sommen F, Besselink MG, Luyer MDP. Computer-Aided Detection for Pancreatic Cancer Diagnosis: Radiological Challenges and Future Directions. J Clin Med 2023; 12:4209. [PMID: 37445243 DOI: 10.3390/jcm12134209] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 06/08/2023] [Accepted: 06/19/2023] [Indexed: 07/15/2023] Open
Abstract
Radiological imaging plays a crucial role in the detection and treatment of pancreatic ductal adenocarcinoma (PDAC). However, there are several challenges associated with the use of these techniques in daily clinical practice. Determination of the presence or absence of cancer using radiological imaging is difficult and requires specific expertise, especially after neoadjuvant therapy. Early detection and characterization of tumors would potentially increase the number of patients who are eligible for curative treatment. Over the last decades, artificial intelligence (AI)-based computer-aided detection (CAD) has rapidly evolved as a means for improving the radiological detection of cancer and the assessment of the extent of disease. Although the results of AI applications seem promising, widespread adoption in clinical practice has not taken place. This narrative review provides an overview of current radiological CAD systems in pancreatic cancer, highlights challenges that are pertinent to clinical practice, and discusses potential solutions for these challenges.
Collapse
Affiliation(s)
- Mark Ramaekers
- Department of Surgery, Catharina Cancer Institute, Catharina Hospital Eindhoven, 5623 EJ Eindhoven, The Netherlands
| | - Christiaan G A Viviers
- Department of Electrical Engineering, Eindhoven University of Technology, 5612 AZ Eindhoven, The Netherlands
| | - Boris V Janssen
- Department of Surgery, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
- Cancer Center Amsterdam, 1081 HV Amsterdam, The Netherlands
| | - Terese A E Hellström
- Department of Electrical Engineering, Eindhoven University of Technology, 5612 AZ Eindhoven, The Netherlands
| | - Lotte Ewals
- Department of Radiology, Catharina Cancer Institute, Catharina Hospital Eindhoven, 5623 EJ Eindhoven, The Netherlands
| | - Kasper van der Wulp
- Department of Radiology, Catharina Cancer Institute, Catharina Hospital Eindhoven, 5623 EJ Eindhoven, The Netherlands
| | - Joost Nederend
- Department of Radiology, Catharina Cancer Institute, Catharina Hospital Eindhoven, 5623 EJ Eindhoven, The Netherlands
| | - Igor Jacobs
- Department of Hospital Services and Informatics, Philips Research, 5656 AE Eindhoven, The Netherlands
| | - Jon R Pluyter
- Department of Experience Design, Philips Design, 5656 AE Eindhoven, The Netherlands
| | - Dimitrios Mavroeidis
- Department of Data Science, Philips Research, 5656 AE Eindhoven, The Netherlands
| | - Fons van der Sommen
- Department of Electrical Engineering, Eindhoven University of Technology, 5612 AZ Eindhoven, The Netherlands
| | - Marc G Besselink
- Department of Surgery, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
- Cancer Center Amsterdam, 1081 HV Amsterdam, The Netherlands
| | - Misha D P Luyer
- Department of Surgery, Catharina Cancer Institute, Catharina Hospital Eindhoven, 5623 EJ Eindhoven, The Netherlands
| |
Collapse
|
26
|
Bienefeld N, Boss JM, Lüthy R, Brodbeck D, Azzati J, Blaser M, Willms J, Keller E. Solving the explainable AI conundrum by bridging clinicians' needs and developers' goals. NPJ Digit Med 2023; 6:94. [PMID: 37217779 DOI: 10.1038/s41746-023-00837-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 05/05/2023] [Indexed: 05/24/2023] Open
Abstract
Explainable artificial intelligence (XAI) has emerged as a promising solution for addressing the implementation challenges of AI/ML in healthcare. However, little is known about how developers and clinicians interpret XAI and what conflicting goals and requirements they may have. This paper presents the findings of a longitudinal multi-method study involving 112 developers and clinicians co-designing an XAI solution for a clinical decision support system. Our study identifies three key differences between developer and clinician mental models of XAI, including opposing goals (model interpretability vs. clinical plausibility), different sources of truth (data vs. patient), and the role of exploring new vs. exploiting old knowledge. Based on our findings, we propose design solutions that can help address the XAI conundrum in healthcare, including the use of causal inference models, personalized explanations, and ambidexterity between exploration and exploitation mindsets. Our study highlights the importance of considering the perspectives of both developers and clinicians in the design of XAI systems and provides practical recommendations for improving the effectiveness and usability of XAI in healthcare.
Collapse
Affiliation(s)
- Nadine Bienefeld
- Department of Management, Technology, and Economics, ETH Zurich, Zürich, Switzerland.
| | - Jens Michael Boss
- Neurocritical Care Unit, Department of Neurosurgery and Institute of Intensive Care Medicine, Clinical Neuroscience Center, University Hospital Zurich and University of Zurich, Zürich, Switzerland
| | - Rahel Lüthy
- Institute for Medical Engineering and Medical Informatics, School of Life Sciences FHNW, Muttenz, Switzerland
| | - Dominique Brodbeck
- Institute for Medical Engineering and Medical Informatics, School of Life Sciences FHNW, Muttenz, Switzerland
| | - Jan Azzati
- Institute for Medical Engineering and Medical Informatics, School of Life Sciences FHNW, Muttenz, Switzerland
| | - Mirco Blaser
- Institute for Medical Engineering and Medical Informatics, School of Life Sciences FHNW, Muttenz, Switzerland
| | - Jan Willms
- Neurocritical Care Unit, Department of Neurosurgery and Institute of Intensive Care Medicine, Clinical Neuroscience Center, University Hospital Zurich and University of Zurich, Zürich, Switzerland
| | - Emanuela Keller
- Neurocritical Care Unit, Department of Neurosurgery and Institute of Intensive Care Medicine, Clinical Neuroscience Center, University Hospital Zurich and University of Zurich, Zürich, Switzerland
| |
Collapse
|
27
|
Diab KM, Deng J, Wu Y, Yesha Y, Collado-Mesa F, Nguyen P. Natural Language Processing for Breast Imaging: A Systematic Review. Diagnostics (Basel) 2023; 13:diagnostics13081420. [PMID: 37189521 DOI: 10.3390/diagnostics13081420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/05/2023] [Accepted: 04/11/2023] [Indexed: 05/17/2023] Open
Abstract
Natural Language Processing (NLP) has gained prominence in diagnostic radiology, offering a promising tool for improving breast imaging triage, diagnosis, lesion characterization, and treatment management in breast cancer and other breast diseases. This review provides a comprehensive overview of recent advances in NLP for breast imaging, covering the main techniques and applications in this field. Specifically, we discuss various NLP methods used to extract relevant information from clinical notes, radiology reports, and pathology reports and their potential impact on the accuracy and efficiency of breast imaging. In addition, we reviewed the state-of-the-art in NLP-based decision support systems for breast imaging, highlighting the challenges and opportunities of NLP applications for breast imaging in the future. Overall, this review underscores the potential of NLP in enhancing breast imaging care and offers insights for clinicians and researchers interested in this exciting and rapidly evolving field.
Collapse
Affiliation(s)
- Kareem Mahmoud Diab
- Institute for Data Science and Computing, University of Miami, Miami, FL 33146, USA
| | - Jamie Deng
- Department of Computer Science, University of Miami, Miami, FL 33146, USA
| | - Yusen Wu
- Institute for Data Science and Computing, University of Miami, Miami, FL 33146, USA
| | - Yelena Yesha
- Institute for Data Science and Computing, University of Miami, Miami, FL 33146, USA
- Department of Computer Science, University of Miami, Miami, FL 33146, USA
- Department of Radiology, Miller School of Medicine, University of Miami, Miami, FL 33146, USA
| | - Fernando Collado-Mesa
- Department of Radiology, Miller School of Medicine, University of Miami, Miami, FL 33146, USA
| | - Phuong Nguyen
- Institute for Data Science and Computing, University of Miami, Miami, FL 33146, USA
- Department of Computer Science, University of Miami, Miami, FL 33146, USA
- OpenKnect Inc., Halethorpe, MD 21227, USA
| |
Collapse
|
28
|
Tovar DR, Rosenthal MH, Maitra A, Koay EJ. Potential of artificial intelligence in the risk stratification for and early detection of pancreatic cancer. ARTIFICIAL INTELLIGENCE SURGERY 2023; 3:14-26. [PMID: 37124705 PMCID: PMC10141523 DOI: 10.20517/ais.2022.38] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Pancreatic ductal adenocarcinoma (PDAC) is the third most lethal cancer in the United States, with a 5-year life expectancy of 11%. Most symptoms manifest at an advanced stage of the disease when surgery is no longer appropriate. The dire prognosis of PDAC warrants new strategies to improve the outcomes of patients, and early detection has garnered significant attention. However, early detection of PDAC is most often incidental, emphasizing the importance of developing new early detection screening strategies. Due to the low incidence of the disease in the general population, much of the focus for screening has turned to individuals at high risk of PDAC. This enriches the screening population and balances the risks associated with pancreas interventions. The cancers that are found in these high-risk individuals by MRI and/or EUS screening show favorable 73% 5-year overall survival. Even with the emphasis on screening in enriched high-risk populations, only a minority of incident cancers are detected this way. One strategy to improve early detection outcomes is to integrate artificial intelligence (AI) into biomarker discovery and risk models. This expert review summarizes recent publications that have developed AI algorithms for the applications of risk stratification of PDAC using radiomics and electronic health records. Furthermore, this review illustrates the current uses of radiomics and biomarkers in AI for early detection of PDAC. Finally, various challenges and potential solutions are highlighted regarding the use of AI in medicine for early detection purposes.
Collapse
Affiliation(s)
- Daniela R. Tovar
- Department of Gastrointestinal Radiation Oncology, The University of Texas, Anderson Cancer Center, Houston, TX 77030, USA
| | | | - Anirban Maitra
- Department of Radiology, The University of Texas, Anderson Cancer Center, Houston, TX 77030, USA
| | - Eugene J. Koay
- Department of Gastrointestinal Radiation Oncology, The University of Texas, Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
29
|
Gill SK, Karwath A, Uh HW, Cardoso VR, Gu Z, Barsky A, Slater L, Acharjee A, Duan J, Dall'Olio L, el Bouhaddani S, Chernbumroong S, Stanbury M, Haynes S, Asselbergs FW, Grobbee DE, Eijkemans MJC, Gkoutos GV, Kotecha D. Artificial intelligence to enhance clinical value across the spectrum of cardiovascular healthcare. Eur Heart J 2023; 44:713-725. [PMID: 36629285 PMCID: PMC9976986 DOI: 10.1093/eurheartj/ehac758] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 11/22/2022] [Accepted: 12/05/2022] [Indexed: 01/12/2023] Open
Abstract
Artificial intelligence (AI) is increasingly being utilized in healthcare. This article provides clinicians and researchers with a step-wise foundation for high-value AI that can be applied to a variety of different data modalities. The aim is to improve the transparency and application of AI methods, with the potential to benefit patients in routine cardiovascular care. Following a clear research hypothesis, an AI-based workflow begins with data selection and pre-processing prior to analysis, with the type of data (structured, semi-structured, or unstructured) determining what type of pre-processing steps and machine-learning algorithms are required. Algorithmic and data validation should be performed to ensure the robustness of the chosen methodology, followed by an objective evaluation of performance. Seven case studies are provided to highlight the wide variety of data modalities and clinical questions that can benefit from modern AI techniques, with a focus on applying them to cardiovascular disease management. Despite the growing use of AI, further education for healthcare workers, researchers, and the public are needed to aid understanding of how AI works and to close the existing gap in knowledge. In addition, issues regarding data access, sharing, and security must be addressed to ensure full engagement by patients and the public. The application of AI within healthcare provides an opportunity for clinicians to deliver a more personalized approach to medical care by accounting for confounders, interactions, and the rising prevalence of multi-morbidity.
Collapse
Affiliation(s)
- Simrat K Gill
- Institute of Cardiovascular Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Andreas Karwath
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | - Hae-Won Uh
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Victor Roth Cardoso
- Institute of Cardiovascular Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | - Zhujie Gu
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Andrey Barsky
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | - Luke Slater
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | - Animesh Acharjee
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | - Jinming Duan
- School of Computer Science, University of Birmingham, Birmingham, UK
- Alan Turing Institute, London, UK
| | - Lorenzo Dall'Olio
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy
| | - Said el Bouhaddani
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Saisakul Chernbumroong
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | | | | | - Folkert W Asselbergs
- Amsterdam University Medical Center, Department of Cardiology, University of Amsterdam, Amsterdam, The Netherlands
- Health Data Research UK and Institute of Health Informatics, University College London, London, UK
| | - Diederick E Grobbee
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Marinus J C Eijkemans
- Julius Center for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Georgios V Gkoutos
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Institute of Cancer and Genomic Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
| | - Dipak Kotecha
- Institute of Cardiovascular Sciences, University of Birmingham, Vincent Drive, B15 2TT Birmingham, UK
- Health Data Research UK Midlands, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
30
|
Liljendahl MS, Loft N, Egeberg A, Skov L, Nguyen TL. Development and internal validation of a diagnostic prediction model for psoriasis severity. Diagn Progn Res 2023; 7:2. [PMID: 36747306 PMCID: PMC9903561 DOI: 10.1186/s41512-023-00141-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 01/10/2023] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND While administrative health records such as national registries may be useful data sources to study the epidemiology of psoriasis, they do not generally contain information on disease severity. OBJECTIVES To develop a diagnostic model to distinguish psoriasis severity based on administrative register data. METHOD We conducted a retrospective registry-based cohort study using the Danish Skin Cohort linked with the Danish national registries. We developed a diagnostic model using a gradient boosting machine learning technique to predict moderate-to-severe psoriasis. We performed an internal validation of the model by bootstrapping to account for any optimism. RESULTS Among 4016 adult psoriasis patients (55.8% women, mean age 59 years) included in this study, 1212 (30.2%) patients were identified as having moderate-to-severe psoriasis. The diagnostic prediction model yielded a bootstrap-corrected discrimination performance: c-statistic equal to 0.73 [95% CI: 0.71-0.74]. The internal validation by bootstrap correction showed no substantial optimism in the results with a c-statistic of 0.72 [95% CI: 0.70-0.74]. A bootstrap-corrected slope of 1.10 [95% CI: 1.07-1.13] indicated a slight under-fitting. CONCLUSION Based on register data, we developed a gradient boosting diagnostic model returning acceptable prediction of patients with moderate-to-severe psoriasis.
Collapse
Affiliation(s)
- Mie Sylow Liljendahl
- Department of Dermatology and Allergy, Herlev and Gentofte Hospital, University of Copenhagen, Gentofte Hospitalsvej 15, 2900, Hellerup, Denmark.
- Department of Dermatology and Venereology, Bispebjerg and Frederiksberg Hospital, University of Copenhagen, Copenhagen, Denmark.
| | - Nikolai Loft
- Department of Dermatology and Allergy, Herlev and Gentofte Hospital, University of Copenhagen, Gentofte Hospitalsvej 15, 2900, Hellerup, Denmark
| | - Alexander Egeberg
- Department of Dermatology and Venereology, Bispebjerg and Frederiksberg Hospital, University of Copenhagen, Copenhagen, Denmark
| | - Lone Skov
- Department of Dermatology and Allergy, Herlev and Gentofte Hospital, University of Copenhagen, Gentofte Hospitalsvej 15, 2900, Hellerup, Denmark
| | - Tri-Long Nguyen
- Department of Public Health, Section of Epidemiology, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
31
|
Koçak B, Cuocolo R, dos Santos DP, Stanzione A, Ugga L. Must-have Qualities of Clinical Research on Artificial Intelligence and Machine Learning. Balkan Med J 2023; 40:3-12. [PMID: 36578657 PMCID: PMC9874249 DOI: 10.4274/balkanmedj.galenos.2022.2022-11-51] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 12/06/2022] [Indexed: 12/30/2022] Open
Abstract
In the field of computer science, known as artificial intelligence, algorithms imitate reasoning tasks that are typically performed by humans. The techniques that allow machines to learn and get better at tasks such as recognition and prediction, which form the basis of clinical practice, are referred to as machine learning, which is a subfield of artificial intelligence. The number of artificial intelligence-and machine learnings-related publications in clinical journals has grown exponentially, driven by recent developments in computation and the accessibility of simple tools. However, clinicians are often not included in data science teams, which may limit the clinical relevance, explanability, workflow compatibility, and quality improvement of artificial intelligence solutions. Thus, this results in the language barrier between clinicians and artificial intelligence developers. Healthcare practitioners sometimes lack a basic understanding of artificial intelligence research because the approach is difficult for non-specialists to understand. Furthermore, many editors and reviewers of medical publications might not be familiar with the fundamental ideas behind these technologies, which may prevent journals from publishing high-quality artificial intelligence studies or, worse still, could allow for the publication of low-quality works. In this review, we aim to improve readers’ artificial intelligence literacy and critical thinking. As a result, we concentrated on what we consider the 10 most important qualities of artificial intelligence research: valid scientific purpose, high-quality data set, robust reference standard, robust input, no information leakage, optimal bias-variance tradeoff, proper model evaluation, proven clinical utility, transparent reporting, and open science. Before designing a study, one should have defined a sound scientific purpose. Then, it should be backed by a high-quality data set, robust input, and a solid reference standard. The artificial intelligence development pipeline should prevent information leakage. For the models, optimal bias-variance tradeoff should be achieved, and generalizability assessment must be adequately performed. The clinical value of the final models must also be established. After the study, thought should be given to transparency in publishing the process and results as well as open science for sharing data, code, and models. We hope this work may improve the artificial intelligence literacy and mindset of the readers.
Collapse
Affiliation(s)
- Burak Koçak
- Clinic of Radiology, University of Health Sciences Turkey, Başakşehir Çam and Sakura City Hospital, İstanbul, Turkey
| | - Renato Cuocolo
- Department of Medicine, Surgery and Dentistry University of Salerno, Baronissi, Italy
| | - Daniel Pinto dos Santos
- Department of Radiology, University Hospital of Cologne, Cologne, Germany
- Department of Radiology, University Hospital of Frankfurt, Frankfurt, Germany
| | - Arnaldo Stanzione
- Department of Advanced Biomedical Sciences, University of Naples “Federico II”, Napoli, Italy
| | - Lorenzo Ugga
- Department of Advanced Biomedical Sciences, University of Naples “Federico II”, Napoli, Italy
| |
Collapse
|
32
|
Kushniruk A, de Hond AAH, Thoral PJ, Steyerberg EW, Kant IMJ, Cinà G, Arbous MS. Intensive Care Unit Physicians' Perspectives on Artificial Intelligence-Based Clinical Decision Support Tools: Preimplementation Survey Study. JMIR Hum Factors 2023; 10:e39114. [PMID: 36602843 PMCID: PMC9853335 DOI: 10.2196/39114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 11/21/2022] [Accepted: 11/27/2022] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Artificial intelligence-based clinical decision support (AI-CDS) tools have great potential to benefit intensive care unit (ICU) patients and physicians. There is a gap between the development and implementation of these tools. OBJECTIVE We aimed to investigate physicians' perspectives and their current decision-making behavior before implementing a discharge AI-CDS tool for predicting readmission and mortality risk after ICU discharge. METHODS We conducted a survey of physicians involved in decision-making on discharge of patients at two Dutch academic ICUs between July and November 2021. Questions were divided into four domains: (1) physicians' current decision-making behavior with respect to discharging ICU patients, (2) perspectives on the use of AI-CDS tools in general, (3) willingness to incorporate a discharge AI-CDS tool into daily clinical practice, and (4) preferences for using a discharge AI-CDS tool in daily workflows. RESULTS Most of the 64 respondents (of 93 contacted, 69%) were familiar with AI (62/64, 97%) and had positive expectations of AI, with 55 of 64 (86%) believing that AI could support them in their work as a physician. The respondents disagreed on whether the decision to discharge a patient was complex (23/64, 36% agreed and 22/64, 34% disagreed); nonetheless, most (59/64, 92%) agreed that a discharge AI-CDS tool could be of value. Significant differences were observed between physicians from the 2 academic sites, which may be related to different levels of involvement in the development of the discharge AI-CDS tool. CONCLUSIONS ICU physicians showed a favorable attitude toward the integration of AI-CDS tools into the ICU setting in general, and in particular toward a tool to predict a patient's risk of readmission and mortality within 7 days after discharge. The findings of this questionnaire will be used to improve the implementation process and training of end users.
Collapse
Affiliation(s)
| | - Anne A H de Hond
- Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, Netherlands.,Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| | - Patrick J Thoral
- Department of Intensive Care Medicine, Laboratory for Critical Care Computational Intelligence, Amsterdam Medical Data Science, Amsterdam University Medical Centers, Amsterdam, Netherlands
| | - Ewout W Steyerberg
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| | - Ilse M J Kant
- Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, Netherlands.,Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| | - Giovanni Cinà
- Pacmed, Amsterdam, Netherlands.,Institute for Logic, Language and Computation, University of Amsterdam, Amsterdam, Netherlands.,Department of Medical Informatics, Amsterdam University Medical Center, University of Amsterdam, Amsterdam, Netherlands
| | - M Sesmu Arbous
- Department of Intensive Care Medicine, Leiden University Medical Center, Leiden, Netherlands
| |
Collapse
|
33
|
Hasan MM, Islam MU, Sadeq MJ, Fung WK, Uddin J. Review on the Evaluation and Development of Artificial Intelligence for COVID-19 Containment. SENSORS (BASEL, SWITZERLAND) 2023; 23:527. [PMID: 36617124 PMCID: PMC9824505 DOI: 10.3390/s23010527] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 12/23/2022] [Accepted: 12/29/2022] [Indexed: 06/17/2023]
Abstract
Artificial intelligence has significantly enhanced the research paradigm and spectrum with a substantiated promise of continuous applicability in the real world domain. Artificial intelligence, the driving force of the current technological revolution, has been used in many frontiers, including education, security, gaming, finance, robotics, autonomous systems, entertainment, and most importantly the healthcare sector. With the rise of the COVID-19 pandemic, several prediction and detection methods using artificial intelligence have been employed to understand, forecast, handle, and curtail the ensuing threats. In this study, the most recent related publications, methodologies and medical reports were investigated with the purpose of studying artificial intelligence's role in the pandemic. This study presents a comprehensive review of artificial intelligence with specific attention to machine learning, deep learning, image processing, object detection, image segmentation, and few-shot learning studies that were utilized in several tasks related to COVID-19. In particular, genetic analysis, medical image analysis, clinical data analysis, sound analysis, biomedical data classification, socio-demographic data analysis, anomaly detection, health monitoring, personal protective equipment (PPE) observation, social control, and COVID-19 patients' mortality risk approaches were used in this study to forecast the threatening factors of COVID-19. This study demonstrates that artificial-intelligence-based algorithms integrated into Internet of Things wearable devices were quite effective and efficient in COVID-19 detection and forecasting insights which were actionable through wide usage. The results produced by the study prove that artificial intelligence is a promising arena of research that can be applied for disease prognosis, disease forecasting, drug discovery, and to the development of the healthcare sector on a global scale. We prove that artificial intelligence indeed played a significantly important role in helping to fight against COVID-19, and the insightful knowledge provided here could be extremely beneficial for practitioners and research experts in the healthcare domain to implement the artificial-intelligence-based systems in curbing the next pandemic or healthcare disaster.
Collapse
Affiliation(s)
- Md. Mahadi Hasan
- Department of Computer Science and Engineering, Asian University of Bangladesh, Ashulia 1349, Bangladesh
| | - Muhammad Usama Islam
- School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette, LA 70504, USA
| | - Muhammad Jafar Sadeq
- Department of Computer Science and Engineering, Asian University of Bangladesh, Ashulia 1349, Bangladesh
| | - Wai-Keung Fung
- Department of Applied Computing and Engineering, Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff CF5 2YB, UK
| | - Jasim Uddin
- Department of Applied Computing and Engineering, Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff CF5 2YB, UK
| |
Collapse
|
34
|
From computer to bedside, involving neonatologists in artificial intelligence models for neonatal medicine. Pediatr Res 2023; 93:437-439. [PMID: 36526854 DOI: 10.1038/s41390-022-02413-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 10/08/2022] [Accepted: 11/27/2022] [Indexed: 12/23/2022]
Abstract
In recent years, data have become the main driver of medical innovation. With increased availability and decreased price of storage and computing power, the potential for improvement in care is enormous. Many data-driven explorations have started. However, the actual implementation of artificial intelligence in healthcare remains scarce. We describe essential elements during a computer-to-bedside process in a data science project that support the crucial role of the neonatologist. IMPACT: There is a great potential for data science in neonatal medicine. Multidisciplinary teams form the foundation of a data science project. Domain experts will need to play a pivotal role. We need an open learning environment.
Collapse
|
35
|
Improving child health through Big Data and data science. Pediatr Res 2023; 93:342-349. [PMID: 35974162 PMCID: PMC9380977 DOI: 10.1038/s41390-022-02264-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 06/10/2022] [Accepted: 06/28/2022] [Indexed: 12/04/2022]
Abstract
Child health is defined by a complex, dynamic network of genetic, cultural, nutritional, infectious, and environmental determinants at distinct, developmentally determined epochs from preconception to adolescence. This network shapes the future of children, susceptibilities to adult diseases, and individual child health outcomes. Evolution selects characteristics during fetal life, infancy, childhood, and adolescence that adapt to predictable and unpredictable exposures/stresses by creating alternative developmental phenotype trajectories. While child health has improved in the United States and globally over the past 30 years, continued improvement requires access to data that fully represent the complexity of these interactions and to new analytic methods. Big Data and innovative data science methods provide tools to integrate multiple data dimensions for description of best clinical, predictive, and preventive practices, for reducing racial disparities in child health outcomes, for inclusion of patient and family input in medical assessments, and for defining individual disease risk, mechanisms, and therapies. However, leveraging these resources will require new strategies that intentionally address institutional, ethical, regulatory, cultural, technical, and systemic barriers as well as developing partnerships with children and families from diverse backgrounds that acknowledge historical sources of mistrust. We highlight existing pediatric Big Data initiatives and identify areas of future research. IMPACT: Big Data and data science can improve child health. This review highlights the importance for child health of child-specific and life course-based Big Data and data science strategies. This review provides recommendations for future pediatric-specific Big Data and data science research.
Collapse
|
36
|
Rebooting the Electronic Health Record. J Med Syst 2022; 46:48. [PMID: 35670870 DOI: 10.1007/s10916-022-01834-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 05/13/2022] [Accepted: 05/16/2022] [Indexed: 10/18/2022]
Abstract
Justifications for the widespread adoption and integration of an electronic health record (EHR) have long leaned on the purported benefits of the technology. However, the performance of the EHR has been underwhelming relative to the promises of immediate access to relevant patient information, clinical decision supports, computerized ordering, and transferable patient data. In this narrative review, we provide an overview of the historical problems and limitations of the EHR, detail the core principles that define agile processes that may overcome the barriers faced by the current EHR, and re-imagine what an integrated, seamless EHR that serves its users and patients might look like. Moving forward, the EHR should be redesigned using a middle-out framework and empowering dual-type champions to maintain the sustainable diffusion of future innovations.
Collapse
|
37
|
Fritsch SJ, Blankenheim A, Wahl A, Hetfeld P, Maassen O, Deffge S, Kunze J, Rossaint R, Riedel M, Marx G, Bickenbach J. Attitudes and perception of artificial intelligence in healthcare: A cross-sectional survey among patients. Digit Health 2022; 8:20552076221116772. [PMID: 35983102 PMCID: PMC9380417 DOI: 10.1177/20552076221116772] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 07/13/2022] [Indexed: 12/23/2022] Open
Abstract
Objective The attitudes about the usage of artificial intelligence in healthcare are
controversial. Unlike the perception of healthcare professionals, the
attitudes of patients and their companions have been of less interest so
far. In this study, we aimed to investigate the perception of artificial
intelligence in healthcare among this highly relevant group along with the
influence of digital affinity and sociodemographic factors. Methods We conducted a cross-sectional study using a paper-based questionnaire with
patients and their companions at a German tertiary referral hospital from
December 2019 to February 2020. The questionnaire consisted of three
sections examining (a) the respondents’ technical affinity, (b) their
perception of different aspects of artificial intelligence in healthcare and
(c) sociodemographic characteristics. Results From a total of 452 participants, more than 90% already read or heard about
artificial intelligence, but only 24% reported good or expert knowledge.
Asked on their general perception, 53.18% of the respondents rated the use
of artificial intelligence in medicine as positive or very positive, but
only 4.77% negative or very negative. The respondents denied concerns about
artificial intelligence, but strongly agreed that artificial intelligence
must be controlled by a physician. Older patients, women, persons with lower
education and technical affinity were more cautious on the
healthcare-related artificial intelligence usage. Conclusions German patients and their companions are open towards the usage of artificial
intelligence in healthcare. Although showing only a mediocre knowledge about
artificial intelligence, a majority rated artificial intelligence in
healthcare as positive. Particularly, patients insist that a physician
supervises the artificial intelligence and keeps ultimate responsibility for
diagnosis and therapy.
Collapse
Affiliation(s)
- Sebastian J Fritsch
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- SMITH Consortium of the German Medical Informatics Initiative, Germany
- Juelich Supercomputing Centre, Forschungszentrum Juelich, Germany
| | - Andrea Blankenheim
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
| | - Alina Wahl
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
| | - Petra Hetfeld
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- SMITH Consortium of the German Medical Informatics Initiative, Germany
| | - Oliver Maassen
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- SMITH Consortium of the German Medical Informatics Initiative, Germany
| | - Saskia Deffge
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- SMITH Consortium of the German Medical Informatics Initiative, Germany
| | - Julian Kunze
- SMITH Consortium of the German Medical Informatics Initiative, Germany
- Department of Anesthesiology, University Hospital RWTH Aachen, Germany
| | - Rolf Rossaint
- Department of Anesthesiology, University Hospital RWTH Aachen, Germany
| | - Morris Riedel
- SMITH Consortium of the German Medical Informatics Initiative, Germany
- Juelich Supercomputing Centre, Forschungszentrum Juelich, Germany
- Faculty of Industrial Engineering, Mechanical Engineering and Computer Science, University of Iceland, Iceland
| | - Gernot Marx
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- SMITH Consortium of the German Medical Informatics Initiative, Germany
| | - Johannes Bickenbach
- Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- SMITH Consortium of the German Medical Informatics Initiative, Germany
| |
Collapse
|