1
Bouhouita-Guermech S, Haidar H. Scoping Review Shows the Dynamics and Complexities Inherent to the Notion of "Responsibility" in Artificial Intelligence within the Healthcare Context. Asian Bioeth Rev 2024; 16:315-344. PMID: 39022380; PMCID: PMC11250714; DOI: 10.1007/s41649-024-00292-7.
Abstract
The increasing integration of artificial intelligence (AI) in healthcare presents a host of ethical, legal, social, and political challenges involving various stakeholders. These challenges prompt various studies proposing frameworks and guidelines to tackle these issues, emphasizing distinct phases of AI development, deployment, and oversight. As a result, the notion of responsible AI has become widespread, incorporating ethical principles such as transparency, fairness, responsibility, and privacy. This paper explores the existing literature on AI use in healthcare to examine how it addresses, defines, and discusses the concept of responsibility. We conducted a scoping review of literature related to AI responsibility in healthcare, searching databases and reference lists between January 2017 and January 2022 for terms related to "responsibility" and "AI in healthcare", and their derivatives. Following screening, 136 articles were included. Data were grouped into four thematic categories: (1) the variety of terminology used to describe and address responsibility; (2) principles and concepts associated with responsibility; (3) stakeholders' responsibilities in AI clinical development, use, and deployment; and (4) recommendations for addressing responsibility concerns. The results show the lack of a clear definition of AI responsibility in healthcare and highlight the importance of ensuring responsible development and implementation of AI in healthcare. Further research is necessary to clarify this notion to contribute to developing frameworks regarding the type of responsibility (ethical/moral/professional, legal, and causal) of various stakeholders involved in the AI lifecycle.
Affiliation(s)
- Hazar Haidar
- Ethics Programs, Department of Letters and Humanities, University of Quebec at Rimouski, Rimouski, Québec, Canada
2
Frost EK, Bosward R, Aquino YSJ, Braunack-Mayer A, Carter SM. Facilitating public involvement in research about healthcare AI: A scoping review of empirical methods. Int J Med Inform 2024; 186:105417. PMID: 38564959; DOI: 10.1016/j.ijmedinf.2024.105417.
Abstract
OBJECTIVE With the recent increase in research into public views on healthcare artificial intelligence (HCAI), the objective of this review is to examine the methods of empirical studies on public views on HCAI. We map how studies provided participants with information about HCAI, and we examine the extent to which studies framed publics as active contributors to HCAI governance. MATERIALS AND METHODS We searched 5 academic databases and Google Advanced for empirical studies investigating public views on HCAI. We extracted information including study aims, research instruments, and recommendations. RESULTS Sixty-two studies were included. Most were quantitative (N = 42). Most (N = 47) reported providing participants with background information about HCAI. Despite this, studies often reported participants' lack of prior knowledge about HCAI as a limitation. Over three quarters (N = 48) of the studies made recommendations that envisaged public views being used to guide governance of AI. DISCUSSION Provision of background information is an important component of facilitating research with publics on HCAI. The high proportion of studies reporting participants' lack of knowledge about HCAI as a limitation reflects the need for more guidance on how information should be presented. A minority of studies adopted technocratic positions that construed publics as passive beneficiaries of AI, rather than as active stakeholders in HCAI design and implementation. CONCLUSION This review draws attention to how public roles in HCAI governance are constructed in empirical studies. To facilitate active participation, we recommend that research with publics on HCAI consider methodological designs that expose participants to diverse information sources.
Affiliation(s)
- Emma Kellie Frost, Rebecca Bosward, Yves Saint James Aquino, Annette Braunack-Mayer, Stacy M Carter
- Australian Centre for Health Engagement, Evidence and Values, School of Health and Society, Faculty of the Arts, Social Sciences, and Humanities, University of Wollongong, Australia
3
Lin S, Ma Y, Jiang Y, Li W, Peng Y, Yu T, Xu Y, Zhu J, Lu L, Zou H. Service Quality and Residents' Preferences for Facilitated Self-Service Fundus Disease Screening: Cross-Sectional Study. J Med Internet Res 2024; 26:e45545. PMID: 38630535; PMCID: PMC11063888; DOI: 10.2196/45545.
Abstract
BACKGROUND Fundus photography is the most important examination in eye disease screening. A facilitated self-service eye screening pattern based on a fully automatic fundus camera was developed in 2022 in Shanghai, China; it may help solve the problem of insufficient human resources in primary health care institutions. However, the service quality of this new pattern and residents' preferences for it are unclear. OBJECTIVE This study aimed to compare service quality and residents' preferences between facilitated self-service eye screening and traditional manual screening, and to explore the relationships between screening service quality and residents' preferences. METHODS We conducted a cross-sectional study in Shanghai, China. Residents who underwent facilitated self-service fundus disease screening at one of the screening sites were assigned to the exposure group; those who were screened with a traditional fundus camera operated by an optometrist at an adjacent site comprised the control group. The primary outcome was screening service quality, including effectiveness (image quality and screening efficiency), physiological discomfort, safety, convenience, and trustworthiness. The secondary outcome was the participants' preferences. Differences in service quality and in participants' preferences between the 2 groups were compared separately using chi-square tests. Subgroup analyses exploring the relationships between screening service quality and residents' preferences were conducted using generalized logit models. RESULTS A total of 358 residents were enrolled; 176 (49.16%) were included in the exposure group and the remaining 182 (50.84%) in the control group. Residents' basic characteristics were balanced between the 2 groups. There was no significant difference in service quality between the 2 groups (image quality pass rate: P=.79; average screening time: P=.57; no physiological discomfort rate: P=.92; safety rate: P=.78; convenience rate: P=.95; trustworthiness rate: P=.20). However, the proportion of participants willing to use the same technology for their next screening was significantly lower in the exposure group than in the control group (P<.001). Subgroup analyses suggest that distrust in facilitated self-service eye screening might increase the probability of refusing screening (P=.02). CONCLUSIONS This study confirms that the facilitated self-service fundus disease screening pattern can achieve good service quality. However, it was difficult to reverse residents' preference for manual screening in a short period, especially when the original manual service was already excellent. The digital transformation of health care must therefore proceed cautiously, with attention paid to residents' individual needs. More efficient human-machine collaboration and personalized health management solutions based on large language models are both needed.
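The between-group comparisons in this abstract rest on chi-square tests of independence on 2x2 tables (group x outcome). A minimal pure-Python sketch of that computation, using hypothetical counts (not the study's data) with group sizes similar to those reported:

```python
# Pearson chi-square test of independence for a 2x2 contingency table,
# the kind of comparison used for the pass-rate outcomes above.
# All counts below are hypothetical, for illustration only.

def chi2_2x2(a, b, c, d):
    """Chi-square statistic for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    r1, r2 = a + b, c + d          # row totals (group sizes)
    c1, c2 = a + c, b + d          # column totals (outcome totals)
    stat = 0.0
    for obs, row, col in ((a, r1, c1), (b, r1, c2), (c, r2, c1), (d, r2, c2)):
        exp = row * col / n        # expected count under independence
        stat += (obs - exp) ** 2 / exp
    return stat

# e.g. image-quality pass/fail counts in two groups of roughly the study's sizes
stat = chi2_2x2(160, 16, 166, 16)
# With 1 degree of freedom, stat < 3.84 corresponds to P > .05,
# i.e. no significant between-group difference.
print(round(stat, 3))
```

With nearly identical pass rates in both groups the statistic stays far below the 3.84 critical value, which is the pattern behind the nonsignificant P values reported above.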
Affiliation(s)
- Senlin Lin, Yingyan Ma, Yajun Peng, Tao Yu, Yi Xu, Jianfeng Zhu, Lina Lu, Haidong Zou
- Shanghai Eye Diseases Prevention & Treatment Center/Shanghai Eye Hospital, School of Medicine, Tongji University, Shanghai, China; National Clinical Research Center for Eye Diseases, Shanghai, China; Shanghai Engineering Research Center of Precise Diagnosis and Treatment of Eye Diseases, Shanghai, China
- Yingyan Ma, Haidong Zou
- Shanghai General Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Yanwei Jiang
- Shanghai Hongkou Center for Disease Control and Prevention, Shanghai, China
- Wenwen Li
- School of Management, Fudan University, Shanghai, China
4
Evans RP, Bryant LD, Russell G, Absolom K. Trust and acceptability of data-driven clinical recommendations in everyday practice: A scoping review. Int J Med Inform 2024; 183:105342. PMID: 38266426; DOI: 10.1016/j.ijmedinf.2024.105342.
Abstract
BACKGROUND Increasing attention is being given to the analysis of large health datasets to derive new clinical decision support systems (CDSS). However, few data-driven CDSS are being adopted into clinical practice. Trust in these tools is believed to be fundamental for acceptance and uptake, but to date little attention has been given to defining or evaluating trust in clinical settings. OBJECTIVES A scoping review was conducted to explore how and where the acceptability and trustworthiness of data-driven CDSS have been assessed from the health professional's perspective. METHODS Medline, Embase, PsycInfo, Web of Science, Scopus, ACM Digital, IEEE Xplore and Google Scholar were searched in March 2022 using terms expanded from: "data-driven" AND "clinical decision support" AND "acceptability". Included studies focused on healthcare practitioner-facing data-driven CDSS relating directly to clinical care, and included trust or a proxy either as an outcome or in the discussion. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) was followed in the reporting of this review. RESULTS 3291 papers were screened, with 85 primary research studies eligible for inclusion. Studies covered a diverse range of clinical specialisms and intended contexts, but hypothetical systems (24) outnumbered those in clinical use (18). Twenty-five studies measured trust, via a wide variety of quantitative, qualitative and mixed methods. A further 24 discussed themes of trust without evaluating it explicitly; from these, transparency, explainability, and supporting evidence were identified as factors influencing healthcare practitioner trust in data-driven CDSS. CONCLUSION There is a growing body of research on data-driven CDSS, but few studies have explored stakeholder perceptions in depth, and focused research on trustworthiness remains limited. Further research on healthcare practitioner acceptance, including requirements for transparency and explainability, should inform clinical implementation.
Affiliation(s)
- Ruth P Evans
- University of Leeds, Woodhouse Lane, Leeds LS2 9JT, UK.
- Gregor Russell
- Bradford District Care Trust, Bradford, New Mill, Victoria Rd, BD18 3LD, UK.
- Kate Absolom
- University of Leeds, Woodhouse Lane, Leeds LS2 9JT, UK.
5
Hwang EJ, Jeong WG, David PM, Arentz M, Ruhwald M, Yoon SH. AI for Detection of Tuberculosis: Implications for Global Health. Radiol Artif Intell 2024; 6:e230327. PMID: 38197795; PMCID: PMC10982823; DOI: 10.1148/ryai.230327.
Abstract
Tuberculosis, which primarily affects developing countries, remains a significant global health concern. Since the 2010s, the role of chest radiography has expanded in tuberculosis triage and screening beyond its traditional complementary role in the diagnosis of tuberculosis. Computer-aided diagnosis (CAD) systems for tuberculosis detection on chest radiographs have recently made substantial progress in diagnostic performance, thanks to deep learning technologies. The current performance of CAD systems for tuberculosis has approximated that of human experts, presenting a potential solution to the shortage of human readers to interpret chest radiographs in low- or middle-income, high-tuberculosis-burden countries. This article provides a critical appraisal of developmental process reporting in extant CAD software for tuberculosis, based on the Checklist for Artificial Intelligence in Medical Imaging. It also explores several considerations to scale up CAD solutions, encompassing manufacturer-independent CAD validation, economic and political aspects, and ethical concerns, as well as the potential for broadening radiography-based diagnosis to other nontuberculosis diseases. Collectively, CAD for tuberculosis will emerge as a representative deep learning application, catalyzing advances in global health and health equity. Keywords: Computer-aided Diagnosis (CAD), Conventional Radiography, Thorax, Lung, Machine Learning Supplemental material is available for this article. © RSNA, 2024.
Affiliation(s)
- Eui Jin Hwang, Won Gi Jeong, Pierre-Marie David, Matthew Arentz, Morten Ruhwald, Soon Ho Yoon
- From the Department of Radiology, Seoul National University Hospital and Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea (E.J.H., S.H.Y.); Department of Radiology, Chonnam National University Hwasun Hospital, Hwasun, Korea (W.G.J.); Faculty of Pharmacy, University of Montréal, Montréal, Canada (P.M.D.); OBVIA–Observatoire sur les Impacts Sociétaux de l'IA et du Numérique, Québec, Canada (P.M.D.); and FIND–The Global Alliance for Diagnostics, Geneva, Switzerland (M.A., M.R.)
6
Sageshima J, Than P, Goussous N, Mineyev N, Perez R. Prediction of High-Risk Donors for Kidney Discard and Nonrecovery Using Structured Donor Characteristics and Unstructured Donor Narratives. JAMA Surg 2024; 159:60-68. PMID: 37910090; PMCID: PMC10620675; DOI: 10.1001/jamasurg.2023.4679.
Abstract
Importance Despite the unmet need, many deceased-donor kidneys are discarded or not recovered. Inefficient allocation and prolonged ischemia time are contributing factors, and early detection of high-risk donors may reduce organ loss. Objective To evaluate the feasibility of machine learning (ML) and natural language processing (NLP) classification of donors whose kidneys are used vs not used for organ transplant. Design, Setting, and Participants This retrospective cohort study used donor information (structured donor characteristics and unstructured donor narratives) from the United Network for Organ Sharing (UNOS). All donor offers to a single transplant center between January 2015 and December 2020 were used to train and validate ML models to predict donors who had at least 1 kidney transplanted (at our center or another center). Donor data from 2021 were used to test each model. Exposures Donor information was provided by UNOS to the transplant centers with potential transplant candidates. Each center evaluated the donor and decided within an allotted time whether to accept the kidney for organ transplant. Main Outcomes and Measures Outcome metrics on the test cohort included area under the receiver operating characteristic curve (AUROC), F1 score, accuracy, precision, and recall for each ML classifier. Feature importance and Shapley additive explanation (SHAP) summaries were assessed for model explainability. Results The training/validation cohort included 9555 donors (median [IQR] age, 50 [36-58] years; 5571 male [58.3%]), and the test cohort included 2481 donors (median [IQR] age, 52 [40-59] years; 1496 male [60.3%]). Only 20% to 30% of potential donors had at least 1 kidney transplanted. The ML model with a single variable (Kidney Donor Profile Index) showed an AUROC of 0.69, F1 score of 0.42, and accuracy of 0.64. Multivariable ML models based on basic a priori structured donor data showed similar metrics (logistic regression: AUROC = 0.70; F1 score = 0.42; accuracy = 0.62; random forest classifier: AUROC = 0.69; F1 score = 0.42; accuracy = 0.64). The classic NLP model (bag-of-words) achieved its best metrics (AUROC = 0.60; F1 score = 0.35; accuracy = 0.59) with the logistic regression classifier. The more advanced Bidirectional Encoder Representations from Transformers (BERT) model showed comparable metrics (AUROC = 0.62; F1 score = 0.39; accuracy = 0.69) only after appending basic donor information. Feature importance and SHAP identified the variables (and words) that affected the models most. Conclusions and Relevance Results of this cohort study suggest that ML models can be applied to predict donors with high-risk kidneys not used for organ transplant, but the models still need further elaboration. The use of unstructured data is likely to expand the possibilities; further exploration of new approaches will be necessary to develop models with better predictive metrics.
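The abstract above scores each classifier with accuracy, precision, recall, and F1, all of which follow directly from the four cells of a confusion matrix. A short sketch with hypothetical counts (not the study's data) also illustrates why F1 can sit well below accuracy when, as in this cohort, only a minority of cases are positive:

```python
# Standard classification metrics from confusion-matrix counts:
# tp/fp/fn/tn = true positives, false positives, false negatives,
# true negatives. Counts are hypothetical, for illustration only.

def classification_metrics(tp, fp, fn, tn):
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0       # a.k.a. sensitivity
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)             # harmonic mean
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}

# An imbalanced test set (25% positives), loosely mirroring the cohort
# where only 20%-30% of donors had a kidney transplanted:
m = classification_metrics(tp=150, fp=250, fn=350, tn=1250)
print({k: round(v, 2) for k, v in m.items()})
# accuracy comes out at 0.70 while f1 is only 0.33
```

Because F1 ignores true negatives, a model that mostly predicts the majority (negative) class can keep accuracy respectable while F1 stays low, much like the 0.6-0.7 accuracies vs ~0.4 F1 scores reported above.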
Affiliation(s)
- Peter Than, Naeem Goussous, Neal Mineyev, Richard Perez
- Department of Surgery, University of California, Davis Health, Sacramento
7
Holm S, Ploug T. Population preferences for AI system features across eight different decision-making contexts. PLoS One 2023; 18:e0295277. PMID: 38039320; PMCID: PMC10691677; DOI: 10.1371/journal.pone.0295277.
Abstract
Artificial intelligence systems based on deep learning architectures are being investigated as decision-support systems for human decision-makers across a wide range of decision-making contexts. It is known from the literature on AI in medicine that patients and the public hold relatively strong preferences in relation to desirable features of AI systems and their implementation, e.g. in relation to explainability and accuracy, and in relation to the role of the human decision-maker in the decision chain. The features that are preferred can be seen as 'protective' of the patient's interests. These types of preferences may plausibly vary across decision-making contexts, but the research on this question has so far been almost exclusively performed in relation to medical AI. In this cross-sectional survey study we investigate the preferences of the adult Danish population for five specific protective features of AI systems and implementation across a range of eight different use cases in the public and commercial sectors ranging from medical diagnostics to the issuance of parking tickets. We find that all five features are seen as important across all eight contexts, but that they are deemed to be slightly less important when the implications of the decision made are less significant to the respondents.
Affiliation(s)
- Søren Holm
- Centre for Social Ethics and Policy, School of Law, University of Manchester, Manchester, United Kingdom
- Faculty of Medicine, Centre for Medical Ethics, HELSAM, University of Oslo, Oslo, Norway
- Thomas Ploug
- Department of Communication and Psychology, Centre of Applied Ethics and Philosophy of Science, Aalborg University, Copenhagen, Denmark
8
Aziz D, Maganti K, Yanamala N, Sengupta P. The Role of Artificial Intelligence in Echocardiography: A Clinical Update. Curr Cardiol Rep 2023; 25:1897-1907. PMID: 38091196; DOI: 10.1007/s11886-023-02005-2.
Abstract
PURPOSE OF REVIEW In echocardiography, there has been robust development of artificial intelligence (AI) tools for image recognition, automated measurements, image segmentation, and patient prognostication, creating a monumental shift in the study of AI and machine learning models. However, integrating these measurements into complex disease recognition and therapeutic interventions remains challenging. While the tools have been developed, there is a lack of evidence regarding the implementation of heterogeneous systems for guiding clinical decision-making and therapeutic action. RECENT FINDINGS Newer AI modalities have shown concrete positive data in terms of user-guided image acquisition and processing, precise determination of both basic and advanced quantitative echocardiographic features, and the potential to construct predictive models, all with the possibility of seamless integration into clinical decision support systems. AI in echocardiography is a powerful and ever-growing tool with the potential for revolutionary effects on the practice of cardiology. In this review article, we explore the growth of AI and its applications in echocardiography, along with clinical implications and the associated regulatory, legal, and ethical considerations.
Affiliation(s)
- Daniel Aziz
- Department of Internal Medicine, Rutgers - Robert Wood Johnson Medical School, New Brunswick, NJ, USA
- Kameswari Maganti, Naveena Yanamala, Partho Sengupta
- Division of Cardiology, Rutgers - Robert Wood Johnson Medical School & University Hospital, 1 Robert Wood Johnson Place, New Brunswick, NJ, 08901, USA
9
Gould DJ, Dowsey MM, Glanville-Hearst M, Spelman T, Bailey JA, Choong PFM, Bunzli S. Patients' Views on AI for Risk Prediction in Shared Decision-Making for Knee Replacement Surgery: Qualitative Interview Study. J Med Internet Res 2023; 25:e43632. PMID: 37721797; PMCID: PMC10546266; DOI: 10.2196/43632.
Abstract
BACKGROUND The use of artificial intelligence (AI) in decision-making around knee replacement surgery is increasing, and this technology holds promise to improve the prediction of patient outcomes. Ambiguity surrounds the definition of AI, and there are mixed views on its application in clinical settings. OBJECTIVE In this study, we aimed to explore the understanding and attitudes of patients who underwent knee replacement surgery regarding AI in the context of risk prediction for shared clinical decision-making. METHODS This qualitative study involved patients who underwent knee replacement surgery at a tertiary referral center for joint replacement surgery. The participants were selected based on their age and sex. Semistructured interviews explored the participants' understanding of AI and their opinions on its use in shared clinical decision-making. Data collection and reflexive thematic analyses were conducted concurrently. Recruitment continued until thematic saturation was achieved. RESULTS Thematic saturation was achieved with 19 interviews and confirmed with 1 additional interview, resulting in 20 participants being interviewed (female participants: n=11, 55%; male participants: n=9, 45%; median age: 66 years). A total of 11 (55%) participants had a substantial postoperative complication. Three themes captured the participants' understanding of AI and their perceptions of its use in shared clinical decision-making. The theme Expectations captured the participants' views of themselves as individuals with the right to self-determination as they sought therapeutic solutions tailored to their circumstances, needs, and desires, including whether to use AI at all. The theme Empowerment highlighted the potential of AI to enable patients to develop realistic expectations and equip them with personalized risk information to discuss in shared decision-making conversations with the surgeon. The theme Partnership captured the importance of symbiosis between AI and clinicians because AI has varied levels of interpretability and understanding of human emotions and empathy. CONCLUSIONS Patients who underwent knee replacement surgery in this study had varied levels of familiarity with AI and diverse conceptualizations of its definitions and capabilities. Educating patients about AI through nontechnical explanations and illustrative scenarios could help inform their decision to use it for risk prediction in the shared decision-making process with their surgeon. These findings could be used in the process of developing a questionnaire to ascertain the views of patients undergoing knee replacement surgery on the acceptability of AI in shared clinical decision-making. Future work could investigate the accuracy of this patient group's understanding of AI, beyond their familiarity with it, and how this influences their acceptance of its use. Surgeons may play a key role in finding a place for AI in the clinical setting as the uptake of this technology in health care continues to grow.
Affiliation(s)
- Daniel J Gould: St Vincent's Hospital, Department of Surgery, University of Melbourne, Melbourne, Australia
- Michelle M Dowsey: St Vincent's Hospital, Department of Surgery, University of Melbourne, Melbourne, Australia; Department of Orthopaedics, St Vincent's Hospital Melbourne, Melbourne, Australia
- Tim Spelman: St Vincent's Hospital, Department of Surgery, University of Melbourne, Melbourne, Australia
- James A Bailey: School of Computing and Information Systems, University of Melbourne, Melbourne, Australia
- Peter F M Choong: St Vincent's Hospital, Department of Surgery, University of Melbourne, Melbourne, Australia; Department of Orthopaedics, St Vincent's Hospital Melbourne, Melbourne, Australia
- Samantha Bunzli: School of Health Sciences and Social Work, Griffith University, Brisbane, Australia
10
Wang H, Wu W, Dou Z, He L, Yang L. Performance and exploration of ChatGPT in medical examination, records and education in Chinese: Pave the way for medical AI. Int J Med Inform 2023; 177:105173. [PMID: 37549499 DOI: 10.1016/j.ijmedinf.2023.105173]
Abstract
BACKGROUND Although chat generative pre-trained transformer (ChatGPT) has made several successful attempts in the medical field, most notably in answering medical questions in English, no studies have evaluated ChatGPT's performance on a medical task in a Chinese context. OBJECTIVE The aim of this study was to evaluate ChatGPT's ability to understand medical knowledge in Chinese, as well as its potential to serve as an electronic health infrastructure for medical development, by evaluating its performance in medical examinations, records, and education. METHOD The Chinese (CNMLE) and English (ENMLE) datasets of the China National Medical Licensing Examination and the Chinese dataset (NEEPM) of the China National Entrance Examination for Postgraduate Clinical Medicine Comprehensive Ability were used to evaluate the performance of ChatGPT (GPT-3.5 and GPT-4). We assessed answer accuracy, verbal fluency, and the classification of incorrect responses caused by hallucinations across multiple runs. In addition, we tested ChatGPT's performance on discharge summaries and group learning in a Chinese context on a small scale. RESULTS The accuracy of GPT-3.5 in CNMLE, ENMLE, and NEEPM was 56% (56/100), 76% (76/100), and 62% (62/100), respectively, compared to that of GPT-4, which was 84% (84/100), 86% (86/100), and 82% (82/100). The verbal fluency of all the ChatGPT responses exceeded 95%. Among the GPT-3.5 incorrect responses, the proportions of open-domain hallucinations were 66% (29/44), 54% (14/24), and 63% (24/38), whereas closed-domain hallucinations accounted for 34% (15/44), 46% (14/24), and 37% (14/38), respectively. By contrast, GPT-4 open-domain hallucinations accounted for 56% (9/16), 43% (6/14), and 83% (15/18), while closed-domain hallucinations accounted for 44% (7/16), 57% (8/14), and 17% (3/18), respectively.
In the discharge summary task, ChatGPT demonstrated logical coherence; however, GPT-3.5 could not fulfill the quality requirements, while GPT-4 met them in 60% (6/10) of cases. In group learning, the verbal fluency of, and interaction satisfaction with, ChatGPT were 100% (10/10). CONCLUSION ChatGPT based on GPT-4 is on par with Chinese medical practitioners who passed the CNMLE and meets the standard required for admission to clinical medical graduate programs in China. GPT-4 shows promising potential for discharge summarization and group learning. Additionally, it shows high verbal fluency, resulting in a positive human-computer interaction experience. GPT-4 significantly improves multiple capabilities and reduces hallucinations compared with the previous GPT-3.5 model, with a particular leap forward in Chinese comprehension of medical tasks. Artificial intelligence (AI) systems still face the challenges of hallucinations, legal risks, and ethical issues. Nevertheless, we discovered ChatGPT's potential to promote medical development as an electronic health infrastructure, paving the way for medical AI.
Affiliation(s)
- Hongyan Wang: Department of Pain Management, Xuanwu Hospital, Capital Medical University
- WeiZhen Wu: Department of Anesthesia, China-Japan Union Hospital of Jilin University
- Zhi Dou: Department of Pain Management, Xuanwu Hospital, Capital Medical University
- Liangliang He: Department of Pain Management, Xuanwu Hospital, Capital Medical University
- Liqiang Yang: Department of Pain Management, Xuanwu Hospital, Capital Medical University
11
Borondy Kitts A. Patient Perspectives on Artificial Intelligence in Radiology. J Am Coll Radiol 2023; 20:863-867. [PMID: 37453601 DOI: 10.1016/j.jacr.2023.05.017]
Abstract
There are two major areas for patient engagement in radiology artificial intelligence (AI): the sharing of data for AI development, and the use of AI in patient care. In general, individuals support sharing deidentified data if it is used for the common good, to help others with similar health conditions, or for research. However, there is concern about privacy risks, including reidentification and use for purposes other than those intended. Lack of trust is cited as a barrier to data sharing, and individuals want to be involved in the data-sharing process. In the use of AI in medical care, patients generally support AI as an assist to the radiologist but lack trust in unsupervised AI. Patients worry about liability in case of bad outcomes, are concerned about loss of the human connection and of empathy during a vulnerable time in their lives, and express concern about the risk of discrimination due to bias in AI algorithms. Building trust in AI requires transparency, explainability, security, and privacy protection. Radiologists can take action to prepare their patients to become more trusting of AI. Developing and implementing data-sharing agreements allows patients to voluntarily help in the algorithm development process. Developing AI disclosure guidelines and discussing AI use with patients will help them understand the role of AI in their care. As the use of AI increases, there is an opportunity for radiologists to develop and maintain close relationships with their patients and to become more involved in their care.
12
Herbert P, Hou K, Bradley C, Hager G, Boland MV, Ramulu P, Unberath M, Yohannan J. Forecasting Risk of Future Rapid Glaucoma Worsening Using Early Visual Field, OCT, and Clinical Data. Ophthalmol Glaucoma 2023; 6:466-473. [PMID: 36944385 PMCID: PMC10509314 DOI: 10.1016/j.ogla.2023.03.005]
Abstract
PURPOSE To assess whether we can forecast future rapid visual field (VF) worsening using deep learning models (DLMs) trained on early VF, OCT, and clinical data. DESIGN A retrospective cohort study. SUBJECTS In total, 4536 eyes from 2962 patients. Overall, 263 (5.80%) eyes underwent rapid VF worsening (mean deviation slope less than -1 dB/year across all VFs). METHODS We included eyes that met the following criteria: (1) followed for glaucoma or suspect status; (2) had at least 5 longitudinal reliable VFs (VF1, VF2, VF3, VF4, and VF5); and (3) had 1 reliable baseline OCT scan (OCT1) and 1 set of baseline clinical measurements (clinical1) at the time of VF1. We designed a DLM to forecast future rapid VF worsening. The input consisted of spatially oriented total deviation values from VF1 (including or not including VF2 and VF3 in some models) and retinal nerve fiber layer thickness values from the baseline OCT. We passed this VF/OCT stack into a vision transformer feature extractor, the output of which was concatenated with baseline clinical data before putting it through a linear classifier to predict the eye's risk of rapid VF worsening across the 5 VFs. We compared the performance of models with differing inputs by computing area under the curve (AUC) in the test set. Specifically, we trained models with the following inputs: (1) model V: VF1; (2) VC: VF1+ Clinical1; (3) VO: VF1+ OCT1; (4) VOC: VF1+ Clinical1+ OCT1; (5) V2: VF1 + VF2; (6) V2OC: VF1 + VF2 + Clinical1 + OCT1; (7) V3: VF1 + VF2 + VF3; and (8) V3OC: VF1 + VF2 + VF3 + Clinical1 + OCT1. MAIN OUTCOME MEASURES The AUC of DLMs when forecasting rapidly worsening eyes. RESULTS Model V3OC best forecasted rapid worsening with an AUC (95% confidence interval [CI]) of 0.87 (0.77-0.97). 
Remaining models in descending order of performance and their respective AUC (95% CI) were as follows: (1) model V3 (0.84 [0.74-0.95]), (2) model V2OC (0.81 [0.70-0.92]), (3) model V2 (0.81 [0.70-0.82]), (4) model VOC (0.77 [0.65-0.88]), (5) model VO (0.75 [0.64-0.88]), (6) model VC (0.75 [0.63-0.87]), and (7) model V (0.74 [0.62-0.86]). CONCLUSIONS Deep learning models can forecast future rapid glaucoma worsening with modest to high performance when trained using data from early in the disease course. Including baseline data from multiple modalities and subsequent visits improves performance beyond using VF data alone. FINANCIAL DISCLOSURE(S) Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
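The fusion strategy this abstract describes (image-like inputs through a feature extractor, the resulting embedding concatenated with baseline clinical covariates, then a linear classifier producing a risk score) can be sketched in a few lines. This is an illustrative toy under stated assumptions, not the authors' implementation: the feature extractor here is a simple stub standing in for their vision transformer, and all values and weights are made up.

```python
import math

def extract_features(vf_oct_stack):
    # Stand-in for the vision transformer feature extractor:
    # reduce each input channel (VF total deviation map, RNFL
    # thickness map) to its mean value.
    return [sum(channel) / len(channel) for channel in vf_oct_stack]

def predict_risk(vf_oct_stack, clinical, weights, bias):
    # Concatenate image-derived features with baseline clinical data,
    # then apply a linear classifier with a sigmoid to get a risk score.
    features = extract_features(vf_oct_stack) + clinical
    logit = sum(w * x for w, x in zip(weights, features)) + bias
    return 1.0 / (1.0 + math.exp(-logit))  # risk in (0, 1)

# Toy example: two "channels" (VF deviations, RNFL thickness) plus two
# hypothetical clinical covariates (e.g. age, a treatment flag).
stack = [[-2.0, -4.0, -1.0], [80.0, 75.0, 90.0]]
clinical = [65.0, 1.0]
risk = predict_risk(stack, clinical, weights=[-0.5, 0.01, 0.02, 0.3], bias=-1.0)
print(round(risk, 3))
```

The design choice the paper's results speak to is visible even in this sketch: adding more inputs (extra VF channels, clinical covariates) simply lengthens the concatenated feature vector, which is why the richer models (V3OC) could outperform VF-only ones.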
Affiliation(s)
- Patrick Herbert: Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland
- Kaihua Hou: Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland
- Chris Bradley: Wilmer Eye Institute, Johns Hopkins University, Baltimore, Maryland
- Greg Hager: Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland
- Michael V Boland: Massachusetts Eye and Ear Infirmary, Harvard Medical School, Boston, Massachusetts
- Pradeep Ramulu: Wilmer Eye Institute, Johns Hopkins University, Baltimore, Maryland
- Mathias Unberath: Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland
- Jithin Yohannan: Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Maryland; Wilmer Eye Institute, Johns Hopkins University, Baltimore, Maryland
13
Hong S, Hwang EJ, Kim S, Song J, Lee T, Jo GD, Choi Y, Park CM, Goo JM. Methods of Visualizing the Results of an Artificial-Intelligence-Based Computer-Aided Detection System for Chest Radiographs: Effect on the Diagnostic Performance of Radiologists. Diagnostics (Basel) 2023; 13:1089. [PMID: 36980397 PMCID: PMC10046978 DOI: 10.3390/diagnostics13061089]
Abstract
It is unclear whether the methods used to visualize the results of artificial-intelligence-based computer-aided detection (AI-CAD) on chest radiographs influence the accuracy of readers' interpretations. We aimed to evaluate the accuracy of radiologists' interpretations of chest radiographs using different visualization methods for the same AI-CAD. Initial chest radiographs of patients with acute respiratory symptoms were retrospectively collected. A commercialized AI-CAD was applied using three different visualization methods: (a) the closed-line method, (b) the heat map method, and (c) a combined method. A reader test was conducted with five trainee radiologists over three interpretation sessions. In each session, the chest radiographs were interpreted using AI-CAD with one of the three visualization methods, in random order. Examination-level sensitivity and accuracy, and lesion-level detection rates, for clinically significant abnormalities were evaluated for the three visualization methods. The sensitivity (p = 0.007) and accuracy (p = 0.037) of the combined method were significantly higher than those of the closed-line method. Detection rates using the heat map method (p = 0.043) and the combined method (p = 0.004) were significantly higher than those using the closed-line method. The method of visualizing AI-CAD results for chest radiographs influenced the performance of radiologists' interpretations: combining the closed-line and heat map methods led to the highest sensitivity and accuracy.
Affiliation(s)
- Sungho Hong: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea
- Eui Jin Hwang: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea; Department of Radiology, Seoul National University College of Medicine, Seoul 03082, Republic of Korea (corresponding author; Tel.: +82-2-2072-2057)
- Soojin Kim: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea
- Jiyoung Song: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea
- Taehee Lee: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea
- Gyeong Deok Jo: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea
- Yelim Choi: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea
- Chang Min Park: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea; Department of Radiology, Seoul National University College of Medicine, Seoul 03082, Republic of Korea; Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul 03082, Republic of Korea
- Jin Mo Goo: Department of Radiology, Seoul National University Hospital, Seoul 03082, Republic of Korea; Department of Radiology, Seoul National University College of Medicine, Seoul 03082, Republic of Korea; Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul 03082, Republic of Korea
14
Calvas P. Chapitre 7. Un regard de généticien [Chapter 7. A geneticist's perspective]. Journal International de Bioéthique et d'Éthique des Sciences 2023; 34:111-120. [PMID: 37684198 DOI: 10.3917/jibes.342.0111]
Abstract
Examined through the eyes of a geneticist, the modifications to the bioethics law appear relatively modest with regard to the supervision of the discipline's practices. The introduction of rules governing the use of algorithms in medical practice is the truly new point. It seemed beneficial to take into account "the interference of thinking machines" in medical decision-making and to sketch the outlines of a framework; we discuss the proposals and their terms. Clarifications to the obligation to inform relatives of the existence of a genetic anomaly are framed around the concept of solidarity. Without neglecting the latter, we recall other determinants, the complexity of and issues underlying the delivery of predictive genetic information, and the risks that informed persons may incur. It also seems appropriate to consider the ethical tensions that may weigh on the physicians involved in the mandatory information process.
15
Tang L, Li J, Fantus S. Medical artificial intelligence ethics: A systematic review of empirical studies. Digit Health 2023; 9:20552076231186064. [PMID: 37434728 PMCID: PMC10331228 DOI: 10.1177/20552076231186064]
Abstract
Background Artificial intelligence (AI) technologies are transforming medicine and healthcare. Scholars and practitioners have debated the philosophical, ethical, legal, and regulatory implications of medical AI, and empirical research on stakeholders' knowledge, attitudes, and practices has started to emerge. This study is a systematic review of published empirical studies of medical AI ethics, with the goal of mapping the main approaches, findings, and limitations of this scholarship to inform future practice. Methods We searched seven databases for published peer-reviewed empirical studies on medical AI ethics and evaluated them in terms of the types of technologies studied, geographic locations, stakeholders involved, research methods used, ethical principles studied, and major findings. Findings Thirty-six studies were included (published 2013-2022). They typically belonged to one of three topics: exploratory studies of stakeholder knowledge of and attitudes toward medical AI, theory-building studies testing hypotheses about factors contributing to stakeholders' acceptance of medical AI, and studies identifying and correcting bias in medical AI. Interpretation There is a disconnect between the high-level ethical principles and guidelines developed by ethicists and empirical research on the topic, and a need to embed ethicists, in tandem with AI developers, clinicians, patients, and scholars of innovation and technology adoption, in the study of medical AI ethics.
Affiliation(s)
- Lu Tang: Department of Communication and Journalism, Texas A&M University, College Station, TX, USA
- Jinxu Li: Department of Communication and Journalism, Texas A&M University, College Station, TX, USA
- Sophia Fantus: School of Social Work, University of Texas at Arlington, Arlington, TX, USA
16
Wellnhofer E. Real-World and Regulatory Perspectives of Artificial Intelligence in Cardiovascular Imaging. Front Cardiovasc Med 2022; 9:890809. [PMID: 35935648 PMCID: PMC9354141 DOI: 10.3389/fcvm.2022.890809]
Abstract
Recent progress in digital health data recording, advances in computing power, and methodological approaches that extract information from data, such as artificial intelligence, are expected to have a disruptive impact on technology in medicine. One of the potential benefits is the ability to extract new and essential insights from the vast amount of data generated during health care delivery every day. Cardiovascular imaging is boosted by new intelligent automatic methods to manage, process, segment, and analyze petabytes of image data, exceeding historical manual capacities. Algorithms that learn from data raise new challenges for regulatory bodies: partially autonomous behavior, adaptive modifications, and a lack of transparency in deriving evidence from complex data pose considerable problems. Controlling new technologies requires new control techniques and ongoing regulatory research, and all stakeholders must participate in the quest for a fair balance between innovation and regulation. The regulatory approach to artificial intelligence must be risk-based and resilient. A focus on unknown emerging risks demands continuous surveillance and clinical evaluation during the total product life cycle. Since learning algorithms are data-driven, high-quality data is fundamental to good machine learning practice. Mining, processing, validation, governance, and data control must account for bias, error, inappropriate use, drifts, and shifts, particularly in real-world data. Regulators worldwide are tackling the twenty-first-century challenges raised by "learning" medical devices. Ethical concerns and regulatory approaches are presented, and the paper concludes with a discussion of the future of responsible artificial intelligence.
17
Fritsch SJ, Blankenheim A, Wahl A, Hetfeld P, Maassen O, Deffge S, Kunze J, Rossaint R, Riedel M, Marx G, Bickenbach J. Attitudes and perception of artificial intelligence in healthcare: A cross-sectional survey among patients. Digit Health 2022; 8:20552076221116772. [PMID: 35983102 PMCID: PMC9380417 DOI: 10.1177/20552076221116772]
Abstract
Objective Attitudes about the usage of artificial intelligence in healthcare are controversial. Unlike the perceptions of healthcare professionals, the attitudes of patients and their companions have received less attention so far. In this study, we aimed to investigate the perception of artificial intelligence in healthcare among this highly relevant group, along with the influence of digital affinity and sociodemographic factors. Methods We conducted a cross-sectional study using a paper-based questionnaire with patients and their companions at a German tertiary referral hospital from December 2019 to February 2020. The questionnaire consisted of three sections examining (a) the respondents' technical affinity, (b) their perception of different aspects of artificial intelligence in healthcare, and (c) sociodemographic characteristics. Results Of a total of 452 participants, more than 90% had already read or heard about artificial intelligence, but only 24% reported good or expert knowledge. Asked about their general perception, 53.18% of the respondents rated the use of artificial intelligence in medicine as positive or very positive, and only 4.77% as negative or very negative. The respondents denied concerns about artificial intelligence but strongly agreed that artificial intelligence must be controlled by a physician. Older patients, women, and persons with lower education and lower technical affinity were more cautious about healthcare-related artificial intelligence usage. Conclusions German patients and their companions are open towards the usage of artificial intelligence in healthcare. Although showing only mediocre knowledge about artificial intelligence, a majority rated artificial intelligence in healthcare as positive. In particular, patients insist that a physician supervise the artificial intelligence and keep ultimate responsibility for diagnosis and therapy.
Affiliation(s)
- Sebastian J Fritsch: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany; SMITH Consortium of the German Medical Informatics Initiative, Germany; Juelich Supercomputing Centre, Forschungszentrum Juelich, Germany
- Andrea Blankenheim: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- Alina Wahl: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany
- Petra Hetfeld: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany; SMITH Consortium of the German Medical Informatics Initiative, Germany
- Oliver Maassen: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany; SMITH Consortium of the German Medical Informatics Initiative, Germany
- Saskia Deffge: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany; SMITH Consortium of the German Medical Informatics Initiative, Germany
- Julian Kunze: SMITH Consortium of the German Medical Informatics Initiative, Germany; Department of Anesthesiology, University Hospital RWTH Aachen, Germany
- Rolf Rossaint: Department of Anesthesiology, University Hospital RWTH Aachen, Germany
- Morris Riedel: SMITH Consortium of the German Medical Informatics Initiative, Germany; Juelich Supercomputing Centre, Forschungszentrum Juelich, Germany; Faculty of Industrial Engineering, Mechanical Engineering and Computer Science, University of Iceland, Iceland
- Gernot Marx: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany; SMITH Consortium of the German Medical Informatics Initiative, Germany
- Johannes Bickenbach: Department of Intensive Care Medicine, University Hospital RWTH Aachen, Germany; SMITH Consortium of the German Medical Informatics Initiative, Germany