1
Ch'en PY, Day W, Pekson RC, Barrientos J, Burton WB, Ludwig AB, Jariwala SP, Cassese T. GPT-4 generated answer rationales to multiple choice assessment questions in undergraduate medical education. BMC Medical Education 2025; 25:333. [PMID: 40038669] [PMCID: PMC11877964] [DOI: 10.1186/s12909-025-06862-z]
Abstract
BACKGROUND Pre-clerkship medical students benefit from practice questions that provide rationales for answer choices. Creating these rationales is time-intensive, so not all practice multiple choice questions (MCQs) have corresponding explanations to aid learning. The authors examined the potential of artificial intelligence (AI) to create high-quality answer rationales for clinical vignette-style MCQs.
METHODS The authors conducted a single-center pre-post intervention survey study in August 2023 assessing the attitudes of 8 pre-clerkship course directors (CDs) towards GPT-4 generated answer rationales for clinical vignette-style MCQs. Ten MCQs from each course's question bank were selected and input into GPT-4 with instructions to select the best answer and generate rationales for each answer choice. CDs were provided their unmodified GPT-4 interactions to assess the accuracy, clarity, appropriateness, and likelihood of implementation of the rationales. CDs were also asked about time spent reviewing and making necessary modifications, satisfaction, and receptiveness to using GPT-4 for this purpose.
RESULTS GPT-4 correctly answered 75/80 (93.8%) questions on the first attempt. CDs were receptive to using GPT-4 for rationale generation, and all were satisfied with the generated rationales. CDs rated the majority of rationales as very accurate (77.5%), very clear (83.8%), and very appropriate (93.8%). Most rationales could be implemented with little or no modification (88.3%). All CDs would implement AI-generated answer rationales with CD editorial input. Most CDs (75%) took ≤ 4 min to review a set of generated rationales for a question.
CONCLUSION GPT-4 is an acceptable and feasible tool for generating accurate, clear, and appropriate answer rationales for MCQs in medical education. Future studies should examine students' feedback on generated rationales and further explore generating rationales for questions with media. The authors plan to explore implementing this technological application at their medical school, including the logistics and training needed to create a streamlined process that benefits both learners and educators.
CLINICAL TRIAL Not applicable; not a clinical trial.
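The methods describe inputting each MCQ into GPT-4 with an instruction to select the best answer and write a rationale for every answer choice. The study's actual prompt and interface are not reported; the following is a minimal Python sketch of that kind of request using the OpenAI chat completions API, with a hypothetical prompt template and example question.

```python
# Minimal sketch of prompting an LLM to pick the best answer and write per-option
# rationales for a clinical vignette MCQ. The prompt wording, model name, and the
# example question are illustrative assumptions, not the study's materials.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

PROMPT_TEMPLATE = (
    "You are writing answer explanations for a pre-clerkship medical school practice exam.\n"
    "Select the single best answer to the question below, then give a brief rationale for\n"
    "why each answer choice is correct or incorrect.\n\n"
    "Question:\n{stem}\n\nAnswer choices:\n{choices}"
)

def generate_rationales(stem: str, choices: dict[str, str], model: str = "gpt-4") -> str:
    """Return the model's chosen answer plus a rationale for each option."""
    choice_text = "\n".join(f"{label}. {text}" for label, text in choices.items())
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user",
                   "content": PROMPT_TEMPLATE.format(stem=stem, choices=choice_text)}],
    )
    return response.choices[0].message.content

# Hypothetical usage; course directors would still review and edit the output.
print(generate_rationales(
    stem="A 62-year-old man presents with crushing substernal chest pain ...",
    choices={"A": "Aortic dissection", "B": "Acute myocardial infarction",
             "C": "Pulmonary embolism", "D": "Pericarditis"},
))
```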
Affiliation(s)
- Wesley Day
- Albert Einstein College of Medicine, Bronx, NY, USA
- Todd Cassese
- Albert Einstein College of Medicine, Bronx, NY, USA.
- Department of Medicine, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, USA.
2
Huang G, Zhang H, Zeng J, Chen W. A study of the effect of question feedback types on learning engagement in panoramic videos. Front Psychol 2025; 16:1321712. [PMID: 40083766] [PMCID: PMC11903445] [DOI: 10.3389/fpsyg.2025.1321712]
Abstract
Introduction The immersive and interactive nature of panoramic video gives learners experiences that closely approximate the real environment and encourages the use of imagination in knowledge acquisition. Studies have shown that embedding question feedback in traditional educational videos can effectively improve learning. However, little research has examined embedding question feedback in panoramic videos to determine which types of feedback improve the dimensions of learners' learning engagement and yield better learning experiences and outcomes.
Methods This study embedded questions with feedback within panoramic videos, categorizing feedback into two types: simple feedback and elaborated feedback. Using eye tracking, brainwave monitoring, and subjective questionnaires as measurement tools, the study investigated which type of question feedback embedded in panoramic videos improved the various dimensions of learner engagement and academic performance. Participants (n = 91) were randomly assigned to an experimental group (simple feedback or elaborated feedback) or a control group (no feedback).
Results (1) The experimental groups showed significantly higher cognitive, behavioral, and emotional engagement than the control group. More precise feedback was associated with greater behavioral engagement; however, feedback precision did not significantly affect cognitive or emotional engagement. (2) More detailed feedback was associated with better academic performance.
Discussion The findings of this study can support strategic recommendations for the design and application of panoramic videos.
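The abstract does not state which statistical test was used to compare the three conditions (no feedback, simple feedback, elaborated feedback). Purely as an illustration of how such a three-group comparison could be run on an engagement measure, here is a short Python sketch using a one-way ANOVA on hypothetical scores; the data, group sizes, and choice of test are assumptions.

```python
# Illustrative one-way ANOVA across the three feedback conditions (control, simple,
# elaborated). The scores and the choice of test are assumptions for demonstration only.
import numpy as np
from scipy.stats import f_oneway

rng = np.random.default_rng(1)
control = rng.normal(3.2, 0.6, 30)      # hypothetical engagement scores, no feedback
simple = rng.normal(3.6, 0.6, 30)       # simple feedback
elaborated = rng.normal(3.9, 0.6, 31)   # elaborated feedback (n = 91 in total)

f_stat, p_value = f_oneway(control, simple, elaborated)
print(f"F = {f_stat:.2f}, p = {p_value:.4f}")
```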
Affiliation(s)
- Guan Huang
- College of Education, China West Normal University, Nanchong, China
- Haohua Zhang
- College of Education, China West Normal University, Nanchong, China
- Jingsheng Zeng
- College of Education, China West Normal University, Nanchong, China
- Wen Chen
- Cyberspace Security Academy, Sichuan University, Chengdu, China
3
Gauthier TP, Cady E, Liu T, Landayan AM. Resources and strategies for learning infectious diseases pharmacotherapy during advanced pharmacy practice experiences and pharmacy residency. Am J Health Syst Pharm 2025; 82:228-234. [PMID: 39185681] [DOI: 10.1093/ajhp/zxae250]
Affiliation(s)
- Timothy P Gauthier
- Baptist Health Clinical Pharmacy Enterprise, Baptist Health South Florida, Miami, FL, USA
- Elizabeth Cady
- Southern Illinois University at Edwardsville School of Pharmacy, Springfield, IL, USA
- Thomas Liu
- Baptist Health Clinical Pharmacy Enterprise, Baptist Health South Florida, Miami, FL, USA
- Alice M Landayan
- Pharmacy Department, South Miami Hospital, Baptist Health South Florida, Miami, FL, USA
4
Wu Z, Gan W, Xue Z, Ni Z, Zheng X, Zhang Y. Performance of ChatGPT on Nursing Licensure Examinations in the United States and China: Cross-Sectional Study. JMIR Medical Education 2024; 10:e52746. [PMID: 39363539] [PMCID: PMC11466054] [DOI: 10.2196/52746]
Abstract
Background The creation of large language models (LLMs) such as ChatGPT is an important step in the development of artificial intelligence and shows great potential in medical education due to powerful language understanding and generative capabilities. The purpose of this study was to quantitatively evaluate and comprehensively analyze ChatGPT's performance on questions from the nursing licensure examinations of the United States and China: the National Council Licensure Examination for Registered Nurses (NCLEX-RN) and the National Nursing Licensure Examination (NNLE), respectively.
Objective This study aims to examine how well LLMs answer NCLEX-RN and NNLE multiple-choice questions (MCQs) given inputs in different languages, to evaluate whether LLMs can be used as multilingual learning assistants for nursing, and to assess whether they possess a repository of professional knowledge applicable to clinical nursing practice.
Methods First, we compiled 150 NCLEX-RN Practical MCQs, 240 NNLE Theoretical MCQs, and 240 NNLE Practical MCQs. Then, the translation function of ChatGPT 3.5 was used to translate NCLEX-RN questions from English to Chinese and NNLE questions from Chinese to English. Finally, the original and translated versions of the MCQs were input into ChatGPT 4.0, ChatGPT 3.5, and Google Bard. The LLMs were compared by accuracy rate, and differences between language inputs were examined.
Results The accuracy rates of ChatGPT 4.0 for the NCLEX-RN Practical MCQs and their Chinese translations were 88.7% (133/150) and 79.3% (119/150), respectively. Despite the statistical significance of the difference (P=.03), the correct rate was generally satisfactory. ChatGPT 4.0 correctly answered 71.9% (169/235) of the NNLE Theoretical MCQs and 69.1% (161/233) of the NNLE Practical MCQs. Its accuracy on the NNLE Theoretical and Practical MCQs translated into English was 71.5% (168/235; P=.92) and 67.8% (158/233; P=.77), respectively, with no statistically significant difference between input languages. With English input, ChatGPT 3.5 (NCLEX-RN P=.003, NNLE Theoretical P<.001, NNLE Practical P=.12) and Google Bard (NCLEX-RN P<.001, NNLE Theoretical P<.001, NNLE Practical P<.001) had lower accuracy rates on nursing-related MCQs than ChatGPT 4.0. For ChatGPT 3.5, accuracy with English input was higher than with Chinese input, and the difference was statistically significant (NCLEX-RN P=.02, NNLE Practical P=.02). Whether the MCQs were submitted in Chinese or English, ChatGPT 4.0 had the highest number of unique correct responses and the lowest number of unique incorrect responses among the 3 LLMs.
Conclusions This study of 618 nursing MCQs from the NCLEX-RN and NNLE found that ChatGPT 4.0 outperformed ChatGPT 3.5 and Google Bard in accuracy. It handled both English and Chinese inputs well, underscoring its potential as a valuable tool in nursing education and clinical decision-making.
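The headline NCLEX-RN contrast (88.7% with English input vs 79.3% with the Chinese translation, P=.03) is a comparison of two proportions, 133/150 vs 119/150. The abstract does not restate the exact test used; the sketch below shows one standard way to reproduce that kind of comparison in Python with a chi-square test on the counts quoted above.

```python
# Sketch of a two-proportion comparison like the one reported for ChatGPT 4.0 on the
# NCLEX-RN items (133/150 correct with English input vs 119/150 with Chinese input).
# The choice of a chi-square test is an assumption; the abstract does not name the test.
from scipy.stats import chi2_contingency

correct_en, total_en = 133, 150
correct_zh, total_zh = 119, 150

table = [
    [correct_en, total_en - correct_en],  # English input: correct vs incorrect
    [correct_zh, total_zh - correct_zh],  # Chinese input: correct vs incorrect
]

chi2, p_value, dof, expected = chi2_contingency(table)
print(f"Accuracy (English): {correct_en / total_en:.1%}")  # 88.7%
print(f"Accuracy (Chinese): {correct_zh / total_zh:.1%}")  # 79.3%
print(f"chi2 = {chi2:.2f}, p = {p_value:.3f}")  # compare with the reported P=.03
```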
Affiliation(s)
- Zelin Wu
- Department of Bone and Joint Surgery and Sports Medicine Center, The First Affiliated Hospital, Guangzhou, China
- Wenyi Gan
- Department of Joint Surgery and Sports Medicine, Zhuhai People’s Hospital, Zhuhai City, China
- Zhaowen Xue
- Department of Bone and Joint Surgery and Sports Medicine Center, The First Affiliated Hospital, Guangzhou, China
- Zhengxin Ni
- School of Nursing, Yangzhou University, Yangzhou, China
- Xiaofei Zheng
- Department of Bone and Joint Surgery and Sports Medicine Center, The First Affiliated Hospital, Guangzhou, China
- Yiyi Zhang
- Department of Bone and Joint Surgery and Sports Medicine Center, The First Affiliated Hospital, Guangzhou, China
5
de Lange T, Møystad A, Torgersen G, Ahlqvist J, Jäghagen EL. Students' perceptions of post-exam feedback in oral radiology: a comparative study from two dental hygienist educational settings. European Journal of Dental Education 2024; 28:377-387. [PMID: 37885281] [DOI: 10.1111/eje.12959]
Abstract
INTRODUCTION The aim of this study was to investigate how students perceive the benefit of participating in a teacher-organised session providing feedback on exams, termed post-exam feedback, in two dental hygienist programmes.
METHODS The study was based on interviews with 22 participants, comprising 18 students and 4 faculty teachers. The data were analysed thematically, generating insights into how the participants reflected on their participation in the post-exam feedback sessions and how they perceived this arrangement as learners.
RESULTS The findings suggest that motivated students consider post-exam feedback beneficial for clearing up uncertainties, deepening their understanding of issues not fully understood during the exam, and supporting their further learning. Less motivated students mainly consider post-exam feedback relevant for students who do not pass the exams.
CONCLUSIONS The results suggest that, when organised in a student-centred way and with attentiveness to student learning preferences, post-exam feedback can be valuable for enhancing assessment and supporting student learning related to exams.
Affiliation(s)
- Thomas de Lange
- Department of Education, University of South-Eastern Norway, Oslo, Norway
- Anne Møystad
- Institute of Clinical Dentistry, Faculty of Dentistry, University of Oslo, Oslo, Norway
- Gerald Torgersen
- Institute of Clinical Dentistry, Faculty of Dentistry, University of Oslo, Oslo, Norway
- Jan Ahlqvist
- Oral and Maxillofacial Radiology, Department of Odontology, Umeå University, Umeå, Sweden
- Eva Levring Jäghagen
- Oral and Maxillofacial Radiology, Department of Odontology, Umeå University, Umeå, Sweden
6
Manteghinejad A. Web-Based Medical Examinations During the COVID-19 Era: Reconsidering Learning as the Main Goal of Examination. JMIR Medical Education 2021; 7:e25355. [PMID: 34329178] [PMCID: PMC8360339] [DOI: 10.2196/25355]
Abstract
Like other aspects of the health care system, medical education has been greatly affected by the COVID-19 pandemic. To meet the requirements of lockdown and virtual education, student performance has been evaluated via web-based examinations. Although this shift to web-based examinations was inevitable, mental, educational, and technical aspects must also be considered to ensure the efficiency and accuracy of this type of evaluation. The easiest way to address the new challenges is to administer traditional questions via a web-based platform, but more factors should be accounted for when designing web-based examinations during the COVID-19 era. This article presents an approach that uses the opportunity created by the pandemic to reconsider learning as the main goal of web-based examinations. The approach suggests using open-book examinations, writing questions that target higher cognitive domains, using real clinical scenarios, developing more comprehensive examination blueprints, using advanced platforms for web-based questions, and providing feedback in web-based examinations to ensure that examinees have acquired the minimum competency levels defined in the course objectives.
Affiliation(s)
- Amirreza Manteghinejad
- Cancer Prevention Research Center, Omid Hospital, Isfahan University of Medical Sciences, Isfahan, Iran
- Student Research Committee, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
7
Prochazka J, Ovcari M, Durinik M. Sandwich feedback: The empirical evidence of its effectiveness. Learning and Motivation 2020. [DOI: 10.1016/j.lmot.2020.101649]
8
Burgess A, Bateman K, Schucker J. Postexamination review using a standardized examination review form. Teaching and Learning in Nursing 2020. [DOI: 10.1016/j.teln.2019.07.003]
9
Yune SJ, Lee SY, Im S. How Do Medical Students Prepare for Examinations: Pre-assessment Cognitive and Meta-cognitive Activities. Korean Medical Education Review 2019; 21:51-58. [DOI: 10.17496/kmer.2019.21.1.51]
Abstract
Although ‘assessment for learning’ rather than ‘assessment of learning’ has been emphasized recently, how students learn before examinations remains unclear. The purpose of this study was to investigate pre-assessment learning activities (PALA) and to identify mechanism factors (MF) that influence those activities. In addition, the PALA and MF for written exams were compared with those for the clinical performance examination/objective structured clinical examination (CPX/OSCE) in third-year (N=121) and fourth-year (N=108) medical students. Through literature review and discussion, questionnaires with a 5-point Likert scale were developed to measure PALA and MF. PALA comprised cognitive and meta-cognitive activities, and MF comprised personal, interpersonal, and environmental factors. Cronbach’s α was used to assess survey reliability, while Pearson correlation coefficients and multiple regression analysis were used to investigate the influence of MF on PALA. A paired t-test was applied to compare the PALA and MF of written exams with those of the CPX/OSCE in third- and fourth-year students. The Pearson correlation coefficients between PALA and MF were 0.479 for written exams and 0.508 for the CPX/OSCE. MF explained 24.1% of the variance in PALA for written exams and 25.9% for the CPX/OSCE. Both PALA and MF differed significantly between written exams and the CPX/OSCE in third-year students, whereas no differences were found in fourth-year students. Educators need to consider the MFs that influence PALA to encourage ‘assessment for learning’.
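The reported analyses combine scale reliability (Cronbach's α), Pearson correlations between MF and PALA, and multiple regression in which MF explains 24.1% and 25.9% of the variance in PALA. The Python sketch below reproduces those three computations on placeholder Likert data; the column names, sample values, and scoring are hypothetical and do not reflect the study's instrument.

```python
# Sketch of the reported analyses on placeholder data: Cronbach's alpha for a Likert
# scale, Pearson correlation between composite scores, and R^2 from a multiple
# regression of PALA on the MF sub-components. Column names and values are hypothetical,
# so the printed numbers are not meaningful beyond illustrating the computations.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 121  # e.g., the size of one cohort
df = pd.DataFrame({
    "mf_personal": rng.integers(1, 6, n).astype(float),
    "mf_interpersonal": rng.integers(1, 6, n).astype(float),
    "mf_environmental": rng.integers(1, 6, n).astype(float),
    "pala": rng.integers(1, 6, n).astype(float),
})

def cronbach_alpha(items: pd.DataFrame) -> float:
    """alpha = k/(k-1) * (1 - sum of item variances / variance of the total score)."""
    k = items.shape[1]
    item_var = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_var / total_var)

mf_items = df[["mf_personal", "mf_interpersonal", "mf_environmental"]]
alpha = cronbach_alpha(mf_items)

mf_score = mf_items.mean(axis=1)
r = np.corrcoef(mf_score, df["pala"])[0, 1]  # Pearson r (the study reports 0.479 / 0.508)

# Multiple regression of PALA on the three MF sub-components; R^2 corresponds to the
# "MF explained 24.1% / 25.9% of PALA variance" figures in the abstract.
X = np.column_stack([np.ones(n), mf_items.to_numpy()])
beta, *_ = np.linalg.lstsq(X, df["pala"].to_numpy(), rcond=None)
pred = X @ beta
ss_res = np.sum((df["pala"].to_numpy() - pred) ** 2)
ss_tot = np.sum((df["pala"].to_numpy() - df["pala"].mean()) ** 2)
r_squared = 1 - ss_res / ss_tot

print(f"Cronbach's alpha: {alpha:.3f}, Pearson r: {r:.3f}, R^2: {r_squared:.3f}")
```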