1
Matos Sousa R, Collares CF, Pereira VH. Longitudinal variation of correlations between different components of assessment within a medical school. BMC Medical Education 2024;24:850. PMID: 39112948; PMCID: PMC11308138; DOI: 10.1186/s12909-024-05822-3. (Received: 01/28/2024; Accepted: 07/25/2024)
Abstract
BACKGROUND An assessment program should be inclusive and ensure that the various components of medical knowledge, clinical skills, and professionalism are assessed. The strength of the correlations between these assessment components, and how it varies over time, remains a matter of study. Based on meaningful learning theory and integrated learning theory, we hypothesized that the connections between these components strengthen over the course of medical school. METHODS This retrospective cohort study analyzed data collected over a 10-year period in one medical school. We included students from the 3rd to 6th year of medical school from 2011 to 2021. Three assessment components were addressed: Knowledge, Clinical Skills, and Professionalism. For data analysis, Pearson correlation coefficients (r) and r² were calculated to study the correlations between variables, and a z-test on Fisher's r-to-z transformation was used to test for differences between correlation coefficients. RESULTS 949 medical students were included in the study. The correlation between Clinical Skills and Professionalism showed a medium to strong association (Pearson's r ranging from 0.485 to 0.734), while the correlation between Knowledge and Professionalism was weaker but evolved steadily, with Pearson's r fluctuating between 0.075 and 0.218. The Knowledge and Clinical Skills correlation became statistically significant from 2013 onwards, peaking at a Pearson's r of 0.440 for the cohort spanning 2016-2019. We also found a strengthening of the correlation between Professionalism and Clinical Skills from the beginning to the end of clinical training, but not of their correlations with the Knowledge component. CONCLUSIONS This analysis contributes to our understanding of the dynamics of the correlations between different assessment components within an institution and provides a framework for how they interact and influence each other.
TRIAL REGISTRATION This study was not a clinical trial but a retrospective observational study without health care interventions. Nevertheless, we provide herein the study number as submitted to the Ethics Committee: CEICVS 146/2021.
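The z-test on Fisher's r-to-z transformation used in this study compares two independent Pearson correlations. A minimal stdlib sketch follows; the function name and the cohort sizes in the example are illustrative assumptions, not the study's actual data.

```python
from math import atanh, sqrt
from statistics import NormalDist

def fisher_r_to_z_test(r1, n1, r2, n2):
    """Two-sided z-test for the difference between two independent
    Pearson correlations, via Fisher's r-to-z transformation."""
    z1, z2 = atanh(r1), atanh(r2)            # variance-stabilizing transform
    se = sqrt(1 / (n1 - 3) + 1 / (n2 - 3))   # standard error of z1 - z2
    z = (z1 - z2) / se
    p = 2 * (1 - NormalDist().cdf(abs(z)))   # two-sided p-value
    return z, p

# e.g. comparing r = 0.485 with r = 0.734 in two hypothetical cohorts of 200
z, p = fisher_r_to_z_test(0.485, 200, 0.734, 200)
```

With these illustrative inputs the difference is clearly significant (z near -4); the transform is needed because the sampling distribution of r itself is skewed for r far from zero.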
Affiliation(s)
- Rita Matos Sousa
  - School of Medicine, University of Minho, Braga, 4710-057, Portugal
- Carlos Fernando Collares
  - School of Medicine, University of Minho, Braga, 4710-057, Portugal
  - European Board of Medical Assessors, Cardiff, UK
  - Inspirali Educação, São Paulo, Brazil
  - Medical Education Unit, Faculty of Medicine and Biomedical Sciences, University of Algarve, Faro, Portugal
  - Faculdades Pequeno Príncipe, Curitiba, Brazil
2
Sadati L, Edalattalab F, Hajati N, Karami S, Bagheri AB, Bahri MH, Abjar R. OSABSS: An authentic examination for assessing basic surgical skills in surgical residents. Surg Open Sci 2024;19:217-222. PMID: 38860004; PMCID: PMC11163168; DOI: 10.1016/j.sopen.2024.04.008. (Received: 11/16/2023; Revised: 04/20/2024; Accepted: 04/28/2024)
Abstract
Objectives This study aimed to develop and validate the Objective Structured Assessment of Basic Surgical Skills (OSABSS), a modified objective structured clinical examination (OSCE), to assess basic surgical skills in residents. Design A developmental study conducted in two phases: basic skills were identified through a literature review and gap analysis, and the OSABSS was then designed as a modified OSCE. Setting This study took place at Alborz University of Medical Sciences in Iran. Interventions The OSABSS was created using Harden's OSCE methodology. Scenarios, checklists, and station configurations were developed through expert panels. The exam was piloted and implemented with residents as participants and faculty as evaluators. Participants 32 surgical residents in gynecology, general surgery, orthopedics, and neurosurgery participated; 22 faculty members served as evaluators. Primary and secondary outcome measures The primary outcome was OSABSS exam scores; secondary outcomes were written exam scores and national residency entrance ranks. Main results The mean OSABSS score was 16.59 ± 0.19 across all stations. Criterion validity was demonstrated through correlations between OSABSS scores, written exam scores, and entrance ranks. Reliability was high, with a Cronbach's alpha of 0.87. No significant inter-rater score differences were found. Conclusions The rigorous OSABSS development process produced an exam demonstrating strong validity and reliability for assessing basic surgical skills. The comprehensive station variety evaluates diverse technical and non-technical competencies. Further research should expand participant samples across surgical disciplines.
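Cronbach's alpha, reported here as 0.87, summarizes internal consistency across exam stations. A minimal sketch of the standard formula follows; the function name and toy scores are illustrative, not study data.

```python
from statistics import pvariance

def cronbach_alpha(item_scores):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / variance of totals).
    `item_scores` holds one list per item/station, each with one score per examinee."""
    k = len(item_scores)
    item_var_sum = sum(pvariance(item) for item in item_scores)
    totals = [sum(scores) for scores in zip(*item_scores)]  # per-examinee totals
    return (k / (k - 1)) * (1 - item_var_sum / pvariance(totals))

# two toy stations scored for three examinees
alpha = cronbach_alpha([[1, 2, 3], [2, 4, 6]])
```

Values near 1 indicate that stations rank examinees consistently; an alpha of 0.87 across OSCE stations is conventionally considered high.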
Affiliation(s)
- Leila Sadati
  - Department of Operating Room, School of Paramedical Sciences, Alborz University of Medical Sciences, Karaj, Iran
- Fatemeh Edalattalab
  - School of Paramedical Sciences, Alborz University of Medical Sciences, Karaj, Iran
- Niloofar Hajati
  - Department of Operating Room, School of Paramedical Sciences, Alborz University of Medical Sciences, Karaj, Iran
- Sahar Karami
  - Department of Operating Room, School of Paramedical Sciences, Alborz University of Medical Sciences, Karaj, Iran
  - Department of Medical Education, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Ali Baradaran Bagheri
  - Department of Neurosurgery, School of Medicine, Shahid Madani Hospital, Alborz University of Medical Sciences, Karaj, Iran
- Mohammad Hadi Bahri
  - Department of Surgery, Shahid Madani Hospital, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
- Rana Abjar
  - Department of Operating Room, School of Paramedical Sciences, Alborz University of Medical Sciences, Karaj, Iran
3
Vandecasteele R, Schelfhout S, D'hondt F, De Maesschalck S, Derous E, Willems S. Intercultural effectiveness in GPs' communication and clinical assessment: An experimental study. Patient Education and Counseling 2024;122:108138. PMID: 38237531; DOI: 10.1016/j.pec.2024.108138. (Received: 07/28/2023; Revised: 11/27/2023; Accepted: 01/04/2024)
Abstract
OBJECTIVE This study aimed to investigate potential disparities in general practitioners' (GPs') overall communication and clinical assessments based on patient ethnicity, while examining the influence of intercultural effectiveness. METHODS Employing a 2 × 2 experimental design, online video-recorded consultations with simulated patients were conducted and analyzed using OSCEs. Each GP (N = 100) completed a consultation with both an ethnic majority and an ethnic minority patient. Additionally, a follow-up survey was administered to gather supplementary data. Paired-sample t-tests explored ethnic disparities; correlation and regression analyses determined associations with intercultural attitudes, traits, and capabilities. RESULTS No statistically significant differences in GPs' communication or clinical assessment were found based on patients' ethnic background. Positive associations were observed between all aspects of intercultural effectiveness and GPs' consultation behavior. Intercultural traits emerged as a strong and robust predictor of the clinical assessment of ethnic minority patients. CONCLUSION Intercultural traits, such as ethnocultural empathy, may play a critical role in GPs' clinical assessment skills during intercultural consultations. PRACTICE IMPLICATIONS Findings provide valuable insights into the determinants of intercultural effectiveness in healthcare, offering promising targets for interventions and training programs that aim to ensure higher-quality and more equitable care delivery.
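The paired-sample t-test used here compares each GP's two scores directly, one per patient condition. A minimal stdlib sketch follows; the function name and toy scores are illustrative, not study data.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(xs, ys):
    """Paired-sample t statistic and degrees of freedom for matched scores
    (e.g. each rater's score with a majority vs. a minority patient)."""
    diffs = [x - y for x, y in zip(xs, ys)]  # within-pair differences
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / sqrt(n))  # sample (n-1) standard deviation
    return t, n - 1

# toy example: four paired scores
t, df = paired_t([5, 6, 7, 8], [4, 6, 6, 7])
```

Converting t to a p-value requires the t distribution (a table or a stats library); the sketch stops at the statistic, which is what pairing buys you: between-GP variability cancels out of the differences.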
Affiliation(s)
- Robin Vandecasteele
  - Ghent University, Faculty of Medicine and Health Sciences, Department of Public Health and Primary Care, Research Group Equity in Health Care, University Hospital Campus entrance 42, C. Heymanslaan 10, 9000 Ghent, Belgium
- Stijn Schelfhout
  - Ghent University, Faculty of Psychology and Educational Sciences, Department of Work, Organization and Society, Vocational and Personnel Psychology Lab, H. Dunantlaan 2, 9000 Ghent, Belgium
  - Ghent University, Faculty of Psychology and Educational Sciences, Department of Experimental Psychology, Henri Dunantlaan 2, 9000 Ghent, Belgium
- Fanny D'hondt
  - Department of Sociology, Faculty of Political and Social Sciences, Ghent University, Sint-Pietersnieuwstraat 41, 9000 Ghent, Belgium
- Stéphanie De Maesschalck
  - Ghent University, Faculty of Medicine and Health Sciences, Department of Public Health and Primary Care, Research Group Equity in Health Care, University Hospital Campus entrance 42, C. Heymanslaan 10, 9000 Ghent, Belgium
  - Ghent University, Centre for the Social Study of Migration and Refugees, H. Dunantlaan 2, 9000 Ghent, Belgium
- Eva Derous
  - Ghent University, Faculty of Psychology and Educational Sciences, Department of Work, Organization and Society, Vocational and Personnel Psychology Lab, H. Dunantlaan 2, 9000 Ghent, Belgium
  - Erasmus University Rotterdam, Erasmus School of Social and Behavioural Sciences, Burgemeester Oudlaan 50, 3062 Rotterdam, the Netherlands
- Sara Willems
  - Ghent University, Faculty of Medicine and Health Sciences, Department of Public Health and Primary Care, Research Group Equity in Health Care, University Hospital Campus entrance 42, C. Heymanslaan 10, 9000 Ghent, Belgium
  - Ghent University, Centre for the Social Study of Migration and Refugees, H. Dunantlaan 2, 9000 Ghent, Belgium
  - Ghent University, Faculty of Medicine and Health Sciences, Department of Public Health and Primary Care, Quality & Safety Ghent, University Hospital Campus entrance 42, C. Heymanslaan 10, 9000 Ghent, Belgium
4
Yeates P, Maluf A, Kinston R, Cope N, Cullen K, Cole A, O'Neill V, Chung CW, Goodfellow R, Vallender R, Ensaff S, Goddard-Fuller R, McKinley R, Wong G. A realist evaluation of how, why and when objective structured clinical exams (OSCEs) are experienced as an authentic assessment of clinical preparedness. Medical Teacher 2024:1-9. PMID: 38635469; DOI: 10.1080/0142159x.2024.2339413. (Received: 11/28/2023; Accepted: 04/02/2024)
Abstract
INTRODUCTION Whilst rarely researched, the authenticity with which Objective Structured Clinical Exams (OSCEs) simulate practice is arguably critical to making valid judgements about candidates' preparedness to progress in their training. We studied how and why an OSCE gave rise to different experiences of authenticity for different participants under different circumstances. METHODS We used realist evaluation, collecting data through interviews and focus groups with participants across four UK medical schools who took part in an OSCE designed to enhance authenticity. RESULTS Several features of OSCE stations (realistic, complex, complete cases; sufficient time; autonomy; props; guidelines; limited examiner interaction; etc.) combined to enable students to project into their future roles, judge and integrate information, consider their actions, and act naturally. When this occurred, their performances felt like an authentic representation of their clinical practice. This did not always work: focusing on unavoidable differences from practice, incongruous features, anxiety, and preoccupation with examiners' expectations sometimes disrupted immersion, producing inauthenticity. CONCLUSIONS The perception of authenticity in OSCEs appears to originate from an interaction of station design with individual preferences and contextual expectations. Whilst tentatively suggesting ways to promote authenticity, more understanding is needed of candidates' interaction with simulation and scenario immersion in summative assessment.
Affiliation(s)
- Peter Yeates
  - School of Medicine, Keele University, Keele, England
- Adriano Maluf
  - Faculty of Health and Life Sciences, De Montfort University, Leicester, England
- Ruth Kinston
  - School of Medicine, Keele University, Keele, England
- Natalie Cope
  - School of Medicine, Keele University, Keele, England
- Kathy Cullen
  - School of Medicine, Dentistry and Biomedical Sciences, Queen's University Belfast, Belfast, Northern Ireland
- Aidan Cole
  - School of Medicine, Dentistry and Biomedical Sciences, Queen's University Belfast, Belfast, Northern Ireland
- Vikki O'Neill
  - School of Medicine, Dentistry and Biomedical Sciences, Queen's University Belfast, Belfast, Northern Ireland
- Ching-Wa Chung
  - School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Aberdeen, Scotland
- Sue Ensaff
  - School of Medicine, Cardiff University, Cardiff, Wales
- Rikki Goddard-Fuller
  - Christie Education, Christie Hospitals NHS Foundation Trust, Manchester, England
- Geoff Wong
  - Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, England
5
Wise CM, Lovett GD, Swarm GM, Nazar A. The 7 Elements Communication Rating Form for Teaching and Assessing Medical Communication Competency. Cureus 2024;16:e59008. PMID: 38800217; PMCID: PMC11127704; DOI: 10.7759/cureus.59008. (Accepted: 04/25/2024)
Abstract
INTRODUCTION Medical communication skills are a critical component of clinical medicine and patient satisfaction. Communication skills are difficult to teach and evaluate, necessitating tools that are effective and efficient. This study presents and validates the 7 Elements Communication Rating Form (7E-CRF), a streamlined, dual-purpose, evidence-based medical communication checklist that functions as both a teaching and an assessment tool. METHOD A 14-item teaching and assessment tool is described and validated using face, concurrent, and predictive validity indices. The study was conducted with 661 medical students from the West Virginia School of Osteopathic Medicine (WVSOM). Student performance was assessed in year 1 labs, year 2 labs, and year 2 and year 3 objective structured clinical examinations (OSCEs). These internal indices were compared with student performance on the Humanistic Domain of the Comprehensive Osteopathic Medical Licensing Examination (COMLEX) Level 2-Performance Evaluation (PE), a licensure exam previously taken in year 3 or 4 of osteopathic medical school. RESULTS The evidence of interrater reliability and predictive validity is strong. Data from the 7E-CRF were compared with performance on the COMLEX Level 2-PE Humanistic Domain. The 7E-CRF can identify students who are at a 10-fold increased risk of failure on the COMLEX Level 2-PE Humanistic Domain. CONCLUSIONS The 7E-CRF integrates instruction and assessment based on a national and international model. Its simplicity, foundation in professional consensus, ease of use, and predictive efficacy make the 7E-CRF a highly valuable instrument for medical schools in teaching and evaluating competency in medical communication skills.
Affiliation(s)
- Christina M Wise
  - Department of Clinical Sciences, West Virginia School of Osteopathic Medicine, Lewisburg, USA
- Gretchen D Lovett
  - Department of Clinical Sciences, West Virginia School of Osteopathic Medicine, Lewisburg, USA
- Gail M Swarm
  - Department of Clinical Sciences, West Virginia School of Osteopathic Medicine, Lewisburg, USA
- Andrea Nazar
  - Department of Clinical Sciences, West Virginia School of Osteopathic Medicine, Lewisburg, USA
6
Faisal H, Qamar F, Martinez S, Razmi S, Oviedo R, Masud F. Learning curve of ultrasound-guided surgeon-administered transversus abdominis plane (UGSA-TAP) block on a porcine model. Heliyon 2024;10:e25006. PMID: 38322832; PMCID: PMC10844114; DOI: 10.1016/j.heliyon.2024.e25006. (Received: 10/03/2022; Revised: 11/22/2023; Accepted: 01/18/2024)
Abstract
Background Surgeons commonly perform ultrasound-guided transversus abdominis plane (TAP) blocks to manage acute pain following abdominal surgeries. There is no consensus on whether surgeons should undergo basic hands-on training to perform TAP blocks or whether video-based learning is sufficient. We theorized that simulation-based learning is superior to video-based learning. In the present study, we analyze the technical skills of UGSA-TAP block performance on a live porcine model by general surgery trainees after video-based or simulation-based learning. Methods We performed a prospective, double-blinded, randomized study. Ten surgery residents and two surgical critical care fellows (n = 12) without prior experience in performing the TAP block were recruited. The participants were randomized into either a video-based or a simulation-based training group. All participants then performed a TAP block on a live anesthetized pig, which was recorded and scored by three blinded anesthesiologists. All participants completed a post-performance survey to assess their confidence in gaining competency in the UGSA-TAP block. Statistical analyses were performed to assess the differences between the two groups; p < 0.05 was considered statistically significant. Results All simulation-based learning participants successfully performed a survey scan, identified the three muscular layers of the abdominal wall, and identified the transversus abdominis plane, compared with 50%, 50%, and 33% of video-based learning participants for the respective parameters (p < 0.05). While some performance metrics showed no statistically significant differences between the groups, substantial effect sizes (Cohen's h up to 1.07) highlighted notable differences in participants' performance. Both groups exhibited confidence in core competencies, with varied rates of satisfactory skill execution. Performance assessed using a global rating scale revealed a higher passing rate for the simulation group (83% vs. 33%). Participant feedback via a Likert scale reflected confidence post-training. Inter-rater reliability (0.83-1) confirmed the robustness of the study evaluations. Conclusion The UGSA-TAP block curriculum should be introduced into surgical residency programs with an emphasis on simulation-based learning, enhancing trainees' procedural skills before they transition to surgical patients.
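Cohen's h, the effect size cited in this abstract, is the difference between two proportions after an arcsine transformation. A minimal sketch follows; pairing the reported 83% and 33% pass rates is our illustration only, since the abstract does not specify which comparison yielded its maximum h of 1.07.

```python
from math import asin, sqrt

def cohens_h(p1, p2):
    """Cohen's h: difference between two proportions on the arcsine scale,
    h = 2*asin(sqrt(p1)) - 2*asin(sqrt(p2))."""
    return 2 * asin(sqrt(p1)) - 2 * asin(sqrt(p2))

# illustrative pairing of the reported pass rates
h = cohens_h(0.83, 0.33)
```

The transform stabilizes the variance of proportions, so a given h means roughly the same detectability whether the proportions are near 0.5 or near the extremes; by common convention h above 0.8 is a large effect.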
Affiliation(s)
- H. Faisal
  - Clinical Surgery at Weill Cornell Medical College, USA
  - Clinical Surgery at Houston Methodist Academic Institute, USA
  - Clinical Medicine at Texas A&M University, Houston Methodist Hospital, Houston, TX, USA
- F. Qamar
  - Houston Methodist DeBakey Heart & Vascular Center, Houston, TX, USA
- S. Martinez
  - Clinical Surgery at Texas A&M College of Medicine; Surgery Residency Program Director / Interim Chief of the Division of Acute Care Surgery, Houston Methodist Hospital, USA
- S.E. Razmi
  - Texas A&M College of Medicine EnMed, Houston, TX, USA
- R.J. Oviedo
  - Surgery, Weill Cornell Medical College, Cornell University, Texas A&M University College of Medicine, USA
- F. Masud
  - Anesthesiology at Houston Methodist Academic Institute; Medical Director, Center for Critical Care, USA
7
Zhang L, Liang H, Luo H, He W, Cai Y, Liu S, Fan Y, Huang W, Zhao Q, Zhong D, Li J, Lv S, Li C, Xie Y, Zhang N, Xu DR. Quality in screening and measuring blood pressure in China's primary health care: a national cross-sectional study using unannounced standardized patients. The Lancet Regional Health - Western Pacific 2024;43:100973. PMID: 38076324; PMCID: PMC10701131; DOI: 10.1016/j.lanwpc.2023.100973. (Received: 07/05/2023; Revised: 10/15/2023; Accepted: 11/06/2023)
Abstract
Background This study aims to evaluate primary care providers' adherence to the standard of measuring blood pressure for people aged 35 or above during their initial visit, as per Chinese guidelines, and to identify factors affecting their practices. Methods We developed 11 standardized patient (SP) cases as tracer conditions to evaluate primary care and deployed trained SPs for unannounced visits to randomly selected providers in seven provinces of China. The SPs used a guideline-based checklist to record whether and how blood pressure was measured. Data were analyzed descriptively, and regression analysis was performed to examine the association between outcomes and provider, patient, facility, and clinical case characteristics. Findings The SPs conducted 1201 visits and found that fewer than one-third of unannounced standardized patients (USPs) aged 35 or older had their blood pressure measured. Only 26.9% of migraine and 15.4% of diabetes cases received blood pressure measurements. Additionally, these measurements did not follow the guidelines' recommended steps: on average, 55.6% of the steps were followed, few providers considered influencing factors before measurement, and only 6.0% of patients received both-arm measurements. The use of wrist sphygmomanometers was associated with poor blood pressure measurement. Interpretation In China, primary care hypertension screening practices fall short of guidelines, with infrequent initiation of blood pressure measurements and inadequate adherence to proper measurement steps. To address this, priority should be placed on adopting, implementing, and upholding guidelines for hypertension screening and measurement. Funding National Natural Science Foundation of China; Swiss Agency for Development and Cooperation; Doctoral Fund Project of Inner Mongolia Medical University; China Postdoctoral Science Foundation.
Affiliation(s)
- Lanping Zhang
  - Acacia Lab for Implementation Science, School of Health Management, Southern Medical University, Guangzhou, China
  - The Third Department of Lung Disease, Shenzhen Third People's Hospital, Shenzhen, Guangdong Province 518112, China
- Huijuan Liang
  - School of Health Management, Inner Mongolia Medical University, Hohhot, China
- Huanyuan Luo
  - Acacia Lab for Implementation Science, Institute for Global Health, Dermatology Hospital of Southern Medical University, Guangzhou, China
- Wenjun He
  - Acacia Lab for Implementation Science, School of Health Management, Southern Medical University, Guangzhou, China
  - Acacia Lab for Implementation Science, School of Public Health, Southern Medical University, Guangzhou, China
- Yiyuan Cai
  - Department of Epidemiology and Health Statistics, School of Public Health, Guizhou Medical University, Guizhou Province, China
- Siyuan Liu
  - Acacia Lab for Implementation Science, School of Health Management, Southern Medical University, Guangzhou, China
- Yancun Fan
  - School of Health Management, Inner Mongolia Medical University, Hohhot, China
- Wenxiu Huang
  - Erfenzi Township Health Center of Wuchuan County, Inner Mongolia, China
- Qing Zhao
  - Center for World Health Organization Studies and Department of Health Management, School of Health Management of Southern Medical University, Guangzhou, China
- Dongmei Zhong
  - Acacia Lab for Implementation Science, School of Health Management, Southern Medical University, Guangzhou, China
- Jiaqi Li
  - Acacia Lab for Implementation Science, School of Public Health, Southern Medical University, Guangzhou, China
- Sensen Lv
  - Acacia Lab for Implementation Science, School of Health Management, Southern Medical University, Guangzhou, China
- Chunping Li
  - Acacia Lab for Implementation Science, School of Public Health, Southern Medical University, Guangzhou, China
- Yunyun Xie
  - Acacia Lab for Implementation Science, School of Health Management, Southern Medical University, Guangzhou, China
- Nan Zhang
  - School of Health Management, Inner Mongolia Medical University, Hohhot, China
- Dong (Roman) Xu
  - Acacia Lab for Implementation Science, School of Health Management and Dermatology Hospital, Southern Medical University, Guangzhou, China
  - Center for World Health Organization Studies and Department of Health Management, School of Health Management of Southern Medical University, Guangzhou, China
  - Southern Medical University Institute for Global Health (SIGHT), Dermatology Hospital of Southern Medical University (SMU), Guangzhou, China
8
Salawu YK, Stewart D, Daud A. Structures, processes and outcomes of objective structured clinical examinations in dental education during the COVID-19 pandemic: A scoping review. European Journal of Dental Education 2023;27:802-814. PMID: 36337030; PMCID: PMC9877700; DOI: 10.1111/eje.12869. (Received: 11/13/2021; Revised: 08/30/2022; Accepted: 10/28/2022)
Abstract
INTRODUCTION Objective structured clinical examinations (OSCEs) are an essential examination tool within undergraduate dental education. Fear of spreading the COVID-19 virus led dental institutions to explore alternative means of conducting OSCEs. The aim of this scoping review was to investigate what structures, processes and outcomes of dental OSCEs were reported during the COVID-19 pandemic. MATERIALS AND METHODS This scoping review was conducted and reported adhering to the Preferred Reporting Items for Systematic Reviews and Meta-analyses extension for scoping reviews (PRISMA-ScR). Published literature was identified through a systematic search of PubMed, Embase, the Cumulative Index to Nursing and Allied Health Literature (CINAHL), the Education Resources Information Center (ERIC), ProQuest and Google Scholar. Identified articles were independently reviewed by two authors (KS, AD), followed by synthesis in terms of the reported structures, processes and outcomes. Articles reporting cancellation or rescheduling were also included, with data extracted on reasons and any suggestions or recommendations. RESULTS The search yielded a total of 290 studies, of which 239 were excluded after removal of duplicates, leaving 51 studies for title and abstract evaluation. Thirty-four articles were excluded as they did not report on the topic of interest, leaving 17 for full-text evaluation, of which nine were analysed according to the pre-set themes. All dental OSCEs taking place (n = 6) were conducted online, whilst the remaining (n = 3) were either cancelled or rescheduled. Data on structures reported the specific online videoconferencing software used and the provision of staff and student training. Processes for executing online OSCEs varied significantly from one study to another, providing rich data on how dental institutions may carry out such assessments tailored to their needs. Information regarding outcomes was sparse: little attention was paid to students' results compared with pre-pandemic cohorts, and the reliability and validity of online dental OSCEs were not investigated. CONCLUSION Dental OSCEs could be conducted online by implementing well-planned structures and processes; however, further evidence on outcomes is needed to establish their reliability and validity. Dental institutions may need to consider alternative methods to assess practical competencies if online OSCEs are to take place.
Affiliation(s)
- Yetunde Kemi Salawu
  - Together Dental Corporate Dentistry Group and Community Dental Services, Essex, UK
- Derek Stewart
  - College of Pharmacy, QU Health, Qatar University, Doha, Qatar
- Alaa Daud
  - College of Dental Medicine, QU Health, Qatar University, Doha, Qatar
9
Fink MC, Heitzmann N, Reitmeier V, Siebeck M, Fischer F, Fischer MR. Diagnosing virtual patients: the interplay between knowledge and diagnostic activities. Advances in Health Sciences Education: Theory and Practice 2023;28:1245-1264. PMID: 37052740; PMCID: PMC10099021; DOI: 10.1007/s10459-023-10211-4. (Received: 06/17/2022; Accepted: 01/22/2023)
Abstract
Clinical reasoning theories agree that knowledge and the diagnostic process are associated with diagnostic success. However, the exact contributions of these components of clinical reasoning to diagnostic success remain unclear. This is particularly the case when operationalizing the diagnostic process with diagnostic activities (i.e., teachable practices that generate knowledge). Therefore, we conducted a study investigating to what extent knowledge and diagnostic activities uniquely explain variance in diagnostic success with virtual patients among medical students. The sample consisted of N = 106 medical students in their third to fifth year of university studies in Germany (6-year curriculum). Participants completed professional knowledge tests before diagnosing virtual patients. Diagnostic success with the virtual patients was assessed with diagnostic accuracy as well as a comprehensive diagnostic score, answering the call for more extensive measurement of clinical reasoning outcomes. Three diagnostic activities were tracked: hypothesis generation, evidence generation, and evidence evaluation. Professional knowledge predicted performance in terms of the comprehensive diagnostic score and displayed a small association with diagnostic accuracy. Diagnostic activities predicted both the comprehensive diagnostic score and diagnostic accuracy. Hierarchical regressions showed that the diagnostic activities made a unique contribution to diagnostic success, even when knowledge was taken into account. Our results support the argument that the diagnostic process is more than an embodiment of knowledge and explains variance in diagnostic success over and above knowledge. We discuss possible mechanisms explaining this finding.
Collapse
Affiliation(s)
- Maximilian C Fink
- Institute of Medical Education, University Hospital, LMU Munich, Munich, Germany
- Department for Education, University of the Bundeswehr Munich, Institute of Education, Learning and Teaching with Media, Werner-Heisenberg-Weg 39, 85577, Neubiberg, Germany
- Nicole Heitzmann
- Department of Psychology, LMU Munich, Munich, Germany
- Munich Center of the Learning Sciences (MCLS), LMU Munich, Munich, Germany
- Victoria Reitmeier
- Institute of Medical Education, University Hospital, LMU Munich, Munich, Germany
- Matthias Siebeck
- Institute of Medical Education, University Hospital, LMU Munich, Munich, Germany
- Munich Center of the Learning Sciences (MCLS), LMU Munich, Munich, Germany
- Frank Fischer
- Department of Psychology, LMU Munich, Munich, Germany
- Munich Center of the Learning Sciences (MCLS), LMU Munich, Munich, Germany
- Martin R Fischer
- Institute of Medical Education, University Hospital, LMU Munich, Munich, Germany
- Munich Center of the Learning Sciences (MCLS), LMU Munich, Munich, Germany
10
Perez A, Fetters MD, Creswell JW, Scerbo M, Kron FW, Gonzalez R, An L, Jimbo M, Klasnja P, Guetterman TC. Enhancing Nonverbal Communication Through Virtual Human Technology: Protocol for a Mixed Methods Study. JMIR Res Protoc 2023; 12:e46601. [PMID: 37279041 PMCID: PMC10282909 DOI: 10.2196/46601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 04/27/2023] [Accepted: 04/27/2023] [Indexed: 06/07/2023] Open
Abstract
BACKGROUND Communication is a critical component of the patient-provider relationship; however, limited research exists on the role of nonverbal communication. Virtual human training is an informatics-based educational strategy that offers various benefits in communication skill training directed at providers. Recent informatics-based interventions aimed at improving communication have mainly focused on verbal communication, yet research is needed to better understand how virtual humans can improve verbal and nonverbal communication and further elucidate the patient-provider dyad. OBJECTIVE The purpose of this study is to enhance a conceptual model that incorporates technology to examine verbal and nonverbal components of communication and develop a nonverbal assessment that will be included in the virtual simulation for further testing. METHODS This study will consist of a multistage mixed methods design, including convergent and exploratory sequential components. A convergent mixed methods study will be conducted to examine the mediating effects of nonverbal communication. Quantitative (eg, MPathic game scores, Kinect nonverbal data, objective structured clinical examination communication score, and Roter Interaction Analysis System and Facial Action Coding System coding of video) and qualitative data (eg, video recordings of MPathic-virtual reality [VR] interventions and student reflections) will be collected simultaneously. Data will be merged to determine the most crucial components of nonverbal behavior in human-computer interaction. An exploratory sequential design will proceed, consisting of a grounded theory qualitative phase. Using theoretical, purposeful sampling, interviews will be conducted with oncology providers probing intentional nonverbal behaviors. The qualitative findings will aid the development of a nonverbal communication model that will be included in a virtual human. 
The subsequent quantitative strand will incorporate and validate a new automated nonverbal communication behavior assessment into the virtual human simulation, MPathic-VR, by assessing interrater reliability, code interactions, and dyadic data analysis by comparing Kinect responses (system recorded) to manually scored records for specific nonverbal behaviors. Data will be integrated using building integration to develop the automated nonverbal communication behavior assessment and conduct a quality check of these nonverbal features. RESULTS Secondary data from the MPathic-VR randomized controlled trial data set (210 medical students and 840 video recordings of interactions) were analyzed in the first part of this study. Results showed differential experiences by performance in the intervention group. Following the analysis of the convergent design, participants consisting of medical providers (n=30) will be recruited for the qualitative phase of the subsequent exploratory sequential design. We plan to complete data collection by July 2023 to analyze and integrate these findings. CONCLUSIONS The results from this study contribute to the improvement of patient-provider communication, both verbal and nonverbal, including the dissemination of health information and health outcomes for patients. Further, this research aims to transfer to various topical areas, including medication safety, informed consent processes, patient instructions, and treatment adherence between patients and providers. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/46601.
Affiliation(s)
- Analay Perez
- Department of Educational Psychology, University of Nebraska-Lincoln, Lincoln, NE, United States
- Michael D Fetters
- Department of Family Medicine, University of Michigan, Ann Arbor, MI, United States
- John W Creswell
- Department of Family Medicine, University of Michigan, Ann Arbor, MI, United States
- Mark Scerbo
- Department of Psychology, Old Dominion University, Norfolk, VA, United States
- Frederick W Kron
- Section of General Internal Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, United States
- Richard Gonzalez
- Department of Psychology, University of Michigan, Ann Arbor, MI, United States
- Lawrence An
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI, United States
- Masahito Jimbo
- Department of Family and Community Medicine, University of Illinois College of Medicine, Chicago, IL, United States
- Predrag Klasnja
- School of Information, University of Michigan, Ann Arbor, MI, United States
- Timothy C Guetterman
- Department of Family Medicine, University of Michigan, Ann Arbor, MI, United States
11
Bartlett MJ, Umoren R, Amory JH, Huynh T, Kim AJH, Stiffler AK, Mastroianni R, Ficco E, French H, Gray M. Measuring antenatal counseling skill with a milestone-based assessment tool: a validation study. BMC MEDICAL EDUCATION 2023; 23:325. [PMID: 37165398 PMCID: PMC10170031 DOI: 10.1186/s12909-023-04282-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 04/20/2023] [Indexed: 05/12/2023]
Abstract
BACKGROUND Antenatal counseling for parents in the setting of expected preterm delivery is an important component of pediatric training. However, healthcare professionals receive a variable amount and quality of formal training. This study evaluated the validity of a practical tool to assess antenatal counseling skills and provide evaluative feedback: the Antenatal Counseling Milestones Scale (ACoMS). METHODS Experts in antenatal counseling developed an anchored, milestone-based tool to evaluate observable skills. Study participants with a range of antenatal counseling skills were recruited to take part in simulated counseling sessions, in person or via video, with standardized patient actors presenting with preterm labor at 23 weeks' gestation. Two faculty observers scored each session independently using the ACoMS. Participants completed an ACoMS self-assessment, a demographic survey, and a feedback survey. Validity was measured with weighted kappas for inter-rater agreement, Kruskal-Wallis and Dunn's tests for milestone levels between degrees of expertise in counseling, and Cronbach's alpha for item consistency. RESULTS Forty-two participants completed observed counseling sessions. Of the 17 items included in the tool, 15 discriminated significantly, with scores scaling with level of training. A majority of elements had fair to moderate agreement between raters, and there was high internal consistency amongst all items. CONCLUSION This study demonstrates that the internal structure of the ACoMS rubric has greater than fair inter-rater reliability and high internal consistency amongst items. Content validity is supported by the scale's ability to discern level of training. Application of the ACoMS to clinical encounters is needed to determine its utility in clinical practice.
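One of the reliability statistics used here, Cronbach's alpha for item consistency, is straightforward to compute by hand; a minimal Python sketch on invented checklist ratings (not the ACoMS data):

```python
from statistics import variance

def cronbach_alpha(item_scores):
    """Cronbach's alpha; item_scores is a list of per-item score lists
    (items x respondents), using sample variances throughout."""
    k = len(item_scores)
    sum_item_vars = sum(variance(item) for item in item_scores)
    totals = [sum(col) for col in zip(*item_scores)]  # total score per respondent
    return k / (k - 1) * (1 - sum_item_vars / variance(totals))

# Toy ratings for 4 checklist items across 6 counseling sessions (illustrative).
items = [
    [3, 4, 4, 5, 2, 4],
    [3, 3, 4, 5, 2, 5],
    [2, 4, 3, 4, 2, 4],
    [3, 4, 4, 5, 3, 5],
]
print(f"alpha = {cronbach_alpha(items):.2f}")
```

Alpha rises when items covary strongly relative to their individual variances, which is why a rubric whose items all track the same underlying skill reports high internal consistency.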
Affiliation(s)
- Rachel Umoren
- University of Washington School of Medicine, Seattle, 98105, USA
- Trang Huynh
- Oregon Health & Science University, Portland, USA
- Ellie Ficco
- University of Washington School of Medicine, Seattle, 98105, USA
- Megan Gray
- University of Washington School of Medicine, Seattle, 98105, USA
12
McAfee NW, Schumacher JA, Williams DC, Madson MB, Bagge CL, Konkle-Parker D, Paul IA, Houston LJ, Young KM. Multidimensional Evaluation of Screening Brief Intervention and Referral to Treatment Training for Medical Students. ACADEMIC PSYCHIATRY : THE JOURNAL OF THE AMERICAN ASSOCIATION OF DIRECTORS OF PSYCHIATRIC RESIDENCY TRAINING AND THE ASSOCIATION FOR ACADEMIC PSYCHIATRY 2023:10.1007/s40596-023-01752-2. [PMID: 36720777 DOI: 10.1007/s40596-023-01752-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 01/20/2023] [Indexed: 06/18/2023]
Abstract
OBJECTIVE Screening, Brief Intervention, and Referral to Treatment (SBIRT) is an evidence-based approach to identifying and addressing alcohol use in non-specialty settings. Many medical schools teach SBIRT, but most published evaluations of these efforts exclude rigorous skill assessments and teaching methods. METHODS During the 2017-2018 academic year, 146 third-year medical students received classroom-based learning on SBIRT and motivational interviewing (MI) and at least two SBIRT practice sessions with feedback as part of a 4-week psychiatry clerkship. The objective of this curriculum was to improve SBIRT knowledge, attitudes, and confidence and enable learners to skillfully deliver SBIRT. Outcomes evaluated included satisfaction, knowledge, attitudes and confidence, and clinical skill in delivering SBIRT to a standardized patient (rated by the actor as well as an expert). RESULTS Results indicated acceptable post-curriculum satisfaction and significant improvements in attitudes and knowledge from pre- to post-curriculum. On the clinical skills exam, all students were rated by standardized patients as having mastered at least 80% of SBIRT elements, and 91.8% were rated at this level by a faculty expert. Student attitudes and knowledge were unrelated to expert ratings, and standardized patient ratings had limited associations with expert ratings. CONCLUSIONS These results suggest the curriculum objectives were achieved and provide unique contributions to SBIRT curricular outcome research for healthcare trainees. Findings also suggest that trainee knowledge and confidence may not relate to skill, and that standardized patient feedback provides different information on SBIRT and MI skill than expert ratings.
Affiliation(s)
- Ian A Paul
- University of Mississippi Medical Center, Jackson, MS, USA
- L Joy Houston
- Southern Illinois University School of Medicine, Springfield, IL, USA
13
Lebdai S, Bouvard B, Martin L, Annweiler C, Lerolle N, Rineau E. Objective structured clinical examination versus traditional written examinations: a prospective observational study. BMC MEDICAL EDUCATION 2023; 23:69. [PMID: 36707797 PMCID: PMC9883896 DOI: 10.1186/s12909-023-04050-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 01/20/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND Recently, Objective Structured Clinical Examinations (OSCEs) became an official evaluation modality for 6th-year medical students in France. Previously, the standard examination modalities were written progressive clinical cases (PCC), written critical reading of scientific articles (CRA), and internship evaluation (IE). The aim of this study was to assess the performances of 6th-year medical students in their final faculty tests by comparing OSCE exams with the standard examination modalities. METHODS This was a prospective observational study. We included all 6th-year medical students in our university from 2020 to 2021. The endpoints were the scores obtained at the following final faculty tests during the 6th year of medical studies: OSCE training, OSCE exams, written PCC, written CRA, and IE. All scores were compared in a paired analysis. RESULTS A total of 400 students were included in the study; none were excluded from the final analysis. The mean scores obtained at the OSCE exams differed significantly from those obtained at the other tests (OSCE exams: 12.6 ± 1.7; OSCE training: 11.7 ± 1.7; PCC: 13.4 ± 1.4; CRA: 13.2 ± 1.5; IE: 14.7 ± 0.9; p < 0.001). OSCE exam scores were moderately and significantly correlated with OSCE training and PCC scores (Spearman rho = 0.4, p < 0.001), and weakly but significantly correlated with CRA and IE scores (Spearman rho = 0.3, p < 0.001). OSCE scores significantly increased after an OSCE training session. CONCLUSION In our faculty, 6th-year medical students obtained lower scores at OSCE exams than at the other standard evaluation modalities. The correlations were weak to moderate but significant. These results suggest that OSCEs are not redundant with the other evaluation modalities. Interestingly, a single OSCE training session led to an improvement in OSCE scores, underlining the importance of specific training.
Affiliation(s)
- Souhil Lebdai
- Urology Department, University Hospital of Angers, 49933 Angers cedex 9, Angers, France.
- Health Faculty, University of Angers, Angers, France.
- All'Sims Center for Simulation in Healthcare, University Hospital of Angers, Angers, France.
- Ludovic Martin
- Health Faculty, University of Angers, Angers, France
- All'Sims Center for Simulation in Healthcare, University Hospital of Angers, Angers, France
- Emmanuel Rineau
- Health Faculty, University of Angers, Angers, France
- All'Sims Center for Simulation in Healthcare, University Hospital of Angers, Angers, France
- Department of Anesthesia and Critical Care, University Hospital of Angers, Angers, France
14
Smee S, Coetzee K, Bartman I, Roy M, Monteiro S. OSCE Standard Setting: Three Borderline Group Methods. MEDICAL SCIENCE EDUCATOR 2022; 32:1439-1445. [PMID: 36532388 PMCID: PMC9755382 DOI: 10.1007/s40670-022-01667-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 10/19/2022] [Indexed: 06/17/2023]
Abstract
High-stakes assessments must discriminate between examinees who are sufficiently competent to practice in the health professions and examinees who are not. In these settings, criterion-referenced standard-setting methods are strongly preferred over norm-referenced methods. While there are many criterion-referenced options, few are feasible or cost-effective for objective structured clinical examinations (OSCEs). The human and financial resources required to organize OSCEs alone are often significant, leaving little in an institution's budget for additional resource-intensive standard-setting methods. The modified borderline group method (BGM) introduced by Dauphinee et al. for a large-scale, multi-site OSCE is a very feasible option but is less defensible for smaller scale OSCEs. This study compared the modified BGM to two adaptations that address its limitations for smaller scale OSCEs while retaining its benefits, namely feasibility. We evaluated decision accuracy and consistency of calculated cut scores derived from (1) modified, (2) regression-based, and (3) 4-facet Rasch model borderline group methods. Data were from a 12-station OSCE that assessed 112 nurses for entry to practice in a Canadian context. The three cut scores (64-65%) all met acceptable standards of accuracy and consistency; however, the modified BGM was the most influenced by lower scores within the borderline group, leading to the lowest cut score. The two adaptations may be more defensible than the modified BGM in the context of a smaller (n < 100-150) OSCE.
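The core borderline group idea is simple to sketch: the station cut score is derived from the checklist scores of examinees whose global rating is "borderline". A minimal Python illustration with invented data (the mean-of-borderline-scores rule below is a common simplification, not the exact procedure of any of the three methods compared):

```python
from statistics import mean

# Illustrative station results: each examinee has a checklist score (%)
# and an examiner's global rating.
results = [
    (82, "pass"), (61, "borderline"), (74, "pass"), (58, "fail"),
    (66, "borderline"), (70, "pass"), (63, "borderline"), (55, "fail"),
]

# Cut score = mean checklist score of the globally "borderline" examinees.
borderline_scores = [score for score, rating in results if rating == "borderline"]
cut_score = mean(borderline_scores)
print(f"station cut score = {cut_score:.1f}%")
```

With few borderline examinees, a handful of low scores drags the mean down, which is exactly the sensitivity the abstract reports for the modified BGM in small-scale OSCEs, and what the regression-based and Rasch adaptations are designed to dampen.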
Affiliation(s)
- Marguerite Roy
- Department of Innovation in Medical Education, University of Ottawa, Ottawa, Canada
- Sandra Monteiro
- Department of Medicine, Division of Education and Innovation, McMaster University, Hamilton, Canada
15
Makrides A, Yeates P. Memory, credibility and insight: How video-based feedback promotes deeper reflection and learning in objective structured clinical exams. MEDICAL TEACHER 2022; 44:664-671. [PMID: 35000530 DOI: 10.1080/0142159x.2021.2020232] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
INTRODUCTION Providing high-quality feedback from Objective Structured Clinical Exams (OSCEs) is important but challenging. Whilst prior research suggests that video-based feedback (VbF), where students review their own performances alongside usual examiner feedback, may usefully enhance verbal or written feedback, little is known about how students experience or interact with VbF or what mechanisms may underlie any such benefits. METHODS We used social constructivist grounded theory to explore students' interaction with VbF. Within semi-structured interviews, students reviewed their verbal feedback from examiners before watching a video of the same performance, reflecting with the interviewer before and after the video. Transcribed interviews were analysed using grounded theory analysis methods. RESULTS Videos greatly enhanced students' memories of their performance, which increased their receptivity to, and the credibility of, examiners' feedback. Reflecting on video performances produced novel insights for students beyond the points described by examiners. Students triangulated these novel insights with their own self-assessment and experiences from practice to reflect deeply on their performance, which led to the generation of additional, often patient-orientated, learning objectives. CONCLUSIONS The array of beneficial mechanisms evoked by VbF suggests it may be a powerful means to richly support students' learning in both formative and summative contexts.
Affiliation(s)
- Alexandra Makrides
- School of Medicine, Keele University, Keele, Staffordshire, United Kingdom
- Peter Yeates
- School of Medicine, Keele University, Keele, Staffordshire, United Kingdom
- Fairfield General Hospital, Pennine Acute Hospitals NHS Trust, Bury, Lancashire, United Kingdom
16
Reiser S, Schacht L, Thomm E, Figalist C, Janssen L, Schick K, Dörfler E, Berberat PO, Gartmeier M, Bauer J. A video-based situational judgement test of medical students' communication competence in patient encounters: Development and first evaluation. PATIENT EDUCATION AND COUNSELING 2022; 105:1283-1289. [PMID: 34481676 DOI: 10.1016/j.pec.2021.08.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 06/22/2021] [Accepted: 08/20/2021] [Indexed: 06/13/2023]
Abstract
OBJECTIVE We developed and evaluated the Video-Based Assessment of Medical Communication Competence (VA-MeCo), a construct-driven situational judgement test measuring medical students' communication competence in patient encounters. METHODS In the construction phase, we conducted two expert studies (panel 1: n = 6; panel 2: n = 13) to ensure curricular and content validity and sufficient expert agreement on the answer key. In the evaluation phase, we conducted a cognitive pre-test (n = 12) and a pilot study (n = 117) with medical students to evaluate test usability and acceptance, item statistics, and test reliability depending on the applied scoring method (raw consensus vs. pairwise comparison scoring). RESULTS The results of the expert interviews indicated good curricular and content validity. Expert agreement on the answer key was high (ICCs > .86). The pilot study showed favourable usability and acceptance by students. Irrespective of the scoring method, reliability was high for the complete test (Cronbach's α > .93) and its subscales (α > .83). CONCLUSION There is promising evidence that medical communication competence can be validly and reliably measured using a construct-driven, video-based situational judgement test. PRACTICE IMPLICATIONS Video-based SJTs allow efficient online assessment of medical communication competence and are well accepted by students and educators.
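Raw consensus scoring, one of the two SJT scoring methods compared here, is commonly implemented by crediting each response with the proportion of the expert panel that endorsed the chosen option. A minimal sketch with an invented answer key (the option shares below are hypothetical, not VA-MeCo's key):

```python
# Hypothetical expert answer key for one SJT item: option -> share of the
# expert panel endorsing that option (shares sum to 1.0).
expert_endorsement = {"A": 0.70, "B": 0.15, "C": 0.10, "D": 0.05}

def raw_consensus_score(chosen_option):
    """Score a response by the expert consensus behind the chosen option."""
    return expert_endorsement[chosen_option]

# Four students' responses to the item, scored by consensus.
responses = ["A", "A", "B", "C"]
scores = [raw_consensus_score(r) for r in responses]
print(scores)
```

Under this scheme a modal expert answer earns the most credit while minority answers still earn partial credit, in contrast to dichotomous keys that score everything but the single best option as wrong.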
Affiliation(s)
- Sabine Reiser
- University of Erfurt, Educational Research and Methodology, Erfurt, Germany.
- Laura Schacht
- University of Erfurt, Educational Research and Methodology, Erfurt, Germany
- Eva Thomm
- University of Erfurt, Educational Research and Methodology, Erfurt, Germany
- Christina Figalist
- Technical University of Munich, TUM School of Medicine, TUM Medical Education Center, Munich, Germany
- Laura Janssen
- Technical University of Munich, TUM School of Medicine, TUM Medical Education Center, Munich, Germany
- Kristina Schick
- Technical University of Munich, TUM School of Medicine, TUM Medical Education Center, Munich, Germany
- Eva Dörfler
- Technical University of Munich, ProLehre | Media and Didactics, Munich, Germany
- Pascal O Berberat
- Technical University of Munich, TUM School of Medicine, TUM Medical Education Center, Munich, Germany
- Martin Gartmeier
- Technical University of Munich, TUM School of Medicine, TUM Medical Education Center, Munich, Germany
- Johannes Bauer
- University of Erfurt, Educational Research and Methodology, Erfurt, Germany
17
Hu P, Sun J, Wei F, Liu X. Patient-Tailored 3D-Printing Models in the Subspecialty Training of Spinal Tumors: A Comparative Study and Questionnaire Survey. World Neurosurg 2022; 161:e488-e494. [PMID: 35189420 DOI: 10.1016/j.wneu.2022.02.042] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 02/11/2022] [Indexed: 10/19/2022]
Abstract
BACKGROUND Training in the subspecialty of spinal tumors is challenging and under-researched. Anatomic variation and the complex relationship with paraspinal structures tend to be the main obstacles for trainees in this field. Three-dimensional (3D) printing has the advantages of individual customization and high fidelity, and can produce case-tailored models as auxiliary tools in medical training. METHODS The study comprised case-based lectures with tailored 3D-printed models, evaluation of trainee performance in a controlled examination, and an anonymous questionnaire survey of trainees' opinions of the tailored models. The examination was designed as case-based clinical analysis. All trainees were randomly allocated to a study group or a control group; the study group was additionally provided with a case-tailored model. RESULTS Thirty-six participants were recruited, including 16 residents and 20 fellows. In the examination, there were significant differences in describing the involvement of paraspinal structures and discriminating the relationship between the tumor and large vessels (P < 0.05), but no significant differences in surgical planning or relevant complications (P > 0.05). In the survey, most participants gave favorable responses to the 3D-printed models for understanding anatomic structures and relationships, inter-trainee communication, surgical planning, and enhancement of interest and confidence (favorable response rates of 50.0% to 94.4%). CONCLUSIONS The 3D-printed model is a valuable tool in the training of new residents and fellows in the subspecialty of spinal tumors. It can facilitate trainees' understanding of tumor anatomy, surgical readiness, and confidence.
Affiliation(s)
- Panpan Hu
- Department of Orthopaedics and Beijing Key Laboratory of Spinal Disease Research, Peking University Third Hospital, Beijing, China
- Jie Sun
- Pain Medicine Center, Peking University Third Hospital, Beijing, China
- Feng Wei
- Department of Orthopaedics and Beijing Key Laboratory of Spinal Disease Research, Peking University Third Hospital, Beijing, China
- Xiaoguang Liu
- Department of Orthopaedics and Beijing Key Laboratory of Spinal Disease Research, Peking University Third Hospital, Beijing, China
18
Yeates P, Moult A, Cope N, McCray G, Fuller R, McKinley R. Determining influence, interaction and causality of contrast and sequence effects in objective structured clinical exams. MEDICAL EDUCATION 2022; 56:292-302. [PMID: 34893998 PMCID: PMC9304241 DOI: 10.1111/medu.14713] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 11/03/2021] [Accepted: 12/08/2021] [Indexed: 06/14/2023]
Abstract
INTRODUCTION Differential rater function over time (DRIFT) and contrast effects (examiners' scores biased away from the standard of preceding performances) both challenge the fairness of scoring in objective structured clinical exams (OSCEs). This is important as, under some circumstances, these effects could alter whether some candidates pass or fail assessments. Benefitting from experimental control, this study investigated the causality, operation and interaction of both effects simultaneously for the first time in an OSCE setting. METHODS We used secondary analysis of data from an OSCE in which examiners scored embedded videos of student performances interspersed between live students. Embedded video position varied between examiners (early vs. late) whilst the standard of preceding performances naturally varied (previous high or low). We examined linear relationships suggestive of DRIFT and contrast effects in all within-OSCE data before comparing the influence and interaction of 'early' versus 'late' and 'previous high' versus 'previous low' conditions on embedded video scores. RESULTS Linear relationship data did not support the presence of DRIFT or contrast effects. Embedded videos were scored higher early (19.9 [19.4-20.5]) versus late (18.6 [18.1-19.1], p < 0.001), but scores did not differ between previous high and previous low conditions. The interaction term was non-significant. CONCLUSIONS In this instance, the small DRIFT effect we observed on embedded videos can be causally attributed to examiner behaviour. Contrast effects appear less ubiquitous than some prior research suggests. Possible mediators of these findings include the following: OSCE context, detail of task specification, examiners' cognitive load and the distribution of learners' ability.
As the operation of these effects appears to vary across contexts, further research is needed to determine the prevalence and mechanisms of contrast and DRIFT effects, so that assessments may be designed in ways that are likely to avoid their occurrence. Quality assurance should monitor for these contextually variable effects in order to ensure OSCE equivalence.
Affiliation(s)
- Peter Yeates
- School of Medicine, Keele University, Keele, UK
- Fairfield General Hospital, Pennine Acute Hospitals NHS Trust, Bury, UK
19
Yeates P, McCray G, Moult A, Cope N, Fuller R, McKinley R. Determining the influence of different linking patterns on the stability of students' score adjustments produced using Video-based Examiner Score Comparison and Adjustment (VESCA). BMC MEDICAL EDUCATION 2022; 22:41. [PMID: 35039023 PMCID: PMC8764767 DOI: 10.1186/s12909-022-03115-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 01/05/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Ensuring equivalence of examiners' judgements across different groups of examiners is a priority for large scale performance assessments in clinical education, both to enhance fairness and reassure the public. This study extends insight into an innovation called Video-based Examiner Score Comparison and Adjustment (VESCA), which uses video scoring to link otherwise unlinked groups of examiners. This linkage enables comparison of the influence of different examiner groups within a common frame of reference and provision of adjusted "fair" scores to students. Whilst this innovation promises substantial benefit to quality assurance of distributed Objective Structured Clinical Exams (OSCEs), questions remain about how the resulting score adjustments might be influenced by the specific parameters used to operationalise VESCA. The research questions were: how similar are estimates of students' score adjustments when the model is run with (1) fewer comparison videos per participating examiner, or (2) reduced numbers of participating examiners? METHODS Using secondary analysis of recent research which used VESCA to compare scoring tendencies of different examiner groups, we made numerous copies of the original data and then selectively deleted video scores to reduce either (1) the number of linking videos per examiner (4 versus permutations of 3, 2, or 1 videos) or (2) examiner participation rates (all participating examiners (76%) versus permutations of 70%, 60% or 50% participation). After analysing all resulting datasets with Many Facet Rasch Modelling (MFRM), we calculated students' score adjustments for each dataset and compared these with the score adjustments in the original data using Spearman's correlations.
RESULTS Students' score adjustments derived from 3 videos per examiner correlated highly with score adjustments derived from 4 linking videos (median Rho = 0.93, IQR 0.90-0.95, p < 0.001), with 2 (median Rho = 0.85, IQR 0.81-0.87, p < 0.001) and 1 linking videos (median Rho = 0.52, IQR 0.46-0.64, p < 0.001) producing progressively smaller correlations. Score adjustments were similar for 76% examiner participation versus 70% (median Rho = 0.97, IQR 0.95-0.98, p < 0.001) and 60% (median Rho = 0.95, IQR 0.94-0.98, p < 0.001) participation, but were lower and more variable for 50% participation (median Rho = 0.78, IQR 0.65-0.83, some ns). CONCLUSIONS Whilst VESCA showed some sensitivity to the examined parameters, modest reductions in examiner participation rates or video numbers produced highly similar results. Employing VESCA in distributed or national exams could enhance quality assurance or exam fairness.
Affiliation(s)
- Peter Yeates
- School of Medicine, David Weatherall Building, Keele University, Keele, Staffordshire, ST5 5BG, UK.
- Fairfield General Hospital, Northern Care Alliance NHS Foundation Trust, Rochdale Old Road, Bury, BL9 7TD, Lancashire, UK.
- Gareth McCray
- School of Medicine, David Weatherall Building, Keele University, Keele, Staffordshire, ST5 5BG, UK
- Alice Moult
- School of Medicine, David Weatherall Building, Keele University, Keele, Staffordshire, ST5 5BG, UK
- Natalie Cope
- School of Medicine, David Weatherall Building, Keele University, Keele, Staffordshire, ST5 5BG, UK
- Richard Fuller
- Christie Education, Christie Hospitals NHS Foundation Trust, Wilmslow Rd, Manchester, M20 4BX, UK
- Robert McKinley
- School of Medicine, David Weatherall Building, Keele University, Keele, Staffordshire, ST5 5BG, UK
20
Odusola F, Smith JL, Turrigiano E, Shulman M, Grbic JT, Fine JB, Hu MC, Nunes EV, Bisaga A, Levin FR. The utility of a formative one-station objective structured clinical examination for Substance use disorders in a dental curriculum. EUROPEAN JOURNAL OF DENTAL EDUCATION : OFFICIAL JOURNAL OF THE ASSOCIATION FOR DENTAL EDUCATION IN EUROPE 2021; 25:813-828. [PMID: 33471403 PMCID: PMC8289927 DOI: 10.1111/eje.12661] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 12/22/2020] [Accepted: 01/09/2021] [Indexed: 05/10/2023]
Abstract
Substance use disorders (SUD) are chronic relapsing medical conditions characterised by compulsive substance seeking and use. They constitute a substantial disease burden globally. Labelling of persons with SUD has created barriers to treatment, but effective management strategies exist. The dental profession has embraced reforms designed to address the SUD epidemic by promoting continuing education for practitioners and initiating curriculum changes in dental schools. Screening, Brief Intervention and Referral to Treatment (SBIRT) is an evidence-based model for managing patients with SUD. This study presents the use of a formative 1-station Objective Structured Clinical Examination (OSCE), operationalised with the MD3 rating scale, for learning and assessment in SBIRT. Over 3 years of implementation, the SBIRT OSCE was successfully integrated into the curriculum of the College of Dental Medicine, Columbia University. The mean score for total adherent behaviours was 11.80 (SD = 4.23; range: 2-24), and Cronbach's coefficient alpha for across-items reliability in adherent behaviours was 0.66. Adherent behaviours correlated with the global ratings (r = 0.66). Mean global rating scores were 2.90 (SD = 1.01) for collaboration and 2.97 (SD = 1.00) for empathy, and the two global rating scores correlated with each other (r = 0.85). Histograms of global rating scores resembled a normal distribution. The 1-station OSCE is a good model for learning about SBIRT. Psychometric analysis was useful in understanding the underlying construct of the MD3 rating scale and supported its reliability, validity and utility in dental education.
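Cronbach's alpha, the across-items reliability coefficient reported above, is straightforward to compute from an examinee-by-item score matrix. A minimal sketch with simulated checklist data (not the study's):

```python
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """items: (n_respondents, n_items) matrix of item scores."""
    k = items.shape[1]
    item_var_sum = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_var_sum / total_var)

# Simulated checklist: a common skill factor plus item-level noise
rng = np.random.default_rng(1)
skill = rng.normal(size=(100, 1))           # 100 examinees
scores = skill + rng.normal(size=(100, 8))  # 8 checklist items
print(f"alpha = {cronbach_alpha(scores):.2f}")
```

Alpha rises with the number of items and with the share of variance the items hold in common, which is why a short checklist with heterogeneous behaviours (as here, alpha = 0.66) can be acceptable for formative use.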
Affiliation(s)
- Folarin Odusola
- Dental Medicine and Director of Clinical Clerkships and Primary Care Medicine in Dentistry, College of Dental Medicine, Columbia University, New York, NY, USA
- Jennifer L. Smith
- Clinical Psychology, Department of Psychiatry, College of Physicians and Surgeons, Columbia University, New York, NY, USA
- Matisyahu Shulman
- Clinical Psychiatry, Department of Psychiatry, Columbia University, New York, NY, USA
- John T. Grbic
- Dental Medicine and Director of the Division of Foundational Sciences, College of Dental Medicine, Columbia University, New York, NY, USA
- James B. Fine
- Academic Affairs, College of Dental Medicine, Columbia University, New York, NY, USA
- Mei-Chen Hu
- Department of Psychiatry, Columbia University, New York, NY, USA
- Edward V. Nunes
- Psychiatry, New York State Psychiatric Institute, Columbia University, New York, NY, USA
- Adam Bisaga
- Psychiatry, New York State Psychiatric Institute, Columbia University, New York, NY, USA
- Frances R. Levin
- Division on Substance Use Disorders, New York State Psychiatric Institute, Columbia University, New York, NY, USA
21
Tsichlis JT, Del Re AM, Carmody JB. The Past, Present, and Future of the United States Medical Licensing Examination Step 2 Clinical Skills Examination. Cureus 2021; 13:e17157. [PMID: 34548971 PMCID: PMC8437080 DOI: 10.7759/cureus.17157] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/12/2021] [Indexed: 11/29/2022] Open
Abstract
In January 2021, the United States Medical Licensing Examination (USMLE) announced the permanent suspension of their Step 2 Clinical Skills (CS) examination. Launched in 2004, the Step 2 CS examination was intended to ensure that physicians entering graduate medical education possess the information gathering, clinical reasoning, and communication skills necessary to provide patient care. Although the requirement that doctors pass a clinical skills examination as a condition of licensure likely improved some elements of medical education and physician practice, the Step 2 CS examination had been deeply unpopular among many medical students since its inception. The demise of USMLE Step 2 CS provides an opportunity to re-examine the test's value and incorporate improvements in future iterations. However, doing so requires a clear understanding of why the test was so vigorously challenged. Here, we review the history of clinical skills examinations used for medical licensure in the United States and highlight the persistent concerns regarding Step 2 CS's cost, value, validity, and lack of examinee feedback before proposing future improvements to address each concern.
Affiliation(s)
- Jason T Tsichlis
- Pediatrics, University of Wisconsin School of Medicine and Public Health, Madison, USA
22
Naidoo N, Azar AJ, Khamis AH, Gholami M, Lindsbro M, Alsheikh-Ali A, Banerjee Y. Design, Implementation, and Evaluation of a Distance Learning Framework to Adapt to the Changing Landscape of Anatomy Instruction in Medical Education During COVID-19 Pandemic: A Proof-of-Concept Study. Front Public Health 2021; 9:726814. [PMID: 34568264 PMCID: PMC8460872 DOI: 10.3389/fpubh.2021.726814] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 08/06/2021] [Indexed: 12/23/2022] Open
Abstract
This study presents the design of a distance-learning (DL) framework to deliver anatomy teaching that provides a microcosm of the onsite anatomy learning experience during the mandated COVID-19 lockdown. First, using the nominal-group technique, we identified the DL learning theories to be employed in blueprinting the DL-framework. The effectiveness of the designed DL-framework in anatomy teaching was demonstrated using the exemplar of the Head and Neck (H&N) course, delivered during the COVID-19 lockdown in the pre-clerkship curriculum at our medical school. The dissemination of the DL-framework in the anatomy course was informed by the Analyse, Design, Develop, Implement, and Evaluate (ADDIE) model. The efficiency of the DL-framework was evaluated using the first two levels of Kirkpatrick's model. The versatility of the DL-framework was demonstrated by aligning its precepts with the individual domains of key learning-outcomes frameworks. The framework's blueprint was designed by amalgamating the principles of Garrison's community of inquiry, Siemens' connectivism and Harasim's online collaborative learning, and was improved using Anderson's DL-model. Following the implementation of the DL-framework in the H&N course, informed by ADDIE, the framework's efficiency was evaluated. In total, 70% of students responded to the survey assessing perception toward DL (Kirkpatrick's Level 1). Descriptive analysis of the survey results showed that the DL-framework was positively received by students and attested that students had an enriched learning experience, which promoted collaborative learning and student autonomy. For Kirkpatrick's Level 2, i.e., cognitive development, we compared summative assessment performance in the H&N course across three cohorts of students. The results show that the scores of the cohort that experienced the course entirely through the DL modality were statistically higher (P < 0.01) than those of both other cohorts, indicating that the shift to DL did not have an adverse effect on students' learning.
Using Bourdieu's Theory of Practice, we showed that the DL-framework is an efficient pedagogical approach that is pertinent for medical schools to adopt, and that it is versatile in attesting to the key domains of students' learning outcomes across different learning-outcomes frameworks. To our knowledge, this is the first study of its kind in which a rationale- and theory-guided approach has been used not only to blueprint a DL framework but also to implement it in an MBBS curriculum.
Affiliation(s)
- Nerissa Naidoo
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Aida J. Azar
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Amar Hassan Khamis
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Mandana Gholami
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Marjam Lindsbro
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Alawi Alsheikh-Ali
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Dubai Health Authority (DHA) Building, Dubai, United Arab Emirates
- Yajnavalka Banerjee
- College of Medicine and Health Sciences, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai, United Arab Emirates
- Centre for Medical Education, University of Dundee, Dundee, United Kingdom
23
Yeates P, Moult A, Cope N, McCray G, Xilas E, Lovelock T, Vaughan N, Daw D, Fuller R, McKinley RK(B). Measuring the Effect of Examiner Variability in a Multiple-Circuit Objective Structured Clinical Examination (OSCE). ACADEMIC MEDICINE : JOURNAL OF THE ASSOCIATION OF AMERICAN MEDICAL COLLEGES 2021; 96:1189-1196. [PMID: 33656012 PMCID: PMC8300845 DOI: 10.1097/acm.0000000000004028] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
PURPOSE Ensuring that examiners in different parallel circuits of objective structured clinical examinations (OSCEs) judge to the same standard is critical to the chain of validity. Recent work suggests that the examiner-cohort (i.e., the particular group of examiners) could significantly alter outcomes for some candidates. Despite this, examiner-cohort effects are rarely examined since fully nested data (i.e., no crossover between the students judged by different examiner groups) limit comparisons. In this study, the authors aim to replicate and further develop a novel method called Video-based Examiner Score Comparison and Adjustment (VESCA), so it can be used to enhance quality assurance of distributed or national OSCEs. METHOD In 2019, 6 volunteer students were filmed on 12 stations in a summative OSCE. In addition to examining live student performances, examiners from 8 separate examiner-cohorts scored the pool of video performances. Examiners scored videos specific to their station. Video scores linked otherwise fully nested data, enabling comparisons by Many Facet Rasch Modeling. The authors compared and adjusted for examiner-cohort effects. They also compared examiners' scores when videos were embedded (interspersed between live students during the OSCE) or judged later via the Internet. RESULTS Having accounted for differences in students' ability, scores from different examiner-cohorts for a student of the same ability ranged from 18.57 of 27 (68.8%) to 20.49 (75.9%), Cohen's d = 1.3. Score adjustment changed the pass/fail classification for up to 16% of students depending on the modeled cut score. Internet and embedded video scoring showed no difference in mean scores or variability. Examiners' accuracy did not deteriorate over the 3-week Internet scoring period. CONCLUSIONS Examiner-cohorts produced a replicable, significant influence on OSCE scores that was unaccounted for by typical assessment psychometrics.
VESCA offers a promising means to enhance validity and fairness in distributed OSCEs or national exams. Internet-based scoring may enhance VESCA's feasibility.
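The scale of an examiner-cohort effect like the one reported above can be expressed as Cohen's d, and its practical consequence as the share of students whose pass/fail classification flips after a simple mean-shift adjustment. This sketch uses invented score distributions (the means loosely echo the abstract's 18.57 vs 20.49 figures) and a naive adjustment, not the study's Rasch-based method:

```python
import numpy as np

rng = np.random.default_rng(2)
cut_score = 19.5

# Same-ability students scored by a lenient vs a stringent examiner-cohort
lenient = rng.normal(20.49, 1.5, size=100)
stringent = rng.normal(18.57, 1.5, size=100)

pooled_sd = np.sqrt((lenient.var(ddof=1) + stringent.var(ddof=1)) / 2)
cohens_d = (lenient.mean() - stringent.mean()) / pooled_sd

# Adjust the stringent cohort by the estimated cohort effect, then count
# students whose pass/fail classification flips
adjusted = stringent + (lenient.mean() - stringent.mean())
flipped = np.mean((stringent < cut_score) != (adjusted < cut_score))
print(f"Cohen's d = {cohens_d:.2f}, reclassified = {flipped:.0%}")
```

The reclassification rate depends heavily on where the cut score sits relative to the score distribution, which is why the study reports "up to 16%" as cut-score dependent.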
Affiliation(s)
- Peter Yeates
- P. Yeates is a senior lecturer in medical education research, School of Medicine, Keele University, Keele, Staffordshire, and a consultant in acute and respiratory medicine, Fairfield General Hospital, Pennine Acute Hospitals, NHS Trust, Bury, Lancashire, United Kingdom; ORCID: https://orcid.org/0000-0001-6316-4051
- Alice Moult
- A. Moult is a research assistant in medical education, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom; ORCID: https://orcid.org/0000-0002-9424-5660
- Natalie Cope
- N. Cope is a lecturer in clinical education (psychometrics), School of Medicine, Keele University, Keele, Staffordshire, United Kingdom
- Gareth McCray
- G. McCray is a researcher, School of Primary, Community and Social Care, Keele University, Keele, Staffordshire, United Kingdom
- Eleftheria Xilas
- E. Xilas is a foundation year 1 doctor and recent graduate, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom
- Tom Lovelock
- T. Lovelock is an information technology services manager, Faculty of Medicine & Health Sciences, Keele University, Keele, Staffordshire, United Kingdom
- Nicholas Vaughan
- N. Vaughan is a senior application developer, directorate of digital strategy and information technology services, Keele University, Keele, Staffordshire, United Kingdom
- Dan Daw
- D. Daw is an information technology systems development engineer, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom
- Richard Fuller
- R. Fuller is deputy dean, School of Medicine, University of Liverpool, Liverpool, United Kingdom; ORCID: https://orcid.org/0000-0001-7965-4864
- Robert K. (Bob) McKinley
- R.K. McKinley is an emeritus professor of education in general practice, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom; ORCID: https://orcid.org/0000-0002-3684-3435
24
Abstract
Simulation-based medical education (SBME) provides experiential learning for medical trainees without any risk of harm to patients. Simulation is now included in most medical school and residency curricula. In psychiatric education, simulation programs are rapidly expanding and innovating. Major applications of SBME in psychiatry include achieving close observation of trainees with patients, preparing trainees for unstable patient scenarios, and exposing trainees to a broader range of psychopathology. This review article covers the history of SBME, simulation modalities, current use of SBME in psychiatry, a case study from one institution, and recommendations for incorporating simulation in psychiatry education.
Affiliation(s)
- Shannon R McGue
- College of Medicine, Medical University of South Carolina, 96 Jonathan Lucas Street, Suite 812, MSC 623, Charleston, SC 29425, USA
- Christine M Pelic
- Department of Psychiatry and Behavioral Sciences, Medical University of South Carolina, 67 President Street, MSC 861, Charleston, SC 29425, USA; Ralph H. Johnson VA Medical Center, 109 Bee Street, Charleston, SC 29401, USA
- Austin McCadden
- College of Medicine, Medical University of South Carolina, 96 Jonathan Lucas Street, Suite 812, MSC 623, Charleston, SC 29425, USA
- Christopher G Pelic
- Department of Psychiatry and Behavioral Sciences, Medical University of South Carolina, 67 President Street, MSC 861, Charleston, SC 29425, USA
- A Lee Lewis
- Department of Psychiatry and Behavioral Sciences, Medical University of South Carolina, 67 President Street, MSC 861, Charleston, SC 29425, USA
25
Peeters MJ. Moving beyond Cronbach's Alpha and Inter-Rater Reliability: A Primer on Generalizability Theory for Pharmacy Education. Innov Pharm 2021; 12:10.24926/iip.v12i1.2131. [PMID: 34007684 PMCID: PMC8102977 DOI: 10.24926/iip.v12i1.2131] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND When available, empirical evidence should help guide decision-making. Following each administration of a learning assessment, data become available for analysis. For learning assessments, Kane's Framework for Validation can helpfully categorize evidence by inference (i.e., scoring, generalization, extrapolation, implications). Especially for test scores used in high-stakes settings, generalization evidence is critical. While reporting Cronbach's alpha, inter-rater reliability, and other reliability coefficients for a single source of measurement error is somewhat common in pharmacy education, dealing with multiple concurrent sources of measurement error within complex learning assessments is not. Performance-based assessments (e.g., OSCEs) that use raters are inherently complex learning assessments. PRIMER Generalizability Theory (G-Theory) can account for multiple sources of measurement error. G-Theory is a powerful tool that can provide a composite reliability (i.e., generalization evidence) for more complex learning assessments, including performance-based assessments. It can also help educators explore ways to make a learning assessment more rigorous if needed, as well as suggest ways to better allocate resources (e.g., staffing, space, fiscal). A brief review of G-Theory, focused on pharmacy education, is provided herein. MOVING FORWARD G-Theory has been common and useful in medical education, though it has rarely been used in pharmacy education. Given the similarities in assessment methods among the health professions, G-Theory should prove helpful in pharmacy education as well. Within this Journal and accompanying this Idea Paper, there are multiple reports that demonstrate the use of G-Theory in pharmacy education.
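The core G-Theory move is to partition score variance into components (persons, raters, error) via ANOVA mean squares and then combine them into a G coefficient. A minimal sketch for a fully crossed persons x raters design, with all variance parameters invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(3)
n_p, n_r = 60, 4                                  # persons x raters, fully crossed
scores = (rng.normal(0, 1.0, size=(n_p, 1))       # person (true-score) effects
          + rng.normal(0, 0.3, size=(1, n_r))     # rater leniency effects
          + rng.normal(0, 0.7, size=(n_p, n_r)))  # residual error

# Mean squares from the two-way ANOVA without replication
grand = scores.mean()
ms_p = n_r * ((scores.mean(axis=1) - grand) ** 2).sum() / (n_p - 1)
ms_r = n_p * ((scores.mean(axis=0) - grand) ** 2).sum() / (n_r - 1)
resid = (scores - scores.mean(axis=1, keepdims=True)
         - scores.mean(axis=0, keepdims=True) + grand)
ms_e = (resid ** 2).sum() / ((n_p - 1) * (n_r - 1))

# Variance components, then the relative G coefficient for a mean over n_r raters
var_p = (ms_p - ms_e) / n_r
var_r = (ms_r - ms_e) / n_p
g_coef = var_p / (var_p + ms_e / n_r)
print(f"var_person={var_p:.2f} var_rater={var_r:.2f} G={g_coef:.2f}")
```

Decision studies then re-evaluate `g_coef` for different hypothetical numbers of raters (or stations), which is how G-Theory informs resource allocation.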
26
Boursicot K, Kemp S, Wilkinson T, Findyartini A, Canning C, Cilliers F, Fuller R. Performance assessment: Consensus statement and recommendations from the 2020 Ottawa Conference. MEDICAL TEACHER 2021; 43:58-67. [PMID: 33054524 DOI: 10.1080/0142159x.2020.1830052] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
INTRODUCTION In 2011 the Consensus Statement on Performance Assessment was published in Medical Teacher. That paper was commissioned by AMEE (Association for Medical Education in Europe) as part of the series of Consensus Statements following the 2010 Ottawa Conference. In 2019, it was recommended that a working group be reconvened to review and consider developments in performance assessment since the 2011 publication. METHODS Following review of the original recommendations in the 2011 paper and shifts in the field across the past 10 years, the group identified areas of consensus and yet-to-be-resolved issues for performance assessment. RESULTS AND DISCUSSION This paper addresses developments in performance assessment since 2011, reiterates relevant aspects of the 2011 paper, and summarises contemporary best-practice recommendations for OSCEs and workplace-based assessments (WBAs), fit-for-purpose methods for performance assessment in the health professions.
Affiliation(s)
- Katharine Boursicot
- Department of Assessment and Progression, Duke-National University of Singapore, Singapore, Singapore
- Sandra Kemp
- Curtin Medical School, Curtin University, Perth, Australia
- Tim Wilkinson
- Dean's Department, University of Otago, Christchurch, New Zealand
- Ardi Findyartini
- Department of Medical Education, Universitas Indonesia, Jakarta, Indonesia
- Claire Canning
- Department of Assessment and Progression, Duke-National University of Singapore, Singapore, Singapore
- Francois Cilliers
- Department of Health Sciences Education, University of Cape Town, Cape Town, South Africa
27
Bremer A, Andersson Hagiwara M, Tavares W, Paakkonen H, Nyström P, Andersson H. Translation and further validation of a global rating scale for the assessment of clinical competence in prehospital emergency care. Nurse Educ Pract 2020; 47:102841. [PMID: 32768897 DOI: 10.1016/j.nepr.2020.102841] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Revised: 05/30/2020] [Accepted: 07/13/2020] [Indexed: 10/23/2022]
Abstract
Global rating scales are useful to assess clinical competence at a general level based on specific worded dimensions. The aim of this study was to translate and culturally adapt the Paramedic Global Rating Scale, and to contribute validity evidence on the instrument's usefulness in training and in clinical competence assessments of students undergoing training to become ambulance nurses and paramedics at Swedish and Finnish universities. The study included translation, expert review and inter-rater reliability (IRR) tests. The scale was translated and culturally adapted to clinical and educational settings in both countries. A content validity index (CVI) was calculated using eight experts. IRR tests were performed with five registered nurses working as university lecturers, and with six clinicians working as ambulance nurses. They individually rated the same simulated ambulance assignment. Based on the ratings, IRR was calculated with intra-class correlation (ICC). The scale showed excellent CVI at both item and scale level. The ICC indicated substantial agreement in the group of lecturers and a high degree of agreement in the group of clinicians. This study provides validity evidence for a Swedish version of the scale, supporting its use in measuring clinical competence among students undergoing training to become ambulance nurses and paramedics.
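The intra-class correlation used for the IRR tests above can be computed from two-way ANOVA mean squares. A sketch of ICC(2,1) (two-way random effects, absolute agreement, single rater) on simulated ratings; the data and the specific ICC form are illustrative assumptions, as the abstract does not state which variant was used:

```python
import numpy as np

def icc_2_1(x: np.ndarray) -> float:
    """ICC(2,1): two-way random effects, absolute agreement, single rater.
    x: (n_targets, n_raters) matrix of ratings."""
    n, k = x.shape
    grand = x.mean()
    ms_rows = k * ((x.mean(axis=1) - grand) ** 2).sum() / (n - 1)   # targets
    ms_cols = n * ((x.mean(axis=0) - grand) ** 2).sum() / (k - 1)   # raters
    resid = x - x.mean(axis=1, keepdims=True) - x.mean(axis=0, keepdims=True) + grand
    ms_err = (resid ** 2).sum() / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n)

# Simulated assignment: 30 performances each rated by 5 raters
rng = np.random.default_rng(4)
truth = rng.normal(size=(30, 1))
ratings = truth + rng.normal(scale=0.5, size=(30, 5))
print(f"ICC(2,1) = {icc_2_1(ratings):.2f}")
```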
Affiliation(s)
- Anders Bremer
- PreHospen - Centre for Prehospital Research, University of Borås, Sweden; Faculty of Caring Science, Work Life and Social Welfare, University of Borås, Sweden; Faculty of Health and Life Sciences, Linnaeus University, Växjö, Sweden.
- Magnus Andersson Hagiwara
- PreHospen - Centre for Prehospital Research, University of Borås, Sweden; Faculty of Caring Science, Work Life and Social Welfare, University of Borås, Sweden
- Walter Tavares
- The Wilson Centre, Department of Medicine, University of Toronto/University Health Network, Toronto, Canada; Post-MD Education (Post-Graduate Medical Education/Continued Professional Development), University of Toronto, Toronto, Canada; Paramedic and Senior Services, Community and Health Services Department, Regional Municipality of York, Newmarket, ON, Canada
- Heikki Paakkonen
- Department of Health and Welfare, Arcada University of Applied Sciences, Helsinki, Finland
- Patrik Nyström
- Department of Health and Welfare, Arcada University of Applied Sciences, Helsinki, Finland
- Henrik Andersson
- PreHospen - Centre for Prehospital Research, University of Borås, Sweden; Faculty of Caring Science, Work Life and Social Welfare, University of Borås, Sweden
28
Rocha SR, Romão GS, Setúbal MSV, Lajos GJ, Luz AG, Collares CF, Amaral E. Cross-Cultural Adaptation of the Communication Assessment Tool for Use in a Simulated Clinical Setting. TEACHING AND LEARNING IN MEDICINE 2020; 32:308-318. [PMID: 32090632 DOI: 10.1080/10401334.2020.1717958] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Construct: The Communication Assessment Tool (CAT) is a 14-item instrument developed in English to assess medical trainees' interpersonal communication skills from the patient's perspective in clinical settings. Background: Using validated instruments and simulated patients constitutes good practice in assessing doctor-patient communication. The CAT was designed for use in real practice, but has not yet been applied to assessing OB-GYN residents' delivery of bad news in Objective Structured Clinical Examination (OSCE) stations. This study aims to provide validity evidence for using the CAT to assess residents' interpersonal communication skills under difficult circumstances in a simulated clinical setting in Brazil. Approach: Cross-cultural adaptation comprised translation into Portuguese, synthesis of translations, and back-translation. Next, a committee of 10 external and independent experts rated the items for linguistic equivalence and relevance to the overall scale. Researchers used the expert ratings to produce a preliminary Brazilian-Portuguese version. This version was applied by four simulated patients to assess 28 OB-GYN residents completing two, 10-minute OSCE stations focused on delivering bad news. Item and scale content validity indices and internal-consistency reliability were calculated. Simulated patients were interviewed to clarify any doubt regarding the content and usability of the tool and their response process. Findings: Thirteen of the 14 items in the Brazilian-Portuguese version were considered "equivalent" by at least 70% of the experts. All items were considered relevant by 100% of the experts. The Item Content Validity Index ranged from .9 to 1, and the Scale Content Validity Index was .99. The instrument showed good reliability for both scenarios (Cronbach's alpha > .90). Simulated patients considered the CAT easy to understand and complete. 
Conclusions: This study provides validity evidence for using the Brazilian-Portuguese CAT in a simulated clinical environment to assess OB-GYN residents' delivery of bad news. Based on this study's findings, the OB-GYN Department organized an annual formative assessment for residents to improve their interpersonal communication skills. This version of the CAT may also be applicable to other specialties.
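The content validity indices reported above follow a standard recipe: the item-level CVI (I-CVI) is the proportion of experts rating an item relevant (3 or 4 on a 4-point scale), and the averaging-method scale CVI (S-CVI/Ave) is the mean of the I-CVIs. A sketch with hypothetical expert ratings (the study's values of .9-1 and .99 are not reproduced here):

```python
import numpy as np

# Hypothetical relevance ratings: 10 experts x 14 items on a 4-point scale,
# skewed toward "relevant" (3) and "highly relevant" (4)
rng = np.random.default_rng(5)
ratings = rng.choice([2, 3, 4, 4], size=(10, 14))

i_cvi = (ratings >= 3).mean(axis=0)   # item-level CVI: share of experts rating 3 or 4
s_cvi_ave = i_cvi.mean()              # scale-level CVI, averaging method
print("I-CVI per item:", np.round(i_cvi, 2))
print(f"S-CVI/Ave = {s_cvi_ave:.2f}")
```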
Affiliation(s)
- Giuliane Jesus Lajos
- Department of Obstetrics and Gynecology, University of Campinas, Campinas, Brazil
- Adriana Gomes Luz
- Department of Obstetrics and Gynecology, University of Campinas, Campinas, Brazil
- Carlos Fernando Collares
- Department of Educational Development and Research, Maastricht University, Maastricht, The Netherlands
- Eliana Amaral
- Department of Obstetrics and Gynecology, University of Campinas, Campinas, Brazil
29
Berl Q, Resseguier N, Katsogiannou M, Mauviel F, Carcopino X, Boubli L, Blanc J. Objective assessment of obstetrics residents' surgical skills in caesarean: Development and evaluation of a specific rating scale. J Gynecol Obstet Hum Reprod 2020; 50:101812. [PMID: 32439616 DOI: 10.1016/j.jogoh.2020.101812] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 05/09/2020] [Accepted: 05/11/2020] [Indexed: 11/24/2022]
Abstract
OBJECTIVE To develop a modified version of the Objective Structured Assessment of Technical Skill (OSATS) rating scale for the evaluation of surgical skills specific to caesarean, and to assess its relevance in documenting residents' learning curves during training. Secondarily, to verify the scale's stability across caesareans of differing difficulty and to compare self-assessment with hetero-assessment in order to propose a practical application of this rating scale during residency. STUDY DESIGN We conducted a multicentre observational prospective study from May 2018 to November 2018. All residents at that time could participate and fill in the rating scale after a caesarean; senior surgeons filled in the same rating scale. We analysed the correlation between self-assessments and hetero-assessments and the rating scale's sensitivity to change. The relevance of the scale's features was analysed by principal component analysis, factor analysis and reliability analysis. RESULTS In total, 234 rating scales were completed, evaluating 18 residents. Our study demonstrated that the rating scale could be used to evaluate residents' surgical skills during caesarean and to distinguish their year of residency (p < 0.001), with a high correlation between self- and hetero-assessment (intraclass correlation coefficient for the global score: 0.78; 95% CI 0.68-0.86). The principal component analysis revealed two dimensions corresponding to the two parts of the rating scale, and the factor analysis confirmed the distribution of features across these two dimensions. Cronbach's alpha indicated good internal consistency of the scale's features (0.93, 95% CI 0.82-0.95). CONCLUSION Our rating scale could be used for self-assessment during residency and as a hetero-assessment tool for validating defined stages of the internship.
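A dimensionality check like the one described above can be sketched with principal component analysis: when checklist items load on two underlying skill dimensions, the first two components should dominate the explained variance. All data here are simulated; this is not the study's analysis:

```python
import numpy as np

rng = np.random.default_rng(6)
n = 200
latent = rng.normal(size=(n, 2))            # two underlying skill dimensions
loadings = np.zeros((2, 10))
loadings[0, :5] = 1.0                       # items 1-5 measure dimension 1
loadings[1, 5:] = 1.0                       # items 6-10 measure dimension 2
items = latent @ loadings + rng.normal(scale=0.5, size=(n, 10))

# PCA via SVD of the centred item matrix
centred = items - items.mean(axis=0)
_, s, _ = np.linalg.svd(centred, full_matrices=False)
explained = s ** 2 / (s ** 2).sum()
print("variance explained by first two components:",
      round(float(explained[:2].sum()), 2))
```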
Affiliation(s)
- Quentin Berl
- Department of Obstetrics and Gynecology, Nord Hospital, APHM, Chemin des Bourrely, 13015, Marseille, France
- Noémie Resseguier
- EA 3279, Public Health, Chronic Diseases and Quality of Life, Research Unit, Aix-Marseille University, 13284, Marseille, France
- Maria Katsogiannou
- Hôpital Saint Joseph, Department of Obstetrics and Gynecology, FR-13008, Marseille, France
- Franck Mauviel
- Department of Obstetrics and Gynecology, Ste Musse Hospital, 54, rue Henri Sainte Claire Deville, 83000, Toulon, France
- Xavier Carcopino
- Department of Obstetrics and Gynecology, Nord Hospital, APHM, Chemin des Bourrely, 13015, Marseille, France; Aix-Marseille University (AMU), Univ Avignon, CNRS, IRD, IMBE UMR, Marseille, France
- Léon Boubli
- Department of Obstetrics and Gynecology, Nord Hospital, APHM, Chemin des Bourrely, 13015, Marseille, France
- Julie Blanc
- Department of Obstetrics and Gynecology, Nord Hospital, APHM, Chemin des Bourrely, 13015, Marseille, France; EA 3279, Public Health, Chronic Diseases and Quality of Life, Research Unit, Aix-Marseille University, 13284, Marseille, France
30
Groenier M, Brummer L, Bunting BP, Gallagher AG. Reliability of Observational Assessment Methods for Outcome-based Assessment of Surgical Skill: Systematic Review and Meta-analyses. JOURNAL OF SURGICAL EDUCATION 2020; 77:189-201. [PMID: 31444148 DOI: 10.1016/j.jsurg.2019.07.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 06/24/2019] [Accepted: 07/09/2019] [Indexed: 06/10/2023]
Abstract
BACKGROUND Reliable performance assessment is a necessary prerequisite for outcome-based assessment of surgical technical skill. Numerous observational instruments for technical skill assessment have been developed in recent years. However, methodological shortcomings of reported studies might negatively impinge on the interpretation of inter-rater reliability. OBJECTIVE To synthesize the evidence about the inter-rater reliability of observational instruments for technical skill assessment for high-stakes decisions. DESIGN A systematic review and meta-analysis were performed. We searched Scopus (including MEDLINE) and Pubmed, and key publications through December, 2016. This included original studies that evaluated reliability of instruments for the observational assessment of technical skills. Two reviewers independently extracted information on the primary outcome (the reliability statistic), secondary outcomes, and general information. We calculated pooled estimates using multilevel random effects meta-analyses where appropriate. RESULTS A total of 247 documents met our inclusion criteria and provided 491 inter-rater reliability estimates. Inappropriate inter-rater reliability indices were reported for 40% of the checklists estimates, 50% of the rating scales estimates and 41% of the other types of assessment instruments estimates. Only 14 documents provided sufficient information to be included in the meta-analyses. The pooled Cohen's kappa was .78 (95% CI 0.69-0.89, p < 0.001) and pooled proportion agreement was 0.84 (95% CI 0.71-0.96, p < 0.001). A moderator analysis was performed to explore the influence of type of assessment instrument as a possible source of heterogeneity. CONCLUSIONS AND RELEVANCE For high-stakes decisions, there was often insufficient information available on which to base conclusions. 
The use of suboptimal statistical methods and incomplete reporting of reliability estimates does not support the use of observational assessment instruments for technical skill for high-stakes decisions. Interpretations of inter-rater reliability should consider the reliability index and assessment instrument used. Reporting of inter-rater reliability needs to be improved by detailed descriptions of the assessment process.
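The pooled estimates above contrast raw proportion agreement with Cohen's kappa, which corrects for agreement expected by chance. A minimal sketch of why the two indices diverge, using invented pass/fail ratings rather than data from the review:

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over the same items."""
    assert len(rater_a) == len(rater_b)
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected if raters were independent, from marginal frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    expected = sum(freq_a[c] * freq_b.get(c, 0) for c in freq_a) / n ** 2
    return (observed - expected) / (1 - expected)

# Two raters scoring the same 10 checklist items (hypothetical data).
a = ["pass", "pass", "fail", "pass", "fail", "pass", "pass", "fail", "pass", "pass"]
b = ["pass", "pass", "fail", "fail", "fail", "pass", "pass", "pass", "pass", "pass"]
agreement = sum(x == y for x, y in zip(a, b)) / len(a)  # 0.8
kappa = cohens_kappa(a, b)  # lower than 0.8 once chance agreement is removed
```

Because most items are rated "pass" by both raters, much of the raw agreement is attributable to chance, so kappa falls well below proportion agreement; this gap is why the review treats the choice of index as consequential.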
Affiliation(s)
- Marleen Groenier: Technical Medical Centre, University of Twente, Enschede, The Netherlands
- Leonie Brummer: Technical Medical Centre, University of Twente, Enschede, The Netherlands
- Brendan P Bunting: Psychology Research Institute, Ulster University, Coleraine, Northern Ireland
- Anthony G Gallagher: ASSERT Centre, College of Medicine and Health, University College Cork, Cork, Ireland
31
Peters G. The role of standardized patient assessment forms in medical communication skills education. Qualitative Research in Medicine & Healthcare 2019. [DOI: 10.4081/qrmh.2019.8213]
Abstract
Communication skills training is a routine practice in medical education designed to instruct and evaluate future physicians in matters of patient-provider interaction. Based on the United States Medical Licensing Examination Step 2 Clinical Skills (CS), medical schools across the United States hire and train standardized patients (SPs) to act as patients in and evaluators of simulated interactions with medical students (MSs). Using discourse analysis, I examine how a computerized assessment form creates a particularized version of communication skills with implications for future practice. The 39-item checklist is completed by SPs following a simulated interaction designed to prepare third-year MSs for the Step 2 CS. Specifically, I analyze how the form is structured to make recognizable specific communication skills tasks, who should complete said tasks, and what varying degrees of communication skills competency are within the realm of task completion. By analyzing the form, I consider the agency of texts in medical education, the implications of technologizing communication as an institutional skill, and the limitations of enlisting SPs to evaluate communication skills competency under the guise of a patient perspective.
32
Koschmann T, Frankel R, Albers J. A Tale of Two Inquiries (or, Doing Being Competent in a Clinical Skills Exam). Teaching and Learning in Medicine 2019;31:258-269. [PMID: 30714409] [DOI: 10.1080/10401334.2018.1530597]
Abstract
Phenomenon: In high-stakes evaluations of communicative competency, data-gathering skills are commonly assessed through the use of standardized patient encounters. This article seeks to document inquiry practices in 2 such encounters in a setting designed to emulate a consequential, clinical skills examination. Approach: Drawing on the methods and findings of Conversation Analysis, we examine selected fragments seeking to understand how, in the ways in which they are organized, they produce quite different outcomes. Findings: In the first encounter, the topic of the patient's history of depression arises naturally in the course of the interview. It happens to be a checklist item for the case and the examinee receives credit for having elicited it. In the second encounter, though the examinee was the more clinically experienced, the topic does not come up and the examinee fails to receive credit. Insights: When we examine how the two inquiry sequences develop on a turn-by-turn basis, it becomes clear that the differences between inquiry practices that carefully constrain patient responses and those that leave space for patient elaboration are subtle but evident. Both types of practice, however, are presumably a part of competent clinical performance. We argue that looking carefully at how specific interactional practices operate within clinical interviews can enable us to become more articulate as to what might count as communicative competence in the clinic.
Affiliation(s)
- Timothy Koschmann: Department of Medical Education, Southern Illinois University School of Medicine, Springfield, Illinois, USA
- Richard Frankel: Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana, USA
- Janet Albers: Department of Family and Community Medicine, Southern Illinois University School of Medicine, Springfield, Illinois, USA
33
Yeates P, Cope N, Hawarden A, Bradshaw H, McCray G, Homer M. Developing a video-based method to compare and adjust examiner effects in fully nested OSCEs. Medical Education 2019;53:250-263. [PMID: 30575092] [PMCID: PMC6519246] [DOI: 10.1111/medu.13783]
Abstract
BACKGROUND Although averaging across multiple examiners' judgements reduces unwanted overall score variability in objective structured clinical examinations (OSCE), designs involving several parallel circuits of the OSCE require that different examiner cohorts collectively judge performances to the same standard in order to avoid bias. Prior research suggests the potential for important examiner-cohort effects in distributed or national examinations that could compromise fairness or patient safety, but despite their importance, these effects are rarely investigated because fully nested assessment designs make them very difficult to study. We describe initial use of a new method to measure and adjust for examiner-cohort effects on students' scores. METHODS We developed video-based examiner score comparison and adjustment (VESCA): volunteer students were filmed 'live' on 10 out of 12 OSCE stations. Following the examination, examiners additionally scored station-specific common-comparator videos, producing partial crossing between examiner cohorts. Many-facet Rasch modelling and linear mixed modelling were used to estimate and adjust for examiner-cohort effects on students' scores. RESULTS After accounting for students' ability, examiner cohorts differed substantially in their stringency or leniency (maximal global score difference of 0.47 out of 7.0 [Cohen's d = 0.96]; maximal total percentage score difference of 5.7% [Cohen's d = 1.06] for the same student ability by different examiner cohorts). Corresponding adjustment of students' global and total percentage scores altered the theoretical classification of 6.0% of students for both measures (either pass to fail or fail to pass), whereas 8.6-9.5% students' scores were altered by at least 0.5 standard deviations of student ability. 
CONCLUSIONS Despite typical reliability, the examiner cohort that students encountered had a potentially important influence on their score, emphasising the need for adequate sampling and examiner training. Development and validation of VESCA may offer a means to measure and adjust for potential systematic differences in scoring patterns that could exist between locations in distributed or national OSCE examinations, thereby ensuring equivalence and fairness.
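The adjustment idea behind VESCA can be caricatured with a simple mean-offset model: each cohort's scores on the shared comparator videos estimate its stringency, which is then subtracted from its live scores. The study itself used many-facet Rasch and linear mixed models; this sketch, with invented numbers, only illustrates the partial-crossing logic:

```python
def cohort_offsets(video_scores):
    """Each cohort's stringency, estimated as its mean score on the shared
    comparator videos minus the grand mean across all cohorts."""
    grand = sum(sum(s) / len(s) for s in video_scores.values()) / len(video_scores)
    return {c: sum(s) / len(s) - grand for c, s in video_scores.items()}

def adjust(live_score, cohort, offsets):
    """Remove the cohort's estimated leniency/stringency from a live score."""
    return live_score - offsets[cohort]

# Hypothetical: two examiner cohorts each scored the same two comparator
# videos (global scores out of 7); all numbers invented for illustration.
videos = {"circuit_A": [5.0, 4.6], "circuit_B": [4.4, 4.0]}
off = cohort_offsets(videos)              # circuit_A lenient, circuit_B stringent
adjusted = adjust(4.7, "circuit_A", off)  # pulls a lenient cohort's score down
```

Because every cohort scores the same videos, differences on the videos cannot reflect student ability, which is what makes the offsets interpretable as examiner effects.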
Affiliation(s)
- Peter Yeates: Medical School Education Research Group (MERG), Keele University School of Medicine, Keele, UK; Department of Acute Medicine, Fairfield General Hospital, Pennine Acute Hospitals NHS Trust, Bury, UK
- Natalie Cope: Medical School Education Research Group (MERG), Keele University School of Medicine, Keele, UK
- Ashley Hawarden: Royal Stoke Hospital, University Hospital of North Midlands NHS Trust, Stoke-on-Trent, UK
- Hannah Bradshaw: Royal Stoke Hospital, University Hospital of North Midlands NHS Trust, Stoke-on-Trent, UK
- Gareth McCray: Institute for Primary Care and Health Sciences, Keele University, Keele, UK
- Matt Homer: School of Education, University of Leeds, Leeds, UK
34
Daniels VJ, Pugh D. Twelve tips for developing an OSCE that measures what you want. Medical Teacher 2018;40:1208-1213. [PMID: 29069965] [DOI: 10.1080/0142159x.2017.1390214]
Abstract
The Objective Structured Clinical Examination (OSCE) is used globally for both high and low stakes assessment. Despite its extensive use, very few published articles provide a set of best practices for developing an OSCE, and of those that do, none apply a modern understanding of validity. This article provides 12 tips for developing an OSCE guided by Kane's validity framework to ensure the OSCE is assessing what it purports to measure. The 12 tips are presented in the order they would be operationalized during OSCE development.
Affiliation(s)
- Debra Pugh: Department of Medicine, University of Ottawa, Ottawa, Canada
35
Six-Year Experience in Teaching Pelvic Floor Ultrasonography Using Pelvic Floor Phantoms. Obstet Gynecol 2018;132:337-344. [DOI: 10.1097/aog.0000000000002729]
36
Pottier P, Cohen Aubart F, Steichen O, Desprets M, Pha M, Espitia A, Georgin-Lavialle S, Morel A, Hardouin JB. [Validity and reproducibility of two direct observation assessment forms for evaluation of internal medicine residents' clinical skills]. Rev Med Interne 2017;39:4-9. [PMID: 29157753] [DOI: 10.1016/j.revmed.2017.10.424]
Abstract
INTRODUCTION The reform of the third cycle of French medical studies is intended to be competency-based. In internal medicine, theoretical and practical knowledge will be assessed online through e-learning and an e-portfolio. In parallel, work on clinical skills assessment forms is currently ongoing. In this context, our aim was to assess the reproducibility and validity of two assessment forms based on direct clinical observation. METHOD A prospective, multicentric study was conducted from November 2015 to October 2016 to evaluate the French translations of the Mini-Clinical Evaluation Exercise (MINI-CEX) and the Standardized Patient Satisfaction Questionnaire (SPSQ). Included residents were assessed twice over a period of 6 months by the same pair of judges. RESULTS Nineteen residents were included. Inter-judge reproducibility was satisfactory for the MINI-CEX (intraclass correlation coefficients (ICC) between 0.4 and 0.8) and moderate for the SPSQ (ICC between 0.2 and 0.7), with good internal consistency for both questionnaires (Cronbach's alpha between 0.92 and 0.94). Significant differences between the distributions of the scores given by the judges, and significant inter-center variability, were found. CONCLUSION Although the absolute value of the scores should not be used in the evaluation process given its high variability, it could be of interest for following the progression of competencies. These forms could support resident debriefing based on the general trends indicated by the scores.
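Intraclass correlation coefficients like those reported above can be computed from a residents-by-judges score table. A self-contained sketch of the two-way random, single-measure ICC(2,1) via ANOVA mean squares, with hypothetical ratings rather than the study's data:

```python
def icc_2_1(scores):
    """ICC(2,1), two-way random effects, absolute agreement, single measure,
    from an n-subjects x k-raters table, computed via ANOVA mean squares."""
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(r) / k for r in scores]
    col_means = [sum(scores[i][j] for i in range(n)) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)    # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)    # between raters
    ss_total = sum((x - grand) ** 2 for r in scores for x in r)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Hypothetical MINI-CEX global ratings: 4 residents, each scored by 2 judges.
ratings = [[6, 7], [4, 5], [8, 8], [5, 6]]
icc = icc_2_1(ratings)  # high agreement despite one judge rating slightly higher
```

This "absolute agreement" form penalizes a judge who is systematically more lenient, which matches the study's concern about differences between the judges' score distributions.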
Affiliation(s)
- P Pottier: Service de médecine interne, Hôtel-Dieu, CHU de Nantes, place Alexis-Ricordeau, 44093 Nantes, France; SPHERE U1246, Inserm, université de Nantes-université de Tours, 44000 Nantes, France
- F Cohen Aubart: Service de médecine interne 2, hôpital de la Pitié-Salpêtrière, université Paris-VI - Pierre-et-Marie-Curie, Assistance publique-Hôpitaux de Paris, 75013 Paris, France
- O Steichen: Service de médecine interne, hôpital Tenon, UPMC université Paris 06, Sorbonne universités, AP-HP, 75970 Paris, France
- M Desprets: Service de médecine interne, Hôtel-Dieu, CHU de Nantes, place Alexis-Ricordeau, 44093 Nantes, France
- M Pha: Service de médecine interne 2, hôpital de la Pitié-Salpêtrière, université Paris-VI - Pierre-et-Marie-Curie, Assistance publique-Hôpitaux de Paris, 75013 Paris, France
- A Espitia: Service de médecine interne, Hôtel-Dieu, CHU de Nantes, place Alexis-Ricordeau, 44093 Nantes, France
- S Georgin-Lavialle: Service de médecine interne, hôpital Tenon, UPMC université Paris 06, Sorbonne universités, AP-HP, 75970 Paris, France
- A Morel: SPHERE U1246, Inserm, université de Nantes-université de Tours, 44000 Nantes, France
- J B Hardouin: SPHERE U1246, Inserm, université de Nantes-université de Tours, 44000 Nantes, France
37
Oseni Z, Than HH, Kolakowska E, Chalmers L, Hanboonkunupakarn B, McGready R. Video-based feedback as a method for training rural healthcare workers to manage medical emergencies: a pilot study. BMC Medical Education 2017;17:149. [PMID: 28859651] [PMCID: PMC5580284] [DOI: 10.1186/s12909-017-0975-3]
Abstract
BACKGROUND Video-based feedback has been shown to aid knowledge retention, skills learning and improve team functionality. We explored the use of video-based feedback and low fidelity simulation for training rural healthcare workers along the Thailand-Myanmar border and Papua New Guinea (PNG) to manage medical emergencies effectively. METHODS Twenty-four study participants were recruited from three Shoklo Malaria Research Unit clinics along the Thailand-Myanmar border and eight participants from Kudjip Nazarene Hospital, PNG. The teams were recorded on video managing a simulated medical emergency scenario and the video was used to aid feedback and assess performance using Observed Structured Clinical Examination (OSCE) scoring and Team Emergency Assessment Measure (TEAM) questionnaire. The process was repeated post-feedback at both sites and at 6 weeks at the Thailand-Myanmar border site. Thailand-Myanmar border participants' individual confidence levels and baseline knowledge (using OSCE scoring) were assessed before team assessment and feedback at week 1 and repeated post-feedback and at 6 weeks. Focus group discussions (FGD) were held at each Thailand-Myanmar border clinic at week 1 (8 participants at each clinic). RESULTS Individual paired tests of OSCE scores showed significant improvement post-feedback at week 1 (p < 0.001) and week 6 (p < 0.001) compared to baseline OSCE scores. There was a trend for increased team OSCE scores compared to baseline at week 1 (p = 0.068) and week 6 (p = 0.109) although not significant. Thailand-Myanmar border TEAM scores demonstrated improvement post-feedback mainly in leadership, teamwork and task management which was sustained up to week 6. PNG showed an improvement mainly in teamwork and task management. The global rating of the teams' non-technical performance at both sites improved post feedback and at week 6 on the Thailand-Myanmar border site. 
Self-rated confidence scores of Thailand-Myanmar border participants increased significantly from baseline following training at week 1 (p = 0.020); while still higher than baseline at 6 weeks' follow-up, this difference was not significant (p = 0.471). The FGDs revealed that the majority of participants felt that watching the video recording of their performance and the video-based feedback contributed most to their learning. CONCLUSION Video-assisted feedback resulted in an improvement in clinical knowledge, confidence and quality of teamwork for managing medical emergencies in two low-resource medical facilities in South East Asia and the South Pacific.
Affiliation(s)
- Zainab Oseni: Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, 63110 Thailand
- Hla Hla Than: Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, 63110 Thailand
- Edyta Kolakowska: Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, 63110 Thailand
- Lauren Chalmers: Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, 63110 Thailand
- Borimas Hanboonkunupakarn: Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, 10400 Thailand
- Rose McGready: Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, 63110 Thailand; Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7FZ UK
38
Abstract
Supplemental digital content is available in the text. Introduction Training emergency care skills is critical for patient safety but cost intensive. Serious games have been proposed as an engaging self-directed learning tool for complex skills. The objective of this study was to compare the cognitive skills and motivation of medical residents who only used a course manual as preparation for classroom training on emergency care with residents who used an additional serious game. Methods This was a quasi-experimental study with residents preparing for a rotation in the emergency department. The “reading” group received a course manual before classroom training; the “reading and game” group received this manual plus the game as preparation for the same training. Emergency skills were assessed before training (with residents who agreed to participate in an extra pretraining assessment), using validated competency scales and a global performance scale. We also measured motivation. Results All groups had comparable important characteristics (eg, experience with acute care). Before training, the reading and game group felt motivated to play the game and spent more self-study time (+2.5 hours) than the reading group. Game-playing residents showed higher scores on objectively measured and self-assessed clinical competencies but equal scores on the global performance scale and were equally motivated for training, compared with the reading group. After the 2-week training, no differences between groups existed. Conclusions After preparing training with an additional serious game, residents showed improved clinical competencies, compared with residents who only studied course material. After a 2-week training, this advantage disappeared. Future research should study the retention of game effects in blended designs.
39
Pascual-Ramos V, Flores-Alvarado DE, Portela-Hernández M, Maldonado-Velázquez MDR, Amezcua-Guerra LM, López-Zepeda J, Álvarez E, Rubio N, Lastra OV, Saavedra MÁ, Arce-Salinas CA. Communication skills in candidates for accreditation in rheumatology are correlated with candidate's performance in the objective structured clinical examination. ACTA ACUST UNITED AC 2017;15:97-101. [PMID: 28755908] [DOI: 10.1016/j.reuma.2017.06.007]
Abstract
BACKGROUND The Mexican Accreditation Council for Rheumatology annually certifies trainees in rheumatology using a multiple-choice test and an objective structured clinical examination (OSCE). Since 2015, candidates' communication skills (CS) have been rated both by patients and by physician examiners and correlated with results on the OSCE. This study compared the CS of candidates for annual accreditation in rheumatology as rated by patients and by physician examiners, and assessed whether these ratings correlated with the candidates' performance in the OSCE. MATERIAL AND METHODS From 2015 to 2017, eight areas of CS were evaluated using a Likert scale in each OSCE station that involved a patient. Both patient and physician evaluators were trained annually, and their evaluations were performed blindly. Associations were calculated using the Pearson correlation coefficient. RESULTS In general, candidates were given high CS scores; patients rated candidates' CS higher than physician examiners did; within the majority of the stations, the two sets of scores correlated moderately. In addition, the CS scores correlated with trainee performance at the corresponding OSCE station. Interestingly, correlations were stronger when the skills were rated by patients than by physicians. The average CS score was correlated with each trainee's overall OSCE performance, but not with the multiple-choice test, except in the 2017 accreditation process, when a weak correlation was found. CONCLUSIONS CS assessed during a national accreditation process correlated with candidates' performance at the station level and with the overall OSCE.
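The associations above rest on the sample Pearson correlation coefficient, which is straightforward to compute directly. A sketch with invented candidate scores, not the study's data:

```python
import math

def pearson_r(x, y):
    """Sample Pearson correlation coefficient between two score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical data: mean CS rating (Likert) and overall OSCE percentage
# for five candidates; all numbers invented for illustration.
cs = [3.8, 4.5, 3.5, 4.6, 4.0]
osce = [70, 75, 68, 88, 72]
r = pearson_r(cs, osce)  # strong positive correlation on these numbers
```

The same function applied station by station, versus on averaged totals, reproduces the two levels of analysis (station-level and overall OSCE) that the abstract distinguishes.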
Affiliation(s)
- Everardo Álvarez: Consejo Mexicano de Reumatología, Coyoacán, Ciudad de México, México
- Nadina Rubio: Consejo Mexicano de Reumatología, Coyoacán, Ciudad de México, México
- Olga Vera Lastra: Consejo Mexicano de Reumatología, Coyoacán, Ciudad de México, México
40
Krautter M, Diefenbacher K, Schultz JH, Maatouk I, Herrmann-Werner A, Koehl-Hackert N, Herzog W, Nikendei C. Physical examination skills training: Faculty staff vs. patient instructor feedback-A controlled trial. PLoS One 2017;12:e0180308. [PMID: 28692703] [PMCID: PMC5503248] [DOI: 10.1371/journal.pone.0180308]
Abstract
BACKGROUND Standardized patients are widely used in the training of medical students, both in teaching and in assessment. They also frequently lead complete training sessions delivering physical examination skills without the aid of faculty teaching staff, acting as "patient instructors" (PIs). An important part of this training is their ability to provide detailed, structured feedback to students, which has a strong impact on learning success. Yet, to date, no study has assessed the quality of physical-examination-related feedback given by PIs. Therefore, we conducted a randomized controlled study comparing the feedback of PIs and faculty staff following a physical examination, as assessed by students and video assessors. METHODS 14 PIs and 14 faculty staff physicians delivered feedback to 40 medical students who had performed a physical examination on the respective PI while the physicians observed the performance. The physical examination was rated by two independent video assessors to provide an objective performance standard (gold standard). The feedback of PIs and physicians was content-analyzed by two further independent video assessors based on a provided checklist and compared to the performance standard. The feedback of PIs and physicians was also rated by medical students and video assessors using a questionnaire consisting of 12 items. RESULTS There was no statistically significant difference in the overall matching of physician or PI feedback with gold-standard ratings by video assessment (p = .219). There was also no statistical difference when focusing only on items classified as major key steps (p = .802), mistakes or parts left out during the physical examination (p = .219), or mistakes in communication items (p = .517). The physicians' feedback was rated significantly better than the PIs' feedback by both students (p = .043) and video assessors (p = .034).
CONCLUSIONS In summary, our study demonstrates that trained PIs are able to provide feedback of equal quantitative value to that of faculty staff physicians with regard to a physical examination performed on them. However, both the students and the video raters judged the quality of the feedback given by the physicians to be significantly better than that of the PIs.
Affiliation(s)
- Markus Krautter: Department of Nephrology, University of Heidelberg, Heidelberg, Germany
- Katja Diefenbacher: Department of General Internal Medicine and Psychosomatics, University of Heidelberg, Medical Centre, Heidelberg, Germany
- Jobst-Hendrik Schultz: Department of General Internal Medicine and Psychosomatics, University of Heidelberg, Medical Centre, Heidelberg, Germany
- Imad Maatouk: Department of General Internal Medicine and Psychosomatics, University of Heidelberg, Medical Centre, Heidelberg, Germany
- Anne Herrmann-Werner: Department of Psychosomatic Medicine and Psychotherapy, University of Tübingen, Tübingen, Germany
- Nadja Koehl-Hackert: Department of General Practice and Health Services Research, University of Heidelberg, Heidelberg, Germany
- Wolfgang Herzog: Department of General Internal Medicine and Psychosomatics, University of Heidelberg, Medical Centre, Heidelberg, Germany
- Christoph Nikendei: Department of General Internal Medicine and Psychosomatics, University of Heidelberg, Medical Centre, Heidelberg, Germany
41
Daniels VJ, Harley D. The effect on reliability and sensitivity to level of training of combining analytic and holistic rating scales for assessing communication skills in an internal medicine resident OSCE. Patient Education and Counseling 2017;100:1382-1386. [PMID: 28228339] [DOI: 10.1016/j.pec.2017.02.014]
Abstract
OBJECTIVE Although previous research has compared checklists to rating scales for assessing communication, the purpose of this study was to compare the effect on reliability and sensitivity to level of training of an analytic, a holistic, and a combined analytic-holistic rating scale in assessing communication skills. METHODS The University of Alberta Internal Medicine Residency runs OSCEs for postgraduate year (PGY) 1 and 2 residents and another for PGY-4 residents. Communication stations were scored with an analytic scale (empathy, non-verbal skills, verbal skills, and coherence subscales) and a holistic scale. Authors analyzed reliability of individual and combined scales using generalizability theory and evaluated each scale's sensitivity to level of training. RESULTS For analytic, holistic, and combined scales, 12, 12, and 11 stations respectively yielded a Phi of 0.8 for the PGY-1,2 cohort, and 16, 16, and 14 stations yielded a Phi of 0.8 for the PGY-4 cohort. PGY-4 residents scored higher on the combined scale, the analytic rating scale, and the non-verbal and coherence subscales. CONCLUSION A combined analytic-holistic rating scale increased score reliability and was sensitive to level of training. PRACTICE IMPLICATIONS Given increased validity evidence, OSCE developers should consider combining analytic and holistic scales when assessing communication skills.
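The station counts reported above come from a generalizability-theory decision study: with person variance σ²p and single-station absolute-error variance σ²abs, the Phi coefficient for n stations is σ²p / (σ²p + σ²abs/n). A sketch of that projection with assumed variance components, not the study's estimates:

```python
def phi(var_person, var_abs_single, n_stations):
    """Phi (absolute-error) coefficient for a D-study with n stations, given
    person variance and single-station absolute-error variance."""
    return var_person / (var_person + var_abs_single / n_stations)

def stations_for(target_phi, var_person, var_abs_single):
    """Smallest number of stations whose projected Phi reaches the target."""
    n = 1
    while phi(var_person, var_abs_single, n) < target_phi:
        n += 1
    return n

# Assumed (illustrative) variance components: person variance 1.0,
# single-station absolute-error variance 3.0.
needed = stations_for(0.8, 1.0, 3.0)  # Phi = n / (n + 3), so n = 12
```

A scale that reduces the error variance relative to the person variance lowers the required station count, which is the mechanism by which the combined analytic-holistic scale reaches Phi = 0.8 with fewer stations.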
Affiliation(s)
- Dwight Harley: School of Dentistry, University of Alberta, Edmonton, Canada
42
Risse J, Busato T, Dufrost V, Perri M, Zuily S, Wahl D. [Development of an objective structured clinical examination (OSCE) for evaluating clinical competence in vascular medicine]. Journal de Médecine Vasculaire 2017;42:141-147. [PMID: 28705402] [DOI: 10.1016/j.jdmv.2017.02.002]
Abstract
OBJECTIVES Vascular medicine is now a clinical specialty in France. During their studies, students must acquire clinical reasoning in addition to technical skills. An objective structured clinical examination (OSCE) is considered the gold standard for evaluating clinical competence. Our main objective was to evaluate the feasibility and acceptability of an OSCE for the evaluation of students and, secondarily, their performance. METHODS Three clinical cases representative of the specialty were developed. The OSCE consisted of a sequence of clinical situations presented in three stations of 7 minutes each. The role of the simulated patient was played by medical students. At the end of the OSCE, observers and students completed the evaluation form. We compared performance between junior and senior vascular medicine students. Written questionnaires were used to measure satisfaction with the OSCE. RESULTS We were able to develop and organize this examination without difficulty. Fifteen students were evaluated. All participants agreed that the clinical situations were representative of vascular medicine practice, the cases were realistic, and the standardized patients were convincing. The performance of senior students was statistically higher than that of junior students in one case. DISCUSSION Our study demonstrates the feasibility and acceptability of the OSCE for students in vascular medicine. The small number of stations and candidates calls for further, larger-scale studies to evaluate student performance.
Affiliation(s)
- J Risse: Service de médecine vasculaire et centre de compétence régional des maladies vasculaires rares et systémiques auto-immunes, CHRU de Nancy, 54511 Vandoeuvre-lès-Nancy cedex, France; Inserm, UMR S 1116, université de Lorraine, 54000 Nancy, France
- T Busato: Service de médecine vasculaire et centre de compétence régional des maladies vasculaires rares et systémiques auto-immunes, CHRU de Nancy, 54511 Vandoeuvre-lès-Nancy cedex, France
- V Dufrost: Service de médecine vasculaire et centre de compétence régional des maladies vasculaires rares et systémiques auto-immunes, CHRU de Nancy, 54511 Vandoeuvre-lès-Nancy cedex, France
- M Perri: Service de médecine vasculaire et centre de compétence régional des maladies vasculaires rares et systémiques auto-immunes, CHRU de Nancy, 54511 Vandoeuvre-lès-Nancy cedex, France
- S Zuily: Service de médecine vasculaire et centre de compétence régional des maladies vasculaires rares et systémiques auto-immunes, CHRU de Nancy, 54511 Vandoeuvre-lès-Nancy cedex, France; Inserm, UMR S 1116, université de Lorraine, 54000 Nancy, France
- D Wahl: Service de médecine vasculaire et centre de compétence régional des maladies vasculaires rares et systémiques auto-immunes, CHRU de Nancy, 54511 Vandoeuvre-lès-Nancy cedex, France; Inserm, UMR S 1116, université de Lorraine, 54000 Nancy, France
43
Mafinejad MK, Rastegarpanah M, Moosavi F, Shirazi M. Training and Validation of Standardized Patients for Assessing Communication and Counseling Skills of Pharmacy Students: A Pilot Study. J Res Pharm Pract 2017;6:83-88. [PMID: 28616430] [PMCID: PMC5463554] [DOI: 10.4103/jrpp.jrpp_17_20]
Abstract
OBJECTIVE The objective of this study was to describe the process of training valid simulated patients (SPs) for assessing the communication and counseling skills of pharmacy students. METHODS This was a cross-sectional, correlational study assessing the psychometric properties of the checklists, the SPs' portrayals, and the SPs' completion of the checklists when assessing pharmacy students. Five SPs from the simulated-patient pool volunteered to take part in the project, one of whom failed. Three scenarios, along with corresponding checklists, were developed based on the usual medications for three conditions: asthma, respiratory infections, and osteoporosis. The SPs' role-play performance was video-recorded and rated independently by two experts according to an observational rating scale to assess validity. The role-play was repeated after 1 week with the same scenario and the same doctor to assess test-retest reliability. Inter-rater agreement between SPs and experts was determined by calculating the intraclass correlation coefficient and the kappa coefficient. FINDINGS The four eligible SPs were all women, with an average age of 37 years. The correlations of mean scores were 0.91 for the raters and 0.85 for the SPs, and the Pearson correlation between the raters' and SPs' mean scores was 0.75. The checklists' reliability (Cronbach's alpha) was 0.72. The weighted Cohen's kappa between each SP's ratings and the gold standard was between 0.53 and 0.57, indicating moderate agreement. The inter-rater reliability kappa coefficient between raters was 0.75 (P = 0.01). CONCLUSION The authors demonstrated a technique for using standardized patients to evaluate the communication and counseling skills of pharmacy students. The findings indicate that trained SPs can be used as an effective tool to assess pharmacy students' communication and counseling skills.
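As an aside for readers less familiar with the agreement statistic reported in this abstract: weighted Cohen's kappa discounts near-miss disagreements between two raters on an ordinal scale. The sketch below is illustrative only; the toy ratings and the 0-3 scale are invented, not the study's data.

```python
def weighted_kappa(a, b, n_cats, quadratic=True):
    """Weighted Cohen's kappa between two raters' labels in 0..n_cats-1.
    Generic implementation for illustration; not the cited study's code."""
    n = len(a)
    # observed joint distribution and the two marginals
    obs = [[0.0] * n_cats for _ in range(n_cats)]
    for x, y in zip(a, b):
        obs[x][y] += 1.0 / n
    pa = [sum(row) for row in obs]
    pb = [sum(obs[i][j] for i in range(n_cats)) for j in range(n_cats)]

    def w(i, j):  # disagreement weight: 0 on the diagonal, grows with distance
        d = abs(i - j) / (n_cats - 1)
        return d * d if quadratic else d

    disagree_obs = sum(w(i, j) * obs[i][j] for i in range(n_cats) for j in range(n_cats))
    disagree_exp = sum(w(i, j) * pa[i] * pb[j] for i in range(n_cats) for j in range(n_cats))
    return 1.0 - disagree_obs / disagree_exp

# toy example: an SP and a gold-standard rater scoring 8 encounters on a 0-3 scale
sp_scores = [0, 1, 2, 3, 2, 1, 0, 3]
gold_scores = [0, 1, 2, 3, 1, 1, 0, 2]
print(round(weighted_kappa(sp_scores, gold_scores, 4), 3))  # → 0.889
```

A kappa of 1 means perfect agreement and 0 means chance-level agreement; values in the 0.53-0.57 range reported above are conventionally read as moderate.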
Affiliation(s)
- Mahboobeh Khabaz Mafinejad: Education Development Center, Health Professions Education Research Center, Tehran University of Medical Sciences, Tehran, Iran
- Fereshteh Moosavi: Clinical Pharmacy Department, Tehran University of Medical Sciences, Tehran, Iran
- Mandana Shirazi: Department of Medical Education, Education Development Center, Tehran University of Medical Sciences, Tehran, Iran; Department of Clinical Science and Education, Soder Hospital, Karolinska Institute, Stockholm, Sweden
44
Blatt B, Plack M, Simmens S, Lopreiato J, Berg K, Klevan J, Lewis K. Do Standardized Patients Have Concerns About Students Not Captured by Traditional Assessment Forms? Teaching and Learning in Medicine 2016; 28:395-405. [PMID: 27152446 DOI: 10.1080/10401334.2016.1176573] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Indexed: 06/05/2023]
Abstract
UNLABELLED Construct: Traditionally, standardized patients (SPs) assess students' clinical skills principally through numerical rating forms, an approach that may not fully capture SPs' concerns. SPs are students' closest approximation to real patients, so to maximize the benefit to students' clinical training and evaluation it is important to find ways to give voice to the totality of SPs' perspectives. BACKGROUND SPs have been shown to be a reliable and valid means of assessing medical students' clinical skills in clinical skills examinations. We noticed, however, that SPs often express "off the record" concerns about students that they do not include on traditional assessment forms. APPROACH To explore these "off the record" concerns, we designed a Concerns item and added it to the traditional assessment form for an end-of-3rd-year clinical skills examination shared by three medical schools. We asked SPs to use this Concerns item to identify students about whom they had any "gut-level" concerns and gave them a narrative opportunity to explain why. SPs were informed that the purpose of the item was to help students with difficulties and that it was not part of the student's grade. RESULTS We analyzed the concerns data using quantitative and qualitative methods. Of 551 students at the three schools, 223 (∼40%) had concerns recorded; seventy students received two or more concerns. Qualitative analysis of SPs' comments revealed three major categories of concern, each with several subcategories: communication and interpersonal skills, history taking, and physical exam. More than half of the SPs' written comments related to the communication/interpersonal skills category and included subcategories commonly addressed in communications courses: lack of empathy, lack of good listening skills, and lack of connection to the patient. They also included subcategories that in our experience are less commonly addressed: odd or off-putting mannerisms, lack of confidence, unprofessional behavior, domineering behavior, and biased behavior. Another 47% of concerns identified deficiencies in history taking and physical examination. Among students with concerns noted by two or more SPs, the SPs' narrative comments indicated potential problems not identified by scores on the traditional assessment form for 84%, 42%, and 48% of students in the domains of communication, history, and physical exam, respectively. CONCLUSION The Concerns item is a narrative assessment method that may add value to traditional quantitative scoring by identifying and characterizing problematic student performance not captured by the traditional assessment form. It may thus give fuller voice to the totality of SPs' perspectives.
Affiliation(s)
- Benjamin Blatt: Department of Medicine and The Clinical Learning & Simulation Skills (CLASS) Center, George Washington School of Medicine and Health Sciences, Washington, DC, USA
- Margaret Plack: Physical Therapy and Health Care Sciences, George Washington School of Medicine and Health Sciences, Washington, DC, USA
- Samuel Simmens: Department of Epidemiology and Biostatistics, Milken Institute School of Public Health, George Washington University, Washington, DC, USA
- Joseph Lopreiato: Department of Pediatrics and Val G. Hemming Simulation Center, Uniformed Services University of the Health Sciences, Bethesda, Maryland, USA
- Katherine Berg: Department of Medicine, The Sidney Kimmel Medical College, Jefferson University, Philadelphia, Pennsylvania, USA
- Jacqueline Klevan: Dr. Robert and Dorothy Rector Clinical Skills and Simulation Center, The Sidney Kimmel Medical College, Jefferson University, Philadelphia, Pennsylvania, USA
- Karen Lewis: The Clinical Learning & Simulation Skills (CLASS) Center, George Washington School of Medicine and Health Sciences, Washington, DC, USA
45
[Validation of a questionnaire for standardized-patient assessment of clinical skills]. Rev Med Interne 2016; 37:802-810. [PMID: 27481203 DOI: 10.1016/j.revmed.2016.06.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 12/23/2015] [Revised: 05/04/2016] [Accepted: 06/21/2016] [Indexed: 11/22/2022]
Abstract
BACKGROUND This study was designed in the context of a new educational program based on standardized patients (SPs). Its objectives were (1) to evaluate the reliability of the assessment form used by SPs and (2) to compare the reproducibility of global ratings with that of checklists. METHOD History taking, physical examination, and communication skills were assessed by SPs at the end of clinical encounters with year 3 medical students, using generic global rating scales and case-specific checklists. The validation process followed four steps: (1) correlation analysis between each global rating and its checklist of specific items, (2) estimation of internal consistency, (3) validation of the questionnaire dimensions, and (4) estimation of the reliability of SPs' ratings compared with medical teachers' ratings. RESULTS A total of 3322 consultations were performed by 444 year 3 medical students. Statistical analysis showed good internal reliability (Cronbach's α greater than 0.7) and acceptable inter-rater reproducibility, except for communication skills. Case-specific checklists did not prove more reliable than global ratings. Reproducibility was lower with SPs' assessments than with medical teachers'. CONCLUSION Global rating-based assessment should be preferred to checklists because global ratings are faster and easier to use and require shorter SP training. As SPs proved to be acceptable examiners, no third person seems to be required as an external observer.
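For reference, the internal-consistency statistic cited above (Cronbach's α) is computed from item-level scores; the sketch below is a generic illustration with invented toy data, not the study's questionnaire.

```python
def cronbach_alpha(items):
    """Cronbach's alpha for a questionnaire: `items` is a list of items,
    each a list of scores over the same respondents. Generic formula;
    the toy data below are invented for illustration."""
    k = len(items)
    n = len(items[0])

    def pvar(xs):  # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)

    # each respondent's total score across the k items
    totals = [sum(item[j] for item in items) for j in range(n)]
    return k / (k - 1) * (1 - sum(pvar(it) for it in items) / pvar(totals))

# toy questionnaire: 3 items scored 1-5 by 6 respondents
items = [
    [4, 3, 5, 2, 4, 3],
    [4, 2, 5, 3, 4, 2],
    [5, 3, 4, 2, 5, 3],
]
print(round(cronbach_alpha(items), 2))  # → 0.89
```

Values above 0.7, as in the study, are conventionally taken to indicate acceptable internal consistency.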
46
Dankbaar MEW, Alsma J, Jansen EEH, van Merrienboer JJG, van Saase JLCM, Schuit SCE. An experimental study on the effects of a simulation game on students' clinical cognitive skills and motivation. Advances in Health Sciences Education: Theory and Practice 2016; 21:505-21. [PMID: 26433730 PMCID: PMC4923100 DOI: 10.1007/s10459-015-9641-x] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Received: 01/10/2015] [Accepted: 09/24/2015] [Indexed: 05/23/2023]
Abstract
Simulation games are becoming increasingly popular in education, but more insight into their critical design features is needed. This study investigated the effects of the fidelity of open patient cases, used in adjunct to an instructional e-module, on students' cognitive skills and motivation. We set up a three-group randomized post-test-only design: a control group working on an e-module; a cases group combining the e-module with low-fidelity, text-based patient cases; and a game group combining the e-module with a high-fidelity simulation game presenting the same cases. Participants completed questionnaires on cognitive load and motivation. After a 4-week study period, blinded assessors rated students' cognitive emergency care skills in two mannequin-based scenarios. In total, 61 students participated and were assessed: 16 control group students, 20 cases students, and 25 game students. Learning time was 2 h longer for the cases and game groups than for the control group. Acquired cognitive skills did not differ between groups. The game group experienced higher intrinsic and germane cognitive load than the cases group (p = 0.03 and 0.01) and felt more engaged (p < 0.001). Students did not profit from working on open cases (in adjunct to an e-module), which nonetheless challenged them to study longer. The e-module appeared to be very effective, while the high-fidelity game, although engaging, probably distracted students and impeded learning. Medical educators designing motivating and effective skills training for novices should align case complexity and fidelity with students' proficiency level. The relation between case fidelity, motivation, and skills development is an important field for further study.
Affiliation(s)
- Mary E W Dankbaar: Institute of Medical Education Research, Erasmus MC, University Medical Center Rotterdam, Room Ae-234, PO Box 2040, 3000 CA, Rotterdam, The Netherlands
- Jelmer Alsma: Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Els E H Jansen: Department of Emergency Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Jan L C M van Saase: Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Stephanie C E Schuit: Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands; Department of Emergency Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
|
47
|
Mahoney JM, Vardaxis V, Anwar N, Hagenbucher J. Relationship Between Faculty and Standardized Patient Assessment Scores of Podiatric Medical Students During a Standardized Performance Assessment Laboratory. J Am Podiatr Med Assoc 2016; 106:116-20. [PMID: 27031547 DOI: 10.7547/14-149] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 02/03/2023]
Abstract
BACKGROUND Direct assessment of health professional students' performance of clinical skills can be accurately performed in the standardized performance assessment laboratory (SPAL), typically by health professional faculty. However, owing to time and economic considerations, nonmedical individuals (standardized patients [SPs]) have been specially trained to perform the same function. This study compared the assessment scores of the history and physical examination components of a SPAL designed for second-year podiatric medical students at Des Moines University (DMU), as rated by a podiatric medical faculty member and by SPs. METHODS A total of 101 students from the classes of 2015 and 2016 were evaluated in 2013 and 2014 by 11 to 13 SPs from the DMU SPAL program. The video recordings of these 101 students were then evaluated by one faculty member from the College of Podiatric Medicine and Surgery at DMU. RESULTS The Pearson correlation coefficient for each class showed a strong linear relationship between SP and faculty assessment scores. The associations between SP and faculty assessment scores in the history, physical examination, and combined history and physical examination components for the 2016 class (0.706, 0.925, and 0.911, respectively) were stronger than those for the 2015 class (0.697, 0.791, and 0.791, respectively). CONCLUSIONS This study indicated strong associations between the assessment scores of trained SPs and faculty for the history, physical examination, and combined history and physical examination components of the second-year SPAL activity for podiatric medical students.
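Whether two class-level correlations such as those above (e.g. 0.925 vs. 0.791) differ significantly is conventionally tested with Fisher's r-to-z transformation, the same procedure used in the cohort study at the head of this list. A minimal sketch; the per-class sample sizes here are assumed for illustration, not taken from the paper.

```python
import math

def compare_correlations(r1, n1, r2, n2):
    """Two-sided z test for the difference between two independent
    Pearson correlations, via Fisher's r-to-z transformation."""
    z1, z2 = math.atanh(r1), math.atanh(r2)          # r-to-z transform
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # SE of the difference
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2))             # two-sided p-value
    return z, p

# e.g. the 2016 class's physical-exam r = 0.925 vs. the 2015 class's
# r = 0.791, assuming roughly 50 students per class:
z, p = compare_correlations(0.925, 50, 0.791, 50)
```

With these assumed sample sizes the difference would reach significance (z about 2.7, p < 0.01); with much smaller classes it might not, which is why the actual per-class n matters.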
Affiliation(s)
- James M. Mahoney: Department of Podiatric Medicine, College of Podiatric Medicine and Surgery, Des Moines University, Des Moines, IA
- Noreen Anwar: College of Podiatric Medicine and Surgery, Des Moines University, Des Moines, IA
- Jacob Hagenbucher: College of Podiatric Medicine and Surgery, Des Moines University, Des Moines, IA
48
Atkins S, Roberts C, Hawthorne K, Greenhalgh T. Simulated consultations: a sociolinguistic perspective. BMC Medical Education 2016; 16:16. [PMID: 26768421 PMCID: PMC4714536 DOI: 10.1186/s12909-016-0535-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 09/18/2015] [Accepted: 01/06/2016] [Indexed: 05/24/2023]
Abstract
BACKGROUND Assessment of consulting skills using simulated patients is widespread in medical education. Most research into such assessment is sited in a statistical paradigm that focuses on the psychometric properties or replicability of such tests. Equally important, but less researched, is the question of how far consultations with simulated patients reflect real clinical encounters, for which sociolinguistics, defined as the study of language in its socio-cultural context, provides a helpful analytic lens. DISCUSSION In this debate article, we draw on a detailed empirical study of assessed role-plays, involving sociolinguistic analysis of talk in OSCE interactions. We consider critically the evidence for the simulated consultation (a) as a proxy for the real; (b) as performance; (c) as a context for assessing talk; and (d) as potentially disadvantaging candidates trained overseas. Talk is always a performance in context, especially in professional situations (such as the consultation) and institutional ones (the assessment of professional skills and competence). Candidates who can handle the social and linguistic complexities of the artificial context of assessed role-plays score highly, yet what is being assessed is not real professional communication but the ability to voice a credible appearance of such communication. Fidelity may not be the primary objective of simulation for medical training, where it enables the practising of skills. However, the linguistic problems and differences that arise from interacting in artificial settings are of considerable importance in assessment, where we must be sure that the exam construct adequately embodies the skills expected for real-life practice. The reproducibility of assessed simulations should not be confused with their validity. Sociolinguistic analysis of simulations in various professional contexts has identified evidence for the gap between real interactions and assessed role-plays. The contextual conditions of the simulated consultation both expect and reward a particular interactional style. Whilst simulation undoubtedly has a place in formative learning for professional communication, the simulated consultation may distort assessment of professional communication. These sociolinguistic findings contribute to the ongoing critique of simulations in high-stakes assessments and indicate that further research, stepping outside psychometric approaches, is necessary.
Affiliation(s)
- Sarah Atkins: Centre for Research in Applied Linguistics, Trent Building, University of Nottingham, Nottingham, NG7 2RD, UK
- Celia Roberts: Department of Education & Professional Studies, King's College London, Franklin-Wilkins Building, Waterloo Road, London, SE1 9NH, UK
- Kamila Hawthorne: Duke of Kent Building, Faculty of Health and Medical Sciences, University of Surrey, Surrey, GU2 7XH, UK
- Trisha Greenhalgh: Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, OX2 6GG, UK
|
49
|
Swanson DB, Roberts TE. Trends in national licensing examinations in medicine. Medical Education 2016; 50:101-14. [PMID: 26695470 DOI: 10.1111/medu.12810] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Received: 04/06/2015] [Revised: 05/01/2015] [Accepted: 06/09/2015] [Indexed: 05/09/2023]
Abstract
CONTEXT As a contribution to this special issue commemorating the journal's 50th volume, this paper seeks to explore directions for national licensing examinations (NLEs) in medicine. Increases in the numbers of new medical schools and the mobility of doctors across national borders mean that NLEs are becoming even more important to ensuring physician competence. OBJECTIVES The purpose of this paper is to explore the use of NLEs in the future in the context of global changes in medical education and health care delivery. METHODS Because the literature related to NLEs is so large, we have not attempted a comprehensive review, but have focused instead on a small number of topics on which we think we have something useful to say. The paper is organised around five predicted trends for NLEs. DISCUSSION The first section discusses reasons why we think the use of NLEs will increase in the coming years. The second section discusses the ongoing problem of content specificity and its implications for the design of NLEs. The third section examines the evolution of large-scale, standardised cognitive assessments in NLEs and suggests some future directions. Reflecting the fact that NLEs are, increasingly, attempting to assess more than just knowledge, the fourth section addresses the future of large-scale clinical skills assessments in NLEs, predicting both increases in their use and some shifts in the nature of the stations used. The fifth section discusses workplace-based assessments, predicting increases in their use for formative assessment and identifying some limitations in their direct application in NLEs. The concluding section discusses the cost of NLEs and indulges in some further speculations about their evolution.
Affiliation(s)
- David B Swanson: Academic Programmes and Services, American Board of Medical Specialties, Chicago, Illinois, USA; Department of Medical Education, University of Melbourne Medical School, Melbourne, Victoria, Australia
- Trudie E Roberts: Leeds Institute of Medical Education, University of Leeds, Leeds, UK
50
Hatala R, Cook DA, Brydges R, Hawkins R. Constructing a validity argument for the Objective Structured Assessment of Technical Skills (OSATS): a systematic review of validity evidence. Advances in Health Sciences Education: Theory and Practice 2015; 20:1149-75. [PMID: 25702196 DOI: 10.1007/s10459-015-9593-1] [Citation(s) in RCA: 94] [Impact Index Per Article: 10.4] [Received: 03/20/2014] [Accepted: 02/15/2015] [Indexed: 05/28/2023]
Abstract
In order to construct and evaluate the validity argument for the Objective Structured Assessment of Technical Skills (OSATS), based on Kane's framework, we conducted a systematic review. We searched MEDLINE, EMBASE, CINAHL, PsycINFO, ERIC, Web of Science, Scopus, and selected reference lists through February 2013. Working in duplicate, we selected original research articles in any language evaluating the OSATS as an assessment tool for any health professional. We iteratively and collaboratively extracted validity evidence from included articles to construct and evaluate the validity argument for varied uses of the OSATS. Twenty-nine articles met the inclusion criteria, all focussed on surgical technical skills assessment. We identified three intended uses for the OSATS, namely formative feedback, high-stakes assessment and program evaluation. Following Kane's framework, four inferences in the validity argument were examined (scoring, generalization, extrapolation, decision). For formative feedback and high-stakes assessment, there was reasonable evidence for scoring and extrapolation. However, for high-stakes assessment there was a dearth of evidence for generalization aside from inter-rater reliability data and an absence of evidence linking multi-station OSATS scores to performance in real clinical settings. For program evaluation, the OSATS validity argument was supported by reasonable generalization and extrapolation evidence. There was a complete lack of evidence regarding implications and decisions based on OSATS scores. In general, validity evidence supported the use of the OSATS for formative feedback. Research to provide support for decisions based on OSATS scores is required if the OSATS is to be used for higher-stakes decisions and program evaluation.
Affiliation(s)
- Rose Hatala: Department of Medicine, University of British Columbia, Suite 5907, Burrard Bldg, St. Paul's Hospital, 1081 Burrard St, Vancouver, BC, V6Z 1Y6, Canada
- David A Cook: Mayo Clinic Online Learning and Mayo Clinic Multidisciplinary Simulation Center, Mayo Clinic College of Medicine, Rochester, MN, USA; Division of General Internal Medicine, Mayo Clinic, Rochester, MN, USA
- Ryan Brydges: Wilson Centre, University Health Network, Toronto, ON, Canada
- Richard Hawkins: Medical Education Programs, American Medical Association, Chicago, IL, USA