1
Roche SM, Renaud DL, Saraceni J, Kelton DF, DeVries TJ. Invited review: Prevalence, risk factors, treatment, and barriers to best practice adoption for lameness and injuries in dairy cattle: A narrative review. J Dairy Sci 2024; 107:3347-3366. PMID: 38101730. DOI: 10.3168/jds.2023-23870.
Abstract
Lameness and leg injuries are both painful and prevalent across the dairy industry, and are a major welfare concern. There has been a considerable amount of research focused on investigating the risk factors associated with lameness and injuries and how they might be prevented and treated. The objectives of this narrative review were to summarize herd-level prevalence estimates, risk factors, strategies for prevention, control, and treatment of these conditions, and the barriers to best practice adoption for lameness and injuries on dairy farms. There is a relatively high within-herd prevalence of lameness on dairy farms globally, with a recent systematic review estimating the mean prevalence at 22.8%. Similarly, there is a relatively high prevalence of hock injuries, with within-herd estimates ranging from 12% to 81% of cows affected. Knee and neck injuries have been reported to be less common, with within-herd estimates of 6% to 43% and 1% to 33% of cows affected, respectively. Numerous risk factors have been associated with the incidence of lameness, notably housing (e.g., access to pasture, bedding depth, bedding type, flooring type, stall design), management (e.g., stall cleanliness, frequency of trimming, holding times, stocking density), and cow-level (e.g., body condition, parity, injured hocks) factors. Risk factors associated with hock injuries can be similarly classified into housing (e.g., bedding type and depth, outdoor access, parlor type, stall design), management (e.g., bedding depth, cleanliness), and cow (e.g., parity, days in milk, lameness) factors. Key preventative approaches for lameness include routine preventative and corrective hoof trimming, improving hoof cushioning and traction through access to pasture or adding rubber flooring, deep-bedded stalls, sand bedding, ensuring appropriate stocking densities, reduced holding times, and the frequent use of routine footbaths. Very little research has been conducted on hock, knee, and neck injury prevention and recovery.
Numerous researchers have concluded that both extrinsic (e.g., time, money, space) and intrinsic (e.g., farmer attitude, perception, priorities, and mindset) barriers exist to addressing lameness and injuries on dairy farms. There are many diverse stakeholders in lameness and injury management including the farmer, farm staff, veterinarian, hoof trimmer, nutritionist, and other advisors. Addressing dairy cattle lameness and injuries must, therefore, consider the people involved, as it is these people who are influencing and implementing on-farm decisions related to lameness prevention, treatment, and control.
Affiliation(s)
- S M Roche: Department of Population Medicine, University of Guelph, Guelph, ON, Canada, N1G 2W1; ACER Consulting Ltd., Guelph, ON, Canada, N1G 5L3
- D L Renaud: Department of Population Medicine, University of Guelph, Guelph, ON, Canada, N1G 2W1
- J Saraceni: ACER Consulting Ltd., Guelph, ON, Canada, N1G 5L3
- D F Kelton: Department of Population Medicine, University of Guelph, Guelph, ON, Canada, N1G 2W1
- T J DeVries: Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada, N1G 2W1
2
Yang D, Draganov PV, Pohl H, Aihara H, Jeyalingam T, Khashab M, Liu N, Hasan MK, Jawaid S, Othman M, Al-Haddad M, DeWitt JM, Triggs JR, Wang AY, Bechara R, Sethi A, Law R, Aadam AA, Kumta N, Sharma N, Hayat M, Zhang Y, Yi F, Elmunzer BJ. Development and initial validation of a video-based peroral endoscopic myotomy assessment tool. Gastrointest Endosc 2024; 99:177-185. PMID: 37500019. DOI: 10.1016/j.gie.2023.07.032.
Abstract
BACKGROUND AND AIMS Video analysis has emerged as a potential strategy for performance assessment and improvement. We aimed to develop a video-based skill assessment tool for peroral endoscopic myotomy (POEM). METHODS POEM was deconstructed into basic procedural components through video analysis by an expert panel. A modified Delphi approach and 2 validation exercises were conducted to refine the POEM assessment tool (POEMAT). Twelve assessors used the final POEMAT version to grade 10 videos. Fully crossed generalizability (G) studies investigated the contributions of assessors, endoscopists' performance, and technical elements to reliability. G coefficients below .5 were considered unreliable, between .5 and .7 as modestly reliable, and above .7 as indicative of satisfactory reliability. RESULTS After task deconstruction, discussions, and the modified Delphi process, the final POEMAT comprised 9 technical elements. G analysis showed low variance for endoscopist performance (.8%-24.9%) and high interrater variability (range, 63.2%-90.1%). The G score was moderately reliable (≥.60) for "submucosal tunneling" and "myotomy" and satisfactorily reliable (≥.70) for "active hemostasis" and "mucosal closure." CONCLUSIONS We developed and established initial content and response process validity evidence for the POEMAT. Future steps include appraisal of the tool using a wider range of POEM videos to establish and improve the discriminative validity of this tool.
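The generalizability logic behind these results can be illustrated with a minimal sketch. It assumes a fully crossed persons × raters design with one score per cell, which is a simplification of the POEMAT G study (that design also crossed technical elements); the function name and data below are illustrative, not taken from the paper.

```python
def g_coefficient(scores):
    """Relative G coefficient for a fully crossed persons x raters design.

    scores[p][r] is the score rater r gave endoscopist p. With one
    observation per cell, the person-by-rater interaction is confounded
    with residual error, so both end up in the error term.
    """
    n_p, n_r = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n_p * n_r)
    p_means = [sum(row) / n_r for row in scores]
    r_means = [sum(scores[p][r] for p in range(n_p)) / n_p for r in range(n_r)]

    # Two-way ANOVA sums of squares and mean squares.
    ss_p = n_r * sum((m - grand) ** 2 for m in p_means)
    ss_r = n_p * sum((m - grand) ** 2 for m in r_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ms_p = ss_p / (n_p - 1)
    ms_err = (ss_total - ss_p - ss_r) / ((n_p - 1) * (n_r - 1))

    var_p = max((ms_p - ms_err) / n_r, 0.0)  # person (true-score) variance
    # Relative G: person variance over itself plus error averaged
    # over the n_r raters actually used.
    return var_p / (var_p + ms_err / n_r)
```

With perfect rater agreement the coefficient is 1.0; rater disagreement inflates the error term and pulls it toward 0, which is how high interrater variability can coexist with only modest G scores, as reported above.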
Affiliation(s)
- Dennis Yang: Center for Interventional Endoscopy, AdventHealth, Orlando, Florida, USA
- Peter V Draganov: Division of Gastroenterology and Hepatology, University of Florida, Gainesville, Florida, USA
- Heiko Pohl: Veterans Affairs Medical Center, White River Junction, Vermont, USA; Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA
- Hiroyuki Aihara: Division of Gastroenterology, Hepatology and Endoscopy, Brigham and Women's Hospital, Boston, Massachusetts, USA
- Thurarshen Jeyalingam: Division of Gastroenterology and Hepatology, University of Toronto, Toronto, Ontario, Canada
- Mouen Khashab: Division of Gastroenterology and Hepatology, Johns Hopkins Hospital, Baltimore, Maryland, USA
- Nanlong Liu: Division of Gastroenterology, University of Louisville, Louisville, Kentucky, USA
- Muhammad K Hasan: Center for Interventional Endoscopy, AdventHealth, Orlando, Florida, USA
- Salmaan Jawaid: Division of Gastroenterology, Baylor College of Medicine, Houston, Texas, USA
- Mohamed Othman: Division of Gastroenterology, Baylor College of Medicine, Houston, Texas, USA
- Mohamed Al-Haddad: Department of Gastroenterology and Hepatology, Indiana University School of Medicine, Indianapolis, Indiana, USA
- John M DeWitt: Department of Gastroenterology and Hepatology, Indiana University School of Medicine, Indianapolis, Indiana, USA
- Joseph R Triggs: Division of Gastroenterology, Fox Chase Cancer Center, Temple Health, Philadelphia, Pennsylvania, USA
- Andrew Y Wang: Division of Gastroenterology and Hepatology, University of Virginia, Charlottesville, Virginia, USA
- Robert Bechara: Division of Gastroenterology and GI Diseases Research Unit, Queen's University, Kingston, Ontario, Canada
- Amrita Sethi: Division of Digestive and Liver Diseases, Columbia University Irving Medical Center, Presbyterian Hospital, New York, New York, USA
- Ryan Law: Division of Gastroenterology and Hepatology, Mayo Clinic, Minneapolis, Minnesota, USA
- Aziz A Aadam: Division of Gastroenterology and Hepatology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Nikhil Kumta: Henry D. Janowitz Division of Gastroenterology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Neil Sharma: Division of Interventional Oncology and Surgical Endoscopy (IOSE), Parkview Cancer Institute, Fort Wayne, Indiana, USA
- Maham Hayat: Center for Interventional Endoscopy, AdventHealth, Orlando, Florida, USA
- YiYang Zhang: Center for Collaborative Research, AdventHealth Research Institute, Orlando, Florida, USA
- Fanchao Yi: Center for Collaborative Research, AdventHealth Research Institute, Orlando, Florida, USA
- B Joseph Elmunzer: Department of Gastroenterology and Hepatology, Medical University of South Carolina, Charleston, South Carolina, USA
3
Breil SM, Lievens F, Forthmann B, Back MD. Interpersonal behavior in assessment center role-play exercises: Investigating structure, consistency, and effectiveness. Personnel Psychology 2022. DOI: 10.1111/peps.12507.
4
Miller E, Brooks D, O'Brien KK, Beavers L, Stratford P, Nonoyama M, Mori B. Assessing the inter-rater and intra-rater reliability of the Physical Therapy Competence Assessment for Airway Suctioning (PT-CAAS). Physiotherapy Research International 2022; 27:e1944. PMID: 35174940. DOI: 10.1002/pri.1944.
Abstract
BACKGROUND AND PURPOSE The Physical Therapy Competence Assessment for Airway Suctioning (PT-CAAS) is a recently developed measure to assess the clinical competence of physiotherapists who perform airway suctioning with adults. The purpose of this study was to assess the inter-rater and intra-rater reliability of the PT-CAAS. METHODS Scoring rules were developed through expert consultation. Reliability was then assessed using nine videos of suctioning performed in a simulated learning environment. A repeated measures design was used, with two replicate sets of measurements made by each participant for all videos. Data were analyzed using a repeated measures model for the concurrent assessment of inter-rater and intra-rater reliability. Participants were physiotherapists with suctioning experience. RESULTS Twenty physiotherapists completed initial scoring and re-scoring for all nine videos; their data were included in the analysis. Intraclass correlation coefficients (ICCs) for inter-rater reliability ranged from 0.569 [lower one-sided 95% confidence interval (CI): 0.395; standard error of measurement (SEM): 0.963] for infection control to 0.759 (lower one-sided 95% CI: 0.612; SEM: 0.722) for post-suctioning assessment and care. The inter-rater ICC for overall performance was 0.752 (lower one-sided 95% CI: 0.602; SEM: 0.660). ICCs for intra-rater reliability ranged from 0.759 (lower one-sided 95% CI: 0.197; SEM 0.721) for infection control to 0.860 (lower one-sided 95% CI: 0.544; SEM: 0.550) for post-suctioning assessment and care. The intra-rater ICC for overall performance was 0.867 (lower one-sided 95% CI: 0.559; SEM: 0.483). DISCUSSION Evidence of moderate to good inter-rater and good intra-rater reliability was found; however, the results should be interpreted with caution given the wide CIs and relatively large SEMs. Improved assessor training and assessments of reliability using a larger sample size are recommended.
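As a rough illustration of the statistics reported above, the sketch below computes a Shrout-Fleiss ICC(2,1) (two-way random effects, absolute agreement, single rater) and an SEM from a subjects × raters score matrix. It simplifies the paper's repeated-measures model, and the SEM definition used here (SD × sqrt(1 − ICC)) is one common choice assumed for illustration; the paper does not state its exact formula.

```python
import math

def icc_2_1(scores):
    """Shrout-Fleiss ICC(2,1) plus an SEM for scores[s][r] = rating of
    subject s by rater r (two-way layout, one rating per cell)."""
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    s_means = [sum(row) / k for row in scores]
    r_means = [sum(scores[s][r] for s in range(n)) / n for r in range(k)]

    # Two-way ANOVA decomposition: subjects, raters, residual.
    ss_subjects = k * sum((m - grand) ** 2 for m in s_means)
    ss_raters = n * sum((m - grand) ** 2 for m in r_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ms_subjects = ss_subjects / (n - 1)
    ms_raters = ss_raters / (k - 1)
    ms_error = (ss_total - ss_subjects - ss_raters) / ((n - 1) * (k - 1))

    icc = (ms_subjects - ms_error) / (
        ms_subjects + (k - 1) * ms_error + k * (ms_raters - ms_error) / n)
    sd = math.sqrt(ss_total / (n * k - 1))      # SD over all observations
    sem = sd * math.sqrt(max(1.0 - icc, 0.0))   # SEM = SD * sqrt(1 - ICC)
    return icc, sem
```

Because absolute-agreement ICCs penalize systematic rater differences, a higher ICC drives the SEM toward zero, matching the pattern above where the most reliable domains also show the smallest SEMs.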
Affiliation(s)
- Erin Miller: Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada; Department of Physical Therapy, University of Toronto, Toronto, Ontario, Canada
- Dina Brooks: Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada; Department of Physical Therapy, University of Toronto, Toronto, Ontario, Canada; School of Rehabilitation Science, McMaster University, Hamilton, Ontario, Canada; West Park Healthcare Centre, Toronto, Ontario, Canada; Department of Medicine, University of Toronto, Toronto, Ontario, Canada
- Kelly K O'Brien: Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada; Department of Physical Therapy, University of Toronto, Toronto, Ontario, Canada; Institute of Health Policy, Management and Evaluation (IHPME), University of Toronto, Toronto, Ontario, Canada
- Lindsay Beavers: Department of Physical Therapy, University of Toronto, Toronto, Ontario, Canada; Unity Health Toronto, Toronto, Ontario, Canada
- Paul Stratford: School of Rehabilitation Science, McMaster University, Hamilton, Ontario, Canada
- Mika Nonoyama: Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada; Department of Physical Therapy, University of Toronto, Toronto, Ontario, Canada; Faculty of Health Sciences, Ontario Tech University, Oshawa, Ontario, Canada; Respiratory Therapy & Child Health Evaluative Sciences, Hospital for Sick Children, Toronto, Ontario, Canada
- Brenda Mori: Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada; Department of Physical Therapy, University of Toronto, Toronto, Ontario, Canada
5
Rusling M, Masin D, Voss M, Gottumukkala P, Keenan C, Botten M, Chambers D, Parrill C, Dube J, Tucker JR. Medical student coping and performance in simulated disasters. Anxiety, Stress, and Coping 2021; 34:766-777. PMID: 33896294. DOI: 10.1080/10615806.2021.1916481.
Abstract
OBJECTIVES Coping with the stress of real and simulated disasters is thought to be integral to the performance of emergency medicine providers. Yet, little is known about which coping strategies are employed in these scenarios and whether differential use of strategies predicts actual clinical and interpersonal performance. METHODS Thirty-four medical students were evaluated by trained simulated patients and physician observers across 111 clinical encounters during a simulated disaster. Linear Mixed Effects Modelling was used to test study hypotheses while accounting for demographic variables, psychological factors, and the dependency of multiple encounters for each participant. RESULTS Results indicated that multilevel modeling was necessary. Positive thinking positively predicted observed clinical performance whereas avoidant coping was a negative predictor. Anticipatory anxiety and positive affect, but not reported coping, positively predicted student interpersonal performance. CONCLUSIONS The present study indicates that the way medical students report managing the stress of disaster scenarios has clear links to their observed clinical performance above and beyond demographic and psychological factors. It further demonstrates the feasibility of empirically identifying specific coping strategies that may be important targets for disaster response training.
Affiliation(s)
- Matthew Rusling, Daniel Masin, Marcus Voss, Pooja Gottumukkala, Corey Keenan, Marijo Botten, Davis Chambers, Chris Parrill, John Dube, and Jeritt R Tucker: Department of Behavioral Medicine, Medical Humanities & Bioethics at Des Moines University, Des Moines, IA, USA
6
Koedijk M, Renden PG, Oudejans RRD, Kleygrewe L, Hutter RIV. Observational Behavior Assessment for Psychological Competencies in Police Officers: A Proposed Methodology for Instrument Development. Front Psychol 2021; 12:589258. PMID: 33732178. PMCID: PMC7959728. DOI: 10.3389/fpsyg.2021.589258.
Abstract
This paper proposes and showcases a methodology to develop an observational behavior assessment instrument to assess psychological competencies of police officers. We outline a step-by-step methodology for police organizations to measure and evaluate behavior in a meaningful way to assess these competencies. We illustrate the proposed methodology with a practical example. We posit that direct behavioral observation can be key in measuring the expression of psychological competence in practice, and that psychological competence in practice is what police organizations should care about. We hope this paper offers police organizations a methodology to perform scientifically informed observational behavior assessment of their police officers’ psychological competencies and inspires additional research efforts into this important area.
Affiliation(s)
- Matthijs Koedijk: Department of Human Movement Sciences, Faculty of Behavioural and Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, Netherlands; Institute of Brain and Behavior Amsterdam, Amsterdam, Netherlands
- Peter G Renden: Department of Human Movement Sciences, Faculty of Behavioural and Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, Netherlands; Faculty of Health, Nutrition and Sport, The Hague University of Applied Sciences, The Hague, Netherlands
- Raôul R D Oudejans: Department of Human Movement Sciences, Faculty of Behavioural and Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, Netherlands; Institute of Brain and Behavior Amsterdam, Amsterdam, Netherlands; Faculty of Sports and Nutrition, Amsterdam University of Applied Sciences, Amsterdam, Netherlands
- Lisanne Kleygrewe: Department of Human Movement Sciences, Faculty of Behavioural and Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, Netherlands; Institute of Brain and Behavior Amsterdam, Amsterdam, Netherlands
- R I Vana Hutter: Department of Human Movement Sciences, Faculty of Behavioural and Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, Netherlands; Institute of Brain and Behavior Amsterdam, Amsterdam, Netherlands
7
The profile of the ‘Good Judge’ in HRM: A systematic review and agenda for future research. Human Resource Management Review 2020. DOI: 10.1016/j.hrmr.2018.09.003.
8
Buckett A, Becker JR, Melchers KG, Roodt G. How Different Indicator-Dimension Ratios in Assessment Center Ratings Affect Evidence for Dimension Factors. Front Psychol 2020; 11:459. PMID: 32265785. PMCID: PMC7105720. DOI: 10.3389/fpsyg.2020.00459.
Abstract
Previous research on the construct validity of assessment center (AC) ratings has usually struggled to find support for dimension factors as an underlying source of variance of these ratings. Confirmatory factor analysis (CFA) remains the most widely used method to specify and validate the internal structure of AC ratings. However, the research support for dimension effects in AC ratings remains mixed. In addition, competing CFA models (e.g., correlated dimensions-correlated exercises models) are often plagued by non-convergence and estimation problems. Recently, it has been proposed that increasing the number of indicators per dimension and exercise combination might help to find support for dimension factors, in addition to exercise factors, in CFAs of AC ratings. Furthermore, it was also suggested that the increased ratio of indicators to dimensions may also solve some of the methodological problems associated with CFA models used to model AC ratings. However, in this research it remained unclear whether the support for dimension factors was solely due to the use of a larger indicator-dimension ratio or due to parceling that combines several behavioral indicators per dimension and exercise combination into more reliable measures of the targeted dimension. These are important empirical questions that have been left unanswered in the literature but can be potentially meaningful in seeking more balanced support for dimension effects in AC research. Using data from N = 213 participants from a 1-day AC, we aimed to investigate the impact of using different indicator-dimension ratios when specifying CFA models of AC ratings. Therefore, we investigated the impact of using different indicator-dimension ratios in the form of item parcels with data from an operational AC. On average, using three parcels eventually led to support for dimension factors in CFAs. However, exercise-based CFA models still performed better than dimension-based models. 
Thus, the present results point out potential limits concerning the generalizability of recent results that provided support for dimension factors in ACs.
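The parceling manipulation at the heart of this study can be sketched as follows: item-level behavioral indicator scores for one dimension-exercise combination are combined into a chosen number of parcels, each the mean of its indicators. The round-robin assignment and function name below are illustrative assumptions, since the abstract does not prescribe a single parceling scheme, and fitting the competing CFA models would additionally require an SEM package.

```python
def make_parcels(indicator_scores, n_parcels):
    """Average item-level indicator scores into parcel scores for one
    dimension-exercise combination of a single candidate.

    Indicators are dealt to parcels round-robin; each parcel score is
    the mean of the indicators assigned to it.
    """
    if not 1 <= n_parcels <= len(indicator_scores):
        raise ValueError("need at least one indicator per parcel")
    parcels = [[] for _ in range(n_parcels)]
    for i, score in enumerate(indicator_scores):
        parcels[i % n_parcels].append(score)
    return [sum(p) / len(p) for p in parcels]
```

For six indicators and three parcels, `make_parcels([1, 2, 3, 4, 5, 6], 3)` yields `[2.5, 3.5, 4.5]`; the CFA is then fit on parcel scores rather than on raw indicators, which is why parcel count and indicator-dimension ratio are confounded unless varied deliberately, as the study does.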
Affiliation(s)
- Anne Buckett: Department of Industrial Psychology and People Management, University of Johannesburg, Johannesburg, South Africa
- Jürgen Reiner Becker: Department of Industrial Psychology, University of the Western Cape, Cape Town, South Africa
- Klaus G Melchers: Institut für Psychologie und Pädagogik, Universität Ulm, Ulm, Germany
- Gert Roodt: Department of Industrial Psychology and People Management, University of Johannesburg, Johannesburg, South Africa
9
Patnaik R, Anton NE, Stefanidis D. A video anchored rating scale leads to high inter-rater reliability of inexperienced and expert raters in the absence of rater training. Am J Surg 2020; 219:221-226. PMID: 31918843. PMCID: PMC10495932. DOI: 10.1016/j.amjsurg.2019.12.026.
Abstract
BACKGROUND Our objective was to assess the impact of incorporating videos in a behaviorally anchored performance rating scale on the inter-rater reliability (IRR) of expert, intermediate and novice raters. METHODS The Intra-corporeal Suturing Assessment Tool (ISAT) was modified to include short video clips demonstrating poor, average, and expert performances. Blinded raters used this tool to assess videos of trainees performing suturing on a porcine model. Three attending surgeons, 4 residents, and 4 novice raters participated; no rater training was provided. The IRR was then compared among rater groups. RESULTS The IRR using the modified ISAT was high at 0.80 (p < 0.001). Ratings were significantly correlated with trainee objective suturing scores for all rater groups (experts: R = 0.84, residents: R = 0.81, and novices: R = 0.69; p < 0.001). CONCLUSIONS Incorporating video anchors (to define performance) in the ISAT led to high IRR and enabled novices to achieve similar consistency in their ratings as experts.
Affiliation(s)
- Ronit Patnaik, Nicholas E Anton, and Dimitrios Stefanidis: Department of Surgery, Indiana University School of Medicine, 545 Barnhill Dr., Emerson Hall, Indianapolis, IN 46202, USA
10
A novel evaluation of two related and two independent algorithms for eye movement classification during reading. Behav Res Methods 2019; 50:1374-1397. PMID: 29766396. DOI: 10.3758/s13428-018-1050-7.
Abstract
Nyström and Holmqvist have published a method for the classification of eye movements during reading (the ONH algorithm) (Nyström & Holmqvist, 2010). When we applied this algorithm to our data, the results were not satisfactory, so we modified the algorithm (now the MNH) to better classify our data. The changes included: (1) reducing the amount of signal filtering, (2) excluding a new type of noise, (3) removing several adaptive thresholds and replacing them with fixed thresholds, (4) changing the way that the start and end of each saccade was determined, (5) employing a new algorithm for detecting post-saccadic oscillations (PSOs), and (6) allowing a fixation period to either begin or end with noise. A new method for the evaluation of classification algorithms is presented. It was designed to provide comprehensive feedback to an algorithm developer, in a time-efficient manner, about the types and numbers of classification errors that an algorithm produces. This evaluation was conducted by three expert raters independently, across 20 randomly chosen recordings, each classified by both algorithms. The MNH made many fewer errors in determining when saccades start and end, and it also detected some fixations and saccades that the ONH did not. The MNH fails to detect very small saccades. We also evaluated two additional algorithms: the EyeLink Parser and a more current, machine-learning-based algorithm. The EyeLink Parser tended to find more saccades that ended too early than did the other methods, and we found numerous problems with the output of the machine-learning-based algorithm.
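The move from adaptive to fixed thresholds (change 3 above) can be illustrated with a minimal velocity-threshold classifier. This is a generic I-VT-style sketch under stated assumptions, not the MNH itself: the 30 deg/s threshold, the sampling-rate handling, and the omission of filtering, PSO detection, and noise handling are all simplifications, and only the horizontal channel is used.

```python
def classify_samples(x_deg, sample_rate_hz, threshold_deg_per_s=30.0):
    """Label gaze samples 'fixation' or 'saccade' by comparing the
    sample-to-sample velocity against a fixed threshold.

    x_deg: horizontal gaze positions in degrees of visual angle
    (reading saccades are predominantly horizontal in this sketch).
    """
    labels = ["fixation"]  # no velocity estimate for the first sample
    for prev, cur in zip(x_deg, x_deg[1:]):
        velocity = abs(cur - prev) * sample_rate_hz  # deg/s
        labels.append("saccade" if velocity > threshold_deg_per_s
                      else "fixation")
    return labels
```

Runs of "saccade" labels then define candidate saccades, whose onsets and offsets can be refined afterwards; determining those boundaries is precisely the step on which the MNH differed from the ONH.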
11
Kleinmann M, Ingold PV. Toward a Better Understanding of Assessment Centers: A Conceptual Review. Annual Review of Organizational Psychology and Organizational Behavior 2019. DOI: 10.1146/annurev-orgpsych-012218-014955.
Abstract
Assessment centers (ACs) are employed for selecting and developing employees and leaders. They are interpersonal at their core because they consist of interactive exercises. With this perspective in mind, this review focuses on the role of the assessee, the assessor, and the AC design, as well as their interplay in the interpersonal situation of the AC. Therefore, it addresses which conceptual perspectives have increased our understanding of ACs in this context. Building on this, we review relevant empirical findings. On this basis, the review contributes to an empirically driven understanding of the interpersonal nature of ACs and provides directions for practice and future research avenues on this topic as well as on technology in ACs and cross-cultural applications.
Affiliation(s)
- Martin Kleinmann: Department of Psychology, University of Zurich, CH-8050 Zurich, Switzerland
- Pia V. Ingold: Department of Psychology, University of Zurich, CH-8050 Zurich, Switzerland
12
Croyle S, Nash C, Bauman C, LeBlanc S, Haley D, Khosa D, Kelton D. Training method for animal-based measures in dairy cattle welfare assessments. J Dairy Sci 2018; 101:9463-9471. DOI: 10.3168/jds.2018-14469.
13
Lee J, Connelly BS, Goff M, Hazucha JF. Are assessment center behaviors' meanings consistent across exercises? A measurement invariance approach. International Journal of Selection and Assessment 2017. DOI: 10.1111/ijsa.12187.
Affiliation(s)
- Jin Lee: Department of Psychological Sciences, University of Connecticut, Storrs, Connecticut, USA
- Brian S. Connelly: Department of Management, University of Toronto, Scarborough, Ontario, Canada
14
De Kock FS, Lievens F, Born MP. A closer look at the measurement of dispositional reasoning: Dimensionality and invariance across assessor groups. International Journal of Selection and Assessment 2017. DOI: 10.1111/ijsa.12176.
Affiliation(s)
- François S. De Kock: Institute of Psychology, Erasmus University Rotterdam, Rotterdam, The Netherlands
- Filip Lievens: Department of Personnel Management and Work and Organizational Psychology, Ghent University, Ghent, Belgium
- Marise Ph. Born: Institute of Psychology, Erasmus University Rotterdam, Rotterdam, The Netherlands
15
Lockyer J, Carraccio C, Chan MK, Hart D, Smee S, Touchie C, Holmboe ES, Frank JR. Core principles of assessment in competency-based medical education. Medical Teacher 2017; 39:609-616. PMID: 28598746. DOI: 10.1080/0142159X.2017.1315082.
Abstract
The meaningful assessment of competence is critical for the implementation of effective competency-based medical education (CBME). Timely ongoing assessments are needed along with comprehensive periodic reviews to ensure that trainees continue to progress. New approaches are needed to optimize the use of multiple assessors and assessments; to synthesize the data collected from multiple assessors and multiple types of assessments; to develop faculty competence in assessment; and to ensure that relationships between the givers and receivers of feedback are appropriate. This paper describes the core principles of assessment for learning and assessment of learning. It addresses several ways to ensure the effectiveness of assessment programs, including using the right combination of assessment methods and conducting careful assessor selection and training. It provides a reconceptualization of the role of psychometrics and articulates the importance of a group process in determining trainees' progress. In addition, it notes that, to reach its potential as a driver in trainee development, quality care, and patient safety, CBME requires effective information management and documentation as well as ongoing consideration of ways to improve the assessment system.
Affiliation(s)
- Jocelyn Lockyer
- Cumming School of Medicine, University of Calgary, Calgary, Canada
- Ming-Ka Chan
- Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
- Danielle Hart
- Hennepin County Medical Center, Minneapolis, MN, USA
- University of Minnesota Medical School, Minneapolis, MN, USA
- Sydney Smee
- Medical Council of Canada, Ottawa, Canada
- Claire Touchie
- Medical Council of Canada, Ottawa, Canada
- Faculty of Medicine, University of Ottawa, Ottawa, Canada
- Eric S Holmboe
- Accreditation Council for Graduate Medical Education, Chicago, IL, USA
- Jason R Frank
- Royal College of Physicians and Surgeons of Canada, Ottawa, Canada
- Department of Emergency Medicine, University of Ottawa, Ottawa, Canada

16
Muleya VR, Fourie L, Schlebusch S. Ethical challenges in assessment centres in South Africa. SA JOURNAL OF INDUSTRIAL PSYCHOLOGY 2017. [DOI: 10.4102/sajip.v43i0.1324]
Abstract
Orientation: Assessment Centres (ACs) are used globally for the selection and development of candidates. Limited empirical evidence exists of the ethical challenges encountered in the use of ACs, especially in South Africa (SA).
Research purpose: Firstly, to explore possible ethical challenges related to ACs in SA from the vantage point of the practitioner and, secondly, to search for possible solutions to these.
Motivation for the study: Decisions based on AC outcomes have profound implications for participants and organisations, and it is essential to understand potential ethical challenges to minimise these, specifically in the SA context, given its socio-political history, multiculturalism, diversity and pertinent legal considerations.
Research design, approach and method: A qualitative, interpretative research design was chosen. Data were collected by means of a semi-structured survey that was completed by 96 AC practitioners who attended an AC conference. Content analysis and thematic interpretation were used to make sense of the data. The preliminary findings were assessed by a focus group of purposively selected subject-matter experts (n = 16) who provided informed insights, which were incorporated into the final findings. The focus group suggested ways in which specific ethical challenges may be addressed.
Main findings: The findings revealed many ethical challenges that can be better understood within a broad framework encompassing 10 themes: universal ethical values; multicultural global contexts; the regulatory-legal framework for ACs in SA; characteristics of the assessor; psychometric properties of the AC; characteristics of the participant; bias and prejudice; governance of the AC process; ethical culture of the employer organisation; and the evasive nature of ethics as a concept.
Practical and managerial implications: Considerable risk exists for the unethical use of ACs. An awareness of possible areas of risk may assist AC stakeholders in their search for ethical AC use.
Contribution or value-add: The study may contribute to an evidence-based understanding of the ethical aspects of ACs. The recommendations may also benefit all AC stakeholders who wish to use ACs ethically.
17
Abstract
The purpose of this research was to examine frame-of-reference (FOR) training retention in an assessment center (AC) rater training context. In this study, we extended Gorman and Rentsch’s (2009) research showing FOR training effects on performance schemas by examining the effects immediately after training and again after a two-week nonuse period. We examined the retention effects of FOR training on performance ratings and on performance schema accuracy. The results indicated that the FOR training condition, compared to a control condition, yielded performance ratings and performance schemas more similar to expert ratings and to an expert schema, respectively. FOR training also had positive effects on ratings and performance schema accuracy assessed two weeks after training. These results support and extend the theory of FOR training, which posits that the instructed theory of performance replaces the preexisting rater schemas (Lievens, 2001), and they contribute to the research on FOR training within AC contexts.
Affiliation(s)
- C. Allen Gorman
- Department of Management and Marketing, East Tennessee State University, Johnson City, TN, USA
- Joan R. Rentsch
- School of Communication Studies, University of Tennessee, Knoxville, TN, USA
18
Park S. Measuring Accountability in the Performance Appraisal Context: Rater Status and Organization Culture as Determinants of Rater Accountability. CURRENT PSYCHOLOGY 2016. [DOI: 10.1007/s12144-016-9499-y]
19
Vanhove AJ, Gibbons AM, Kedharnath U. Rater agreement, accuracy, and experienced cognitive load: Comparison of distributional and traditional assessment approaches to rating performance. HUMAN PERFORMANCE 2016. [DOI: 10.1080/08959285.2016.1192632]
20
Time To Change the Bathwater: Correcting Misconceptions About Performance Ratings. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE 2016. [DOI: 10.1017/iop.2016.17]
Abstract
Recent commentary has suggested that performance management (PM) is fundamentally “broken,” with negative feelings from managers and employees toward the process at an all-time high (Pulakos, Hanson, Arad, & Moye, 2015; Pulakos & O'Leary, 2011). In response, some high-profile organizations have decided to eliminate performance ratings altogether as a solution to the growing disenchantment. Adler et al. (2016) offer arguments both in support of and against eliminating performance ratings in organizations. Although both sides of the debate in the focal article make some strong arguments both for and against utilizing performance ratings in organizations, we believe there continue to be misunderstandings, mischaracterizations, and misinformation with respect to some of the measurement issues in PM. We offer the following commentary not to persuade readers to adopt one particular side over another but as a call to critically reconsider and reevaluate some of the assumptions underlying measurement issues in PM and to dispel some of the pervasive beliefs throughout the performance rating literature.
21
Royal KD, Hecker KG. Rater Errors in Clinical Performance Assessments. JOURNAL OF VETERINARY MEDICAL EDUCATION 2015; 43:5-8. [PMID: 26560550] [DOI: 10.3138/jvme.0715-112r]
Abstract
Rater errors are some of the most significant validity threats to any performance assessment. Veterinary medical education routinely uses raters to assess student performance in a variety of scenarios (e.g., clinical assessments, OSCEs, etc.). The purpose of this "teaching tip" is to introduce veterinary medical educators to the notion of rater error, identify a list of common rater errors, and discuss how these errors can be addressed and minimized so as to produce accurate and defensible measures of student performance.
22
Borteyrou X, Lievens F, Bruchon-Schweitzer M, Congard A, Rascle N. Incremental Validity of Leaderless Group Discussion Ratings Over and Above General Mental Ability and Personality in Predicting Promotion. INTERNATIONAL JOURNAL OF SELECTION AND ASSESSMENT 2015. [DOI: 10.1111/ijsa.12121]
Affiliation(s)
- Filip Lievens
- Department of Personnel Management and Work and Organizational Psychology, Ghent University, Belgium
23
Measurement Error Obfuscates Scientific Knowledge: Path to Cumulative Knowledge Requires Corrections for Unreliability and Psychometric Meta-Analyses. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE 2015. [DOI: 10.1017/s1754942600006799]
24
Seeing the Forest but Missing the Trees: The Role of Judgments in Performance Management. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE 2015. [DOI: 10.1017/iop.2015.6]
Abstract
Various solutions have been proposed to “fix” performance management (PM) over the last several decades. Pulakos, Mueller Hanson, Arad, and Moye (2015) have presented a holistic approach to improving PM in organizations. Although this approach addresses several key elements related to the social context of PM, namely the buy-in of organizational stakeholders, timely and regular feedback, and future-directed feedback, we believe that several robust findings from the PM research literature could further improve this process. Are Pulakos et al. looking at the forest but missing the trees? In the following commentary, we offer several reasons that performance judgments and perhaps even informal ratings are still operating and occurring in the proposed holistic system. Therefore, advancements in other areas of PM research may offer additional ways to fix PM.
25
Ilgen JS, Ma IWY, Hatala R, Cook DA. A systematic review of validity evidence for checklists versus global rating scales in simulation-based assessment. MEDICAL EDUCATION 2015; 49:161-73. [PMID: 25626747] [DOI: 10.1111/medu.12621]
Abstract
Context: The relative advantages and disadvantages of checklists and global rating scales (GRSs) have long been debated. To compare the merits of these scale types, we conducted a systematic review of the validity evidence for checklists and GRSs in the context of simulation-based assessment of health professionals.
Methods: We conducted a systematic review of multiple databases including MEDLINE, EMBASE and Scopus to February 2013. We selected studies that used both a GRS and checklist in the simulation-based assessment of health professionals. Reviewers working in duplicate evaluated five domains of validity evidence, including correlation between scales and reliability. We collected information about raters, instrument characteristics, assessment context, and task. We pooled reliability and correlation coefficients using random-effects meta-analysis.
Results: We found 45 studies that used a checklist and GRS in simulation-based assessment. All studies included physicians or physicians in training; one study also included nurse anaesthetists. Topics of assessment included open and laparoscopic surgery (n = 22), endoscopy (n = 8), resuscitation (n = 7) and anaesthesiology (n = 4). The pooled GRS-checklist correlation was 0.76 (95% confidence interval [CI] 0.69-0.81, n = 16 studies). Inter-rater reliability was similar between scales (GRS 0.78, 95% CI 0.71-0.83, n = 23; checklist 0.81, 95% CI 0.75-0.85, n = 21), whereas GRS inter-item reliabilities (0.92, 95% CI 0.84-0.95, n = 6) and inter-station reliabilities (0.80, 95% CI 0.73-0.85, n = 10) were higher than those for checklists (0.66, 95% CI 0-0.84, n = 4 and 0.69, 95% CI 0.56-0.77, n = 10, respectively). Content evidence for GRSs usually referenced previously reported instruments (n = 33), whereas content evidence for checklists usually described expert consensus (n = 26). Checklists and GRSs usually had similar evidence for relations to other variables.
Conclusions: Checklist inter-rater reliability and trainee discrimination were more favourable than suggested in earlier work, but each task requires a separate checklist. Compared with the checklist, the GRS has higher average inter-item and inter-station reliability, can be used across multiple tasks, and may better capture nuanced elements of expertise.
Affiliation(s)
- Jonathan S Ilgen
- Division of Emergency Medicine, Department of Medicine, University of Washington School of Medicine, Seattle, Washington, USA
26
Lance CE. Why Assessment Centers Do Not Work the Way They Are Supposed To. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE 2015. [DOI: 10.1111/j.1754-9434.2007.00017.x]
Abstract
Assessment centers (ACs) are often designed with the intent of measuring a number of dimensions as they are assessed in various exercises, but after 25 years of research, it is now clear that AC ratings that are completed at the end of each exercise (commonly known as postexercise dimension ratings) substantially reflect the effects of the exercises in which they were completed and not the dimensions they were designed to reflect. This is the crux of the long-standing “construct validity problem” for AC ratings. I review the existing research on AC construct validity and conclude that (a) contrary to previous notions, AC candidate behavior is inherently cross-situationally (i.e., cross-exercise) specific, not cross-situationally consistent as was once thought, (b) assessors rather accurately assess candidate behavior, and (c) these facts should be recognized in the redesign of ACs toward task- or role-based ACs and away from traditional dimension-based ACs.
27
Viswesvaran C, Ones DS, Schmidt FL, Le H, Oh IS. Measurement Error Obfuscates Scientific Knowledge: Path to Cumulative Knowledge Requires Corrections for Unreliability and Psychometric Meta-Analyses. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE 2014. [DOI: 10.1111/iops.12186]
Affiliation(s)
- Huy Le
- University of Texas-San Antonio
28
Weitz G, Vinzentius C, Twesten C, Lehnert H, Bonnemeier H, König IR. Effects of a rater training on rating accuracy in a physical examination skills assessment. GMS ZEITSCHRIFT FUR MEDIZINISCHE AUSBILDUNG 2014; 31:Doc41. [PMID: 25489341] [PMCID: PMC4259060] [DOI: 10.3205/zma000933]
Abstract
Background: The accuracy and reproducibility of medical skills assessment is generally low. Rater training has little or no effect. Our knowledge in this field, however, relies on studies involving video ratings of overall clinical performances. We hypothesised that a rater training focussing on the frame of reference could improve accuracy in grading the curricular assessment of a highly standardised physical head-to-toe examination.
Methods: Twenty-one raters assessed the performance of 242 third-year medical students. Eleven raters had been randomly assigned to undergo a brief frame-of-reference training a few days before the assessment. 218 encounters were successfully recorded on video and re-assessed independently by three additional observers. Accuracy was defined as the concordance between the raters' grade and the median of the observers' grades. After the assessment, both students and raters filled in a questionnaire about their views on the assessment.
Results: Rater training did not have a measurable influence on accuracy. However, trained raters rated significantly more stringently than untrained raters, and their overall stringency was closer to the stringency of the observers. The questionnaire indicated a higher awareness of the halo effect in the trained raters group. Although the self-assessment of the students mirrored the assessment of the raters in both groups, the students assessed by trained raters felt more discontent with their grade.
Conclusions: While training had some marginal effects, it failed to have an impact on individual accuracy. These results in real-life encounters are consistent with previous studies on rater training using video assessments of clinical performances. The high degree of standardisation in this study was not suitable to harmonize the trained raters' grading. The data support the notion that the process of appraising medical performance is highly individual. A frame-of-reference training as applied does not effectively adjust physicians' judgement of medical students in real-life assessments.
Affiliation(s)
- Gunther Weitz
- Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Medizinische Klinik I, Lübeck, Germany
- Christian Vinzentius
- Institut für Qualitätsentwicklung an Schulen Schleswig-Holstein, Kronshagen, Germany
- Christoph Twesten
- Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Medizinische Klinik I, Lübeck, Germany
- Hendrik Lehnert
- Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Medizinische Klinik I, Lübeck, Germany
- Hendrik Bonnemeier
- Universitätsklinikum Schleswig-Holstein, Campus Kiel, Medizinische Klinik III, Kiel, Germany
- Inke R König
- Universität zu Lübeck, Institut für Medizinische Biometrie und Statistik, Lübeck, Germany

29
Brits NM, Meiring D, Becker JR. Investigating the construct validity of a development assessment centre. SA JOURNAL OF INDUSTRIAL PSYCHOLOGY 2013. [DOI: 10.4102/sajip.v39i1.1092]
Abstract
Orientation: The assessment centre (AC) is a prominent measurement tool for selection and development.
Research purpose: The aim of this study was to determine the construct validity of a one-day development assessment centre (DAC) using a convenience sample of 202 managers in a large South African banking institution.
Motivation for the study: Although the AC method is popular, it has been widely criticised as to whether it predominantly measures the dimensions it is designed to measure.
Research design, approach and method: The fit of the measurement models implied by the dimensions measured was analysed in a quantitative study using an ex post facto correlation design and structural equation modelling.
Main findings: Bi-factor confirmatory factor analysis was used to assess the relative contribution of higher-order exercise and dimension effects. Empirical under-identification stemming from the small number of exercises designed to reflect designated latent dimensions restricted the number of DAC dimensions that could be evaluated. Ultimately, only one global dimension had enough measurement points and was analysed. The results suggested that dimension effects explained the majority of variance in the post-exercise dimension ratings.
Practical/managerial implications: Candidates’ proficiency on each dimension was used as the basis for development reports. The validity of inferences holds important implications for candidates’ career development and growth.
Contribution/value-add: The authors found only one study on the construct validity of AC dimensions in the South African context. The present study is the first to use the bi-factor approach, and will consequently contribute to the scarce AC literature in South Africa.
30
Müller KP, Roodt G. Content validation: The forgotten step-child or a crucial step in assessment centre validation? SA JOURNAL OF INDUSTRIAL PSYCHOLOGY 2013. [DOI: 10.4102/sajip.v39i1.1153]
Abstract
Orientation: Assessment centres (ACs) are a popular method of assessment in South Africa, as they offer a practical link to the required job, directly observed through candidate behaviour. Content is often borrowed from the USA, so research into the applicability of that content in South Africa is justified.
Research purpose: This study aimed to determine whether a selected USA-developed virtual assessment centre (VAC) measured what it claims to, and to determine whether the content is suitable for South Africa.
Motivation for the study: A solid pre-statistical foundation of content forms the backbone of assessing validity. Content validation analysis is well suited to analysing the relevance of AC simulations in a specific cultural context. Too often content validation is either implied, or insufficiently explained.
Research design, approach and method: A content evaluation schedule was developed, consisting of 50 items covering seven content validation dimensions. Thirteen subject matter experts and nine functional experts were tasked to assess an imported VAC using this schedule.
Main findings: This study provides support that the VAC appears to measure what it purports to, and that overall, the content is suitable for use in South Africa.
Practical/managerial implications: Content created in the USA can be assessed for relevance and applicability for South Africa through content validation.
Contribution/value-add: This study contributes to AC literature and assessment methodology by demonstrating the importance and utility of content validation. Importers and developers of AC content may use this study’s techniques to validate content to meet legislative requirements and ensure domain relevance.
31
Govaerts MJB, van de Wiel MWJ, van der Vleuten CPM. Quality of feedback following performance assessments: does assessor expertise matter? EUROPEAN JOURNAL OF TRAINING AND DEVELOPMENT 2013. [DOI: 10.1108/03090591311293310]
32
Guenole N, Chernyshenko OS, Stark S, Cockerill T, Drasgow F. More than a mirage: A large-scale assessment centre with more dimension variance than exercise variance. JOURNAL OF OCCUPATIONAL AND ORGANIZATIONAL PSYCHOLOGY 2012. [DOI: 10.1111/j.2044-8325.2012.02063.x]
Affiliation(s)
- Nigel Guenole
- Institute of Management Studies, Goldsmiths, University of London, UK
- Stephen Stark
- Department of Psychology, University of South Florida, USA
- Fritz Drasgow
- School of Labor & Employment Relations and Department of Psychology, Illinois, USA

33
Patterson F, Ferguson E. Testing non-cognitive attributes in selection centres: how to avoid being reliably wrong. MEDICAL EDUCATION 2012; 46:240-2. [PMID: 22324522] [DOI: 10.1111/j.1365-2923.2011.04193.x]
34
Magnier KM, Dale VHM, Pead MJ. Workplace-based assessment instruments in the health sciences. JOURNAL OF VETERINARY MEDICAL EDUCATION 2012. [PMID: 23187032] [DOI: 10.3138/jvme.1211-118r]
Abstract
A historical overview of the development of assessment instruments in the health sciences is presented here, with specific attention paid to workplace-based assessment instruments. Three instruments are reviewed in detail: the mini clinical evaluation exercise (mCEX), direct observation of procedural skills (DOPS), and multi-source feedback (MSF). Features common to these instruments include their authenticity, their use in assessing professional skills, and the opportunities they afford for the provision of feedback. Although almost exclusively used in graduate medical training, they are likely to play an increasingly important role in the assessment of veterinary undergraduate students in preparation for professional practice. However, the time and cost associated with implementing these instruments raises questions about their feasibility. The continued search for the holy grail of assessment instruments and the challenges relating to the need for trained assessors leads us to conclude that ultimately, the competence of health professionals should continue to be measured using several complementary instruments.
Collapse
MESH Headings
- Clinical Competence/standards
- Education, Dental, Graduate/methods
- Education, Dental, Graduate/standards
- Education, Medical, Graduate/methods
- Education, Medical, Graduate/standards
- Education, Veterinary/methods
- Education, Veterinary/standards
- Employee Performance Appraisal/economics
- Employee Performance Appraisal/methods
- Humans
- Workplace
Affiliation(s)
- Kirsty M Magnier
- Department of Veterinary Clinical Services, Royal Veterinary College, Hatfield, UK.
35
Roch SG, Woehr DJ, Mishra V, Kieszczynska U. Rater training revisited: An updated meta-analytic review of frame-of-reference training. JOURNAL OF OCCUPATIONAL AND ORGANIZATIONAL PSYCHOLOGY 2011. [DOI: 10.1111/j.2044-8325.2011.02045.x]
36
Schmid Mast M, Bangerter A, Bulliard C, Aerni G. How Accurate are Recruiters' First Impressions of Applicants in Employment Interviews? INTERNATIONAL JOURNAL OF SELECTION AND ASSESSMENT 2011. [DOI: 10.1111/j.1468-2389.2011.00547.x]
37
Govaerts MJB, Schuwirth LWT, Van der Vleuten CPM, Muijtjens AMM. Workplace-based assessment: effects of rater expertise. ADVANCES IN HEALTH SCIENCES EDUCATION: THEORY AND PRACTICE 2011; 16:151-65. [PMID: 20882335] [PMCID: PMC3068251] [DOI: 10.1007/s10459-010-9250-7]
Abstract
Traditional psychometric approaches towards assessment tend to focus exclusively on quantitative properties of assessment outcomes. This may limit more meaningful educational approaches towards workplace-based assessment (WBA). Cognition-based models of WBA argue that assessment outcomes are determined by cognitive processes by raters which are very similar to reasoning, judgment and decision making in professional domains such as medicine. The present study explores cognitive processes that underlie judgment and decision making by raters when observing performance in the clinical workplace. It specifically focuses on how differences in rating experience influence information processing by raters. Verbal protocol analysis was used to investigate how experienced and non-experienced raters select and use observational data to arrive at judgments and decisions about trainees' performance in the clinical workplace. Differences between experienced and non-experienced raters were assessed with respect to time spent on information analysis and representation of trainee performance; performance scores; and information processing--using qualitative-based quantitative analysis of verbal data. Results showed expert-novice differences in time needed for representation of trainee performance, depending on complexity of the rating task. Experts paid more attention to situation-specific cues in the assessment context and they generated (significantly) more interpretations and fewer literal descriptions of observed behaviors. There were no significant differences in rating scores. Overall, our findings seemed to be consistent with other findings on expertise research, supporting theories underlying cognition-based models of assessment in the clinical workplace. Implications for WBA are discussed.
Affiliation(s)
- M J B Govaerts
- FHML, Department of Educational Research and Development, Maastricht University, The Netherlands.
38
Melchers KG, Lienhardt N, von Aarburg M, Kleinmann M. Is More Structure Really Better? A Comparison of Frame-of-Reference Training and Descriptively Anchored Rating Scales to Improve Interviewers' Rating Quality. PERSONNEL PSYCHOLOGY 2011. [DOI: 10.1111/j.1744-6570.2010.01202.x]
39
Stillman JA, Jackson DJR. A detection theory approach to the evaluation of assessors in assessment centres. JOURNAL OF OCCUPATIONAL AND ORGANIZATIONAL PSYCHOLOGY 2011. [DOI: 10.1348/096317905x26147]
40
Kleinmann M, Klehe UC. Selling Oneself: Construct and Criterion-Related Validity of Impression Management in Structured Interviews. HUMAN PERFORMANCE 2010. [DOI: 10.1080/08959285.2010.530634]
41
Melchers KG, Kleinmann M, Prinz MA. Do Assessors Have Too Much on their Plates? The Effects of Simultaneously Rating Multiple Assessment Center Candidates on Rating Quality. INTERNATIONAL JOURNAL OF SELECTION AND ASSESSMENT 2010. [DOI: 10.1111/j.1468-2389.2010.00516.x]
42
Lievens F. Assessment centres: A tale about dimensions, exercises, and dancing bears. EUROPEAN JOURNAL OF WORK AND ORGANIZATIONAL PSYCHOLOGY 2009. [DOI: 10.1080/13594320802058997]
|
43
|
Woo SE, Sims CS, Rupp DE, Gibbons AM. Development engagement within and following developmental assessment centers: considering feedback favorability and self-assessor agreement. PERSONNEL PSYCHOLOGY 2008. [DOI: 10.1111/j.1744-6570.2008.00129.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
44
|
Ziv A, Rubin O, Moshinsky A, Gafni N, Kotler M, Dagan Y, Lichtenberg D, Mekori YA, Mittelman M. MOR: a simulation-based assessment centre for evaluating the personal and interpersonal qualities of medical school candidates. MEDICAL EDUCATION 2008; 42:991-8. [PMID: 18823518 DOI: 10.1111/j.1365-2923.2008.03161.x] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
CONTEXT Medical school admissions traditionally rely heavily on cognitive variables, with non-cognitive measures assessed through interviews only. In recognition of the unsatisfactory reliability and validity of traditional interviews, medical schools are increasingly exploring alternative approaches that can provide improved measures of candidates' personal and interpersonal qualities. METHODS An innovative assessment centre (MOR [Hebrew acronym for 'selection for medicine']) was designed to measure candidates' personal and interpersonal attributes. Three assessment tools were developed: behavioural stations, including encounters with simulated patients and group tasks; an autobiographical questionnaire, and a judgement and decision-making questionnaire. Candidates were evaluated by trained raters on four qualities: interpersonal communication; ability to handle stress; initiative and responsibility, and self-awareness. RESULTS In the years 2004-05, the 588 medical school candidates with the highest cognitive scores were tested; this resulted in a change of approximately 20% in the cohort of accepted students compared with previous admission criteria. Internal consistency ranged from 0.80 to 0.88; inter-rater reliability ranged from 0.62 to 0.77 for the behavioural stations and from 0.72 to 0.95 for the questionnaires; test-retest score correlation was 0.7. The correlation between candidates' MOR scores and cognitive scores approached zero, reflecting the value of MOR in the screening process. Feedback from participants indicated that MOR was perceived as fair and appropriate for medical school screening. DISCUSSION MOR is a reliable tool for measuring non-cognitive attributes in medical school candidates. It has high content and face validity. Furthermore, its implementation conveys the importance of maintaining humanist characteristics in the medical profession to students and faculty staff.
Affiliation(s)
- Amitai Ziv
- Israel Center for Medical Simulation, Sheba Medical Center, Tel-Hashomer, Israel.
|
45
|
Abstract
We review developments in personnel selection since the previous review by Hough & Oswald (2000) in the Annual Review of Psychology. We organize the review around a taxonomic structure of possible bases for improved selection, which includes (a) better understanding of the criterion domain and criterion measurement, (b) improved measurement of existing predictor methods or constructs, (c) identification and measurement of new predictor methods or constructs, (d) improved identification of features that moderate or mediate predictor-criterion relationships, (e) clearer understanding of the relationship between predictors or between predictors and criteria (e.g., via meta-analytic synthesis), (f) identification and prediction of new outcome variables, (g) improved ability to determine how well we predict the outcomes of interest, (h) improved understanding of subgroup differences, fairness, bias, and legal defensibility, (i) improved administrative ease with which selection systems can be used, (j) improved insight into applicant reactions, and (k) improved decision-maker acceptance of selection systems.
Affiliation(s)
- Paul R Sackett
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA.
|
46
|
Wu SM, Whiteside U, Neighbors C. Differences in Inter‐Rater Reliability and Accuracy for a Treatment Adherence Scale. Cogn Behav Ther 2007; 36:230-9. [DOI: 10.1080/16506070701584367] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
47
|
Kanning UP, Pöttker J, Gelléri P. Assessment Center-Praxis in deutschen Großunternehmen. ZEITSCHRIFT FUR ARBEITS-UND ORGANISATIONSPSYCHOLOGIE 2007. [DOI: 10.1026/0932-4089.51.4.155] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
In a survey of large German companies, we examine the extent to which the scientific quality criteria of the assessment center (AC) method are implemented in practice. The criteria relate both to AC construction (job requirement analyses, the quantitative ratio of assessors to candidates, assessor training, etc.) and to the exercises used as well as the administration of the assessment centers (e.g., use of behaviorally anchored rating scales, assessor rotation, evaluation, etc.). For most criteria, the picture is primarily positive. However, clear deficits are also evident. The results are discussed with regard to possible causes of the inadequate implementation of scientific standards in practice.
|
48
|
Abstract
Assessment and evaluation are integral parts of any educational and training process, and students at all levels of training respond by studying more seriously for the parts of the course or training that are assessed. To promote and enhance effective learning successfully, simulation and other teaching methods should be both formative and summative, because the ultimate goal is to ensure professional competence. This article describes a model of medical competence, and focuses on the use of medical simulation in assessment and evaluation of different levels of clinical competence using examples from experience.
Affiliation(s)
- Amitai Ziv
- The Israel Center for Medical Simulation (MSR), Chaim Sheba Medical Center, Tel-Hashomer 52621, Israel.
|
49
|
Govaerts MJB, van der Vleuten CPM, Schuwirth LWT, Muijtjens AMM. Broadening perspectives on clinical performance assessment: rethinking the nature of in-training assessment. ADVANCES IN HEALTH SCIENCES EDUCATION : THEORY AND PRACTICE 2007; 12:239-60. [PMID: 17096207 DOI: 10.1007/s10459-006-9043-1] [Citation(s) in RCA: 165] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2005] [Accepted: 10/02/2006] [Indexed: 05/11/2023]
Abstract
CONTEXT In-training assessment (ITA), defined as multiple assessments of performance in the setting of day-to-day practice, is an invaluable tool in assessment programmes which aim to assess professional competence in a comprehensive and valid way. Research on clinical performance ratings, however, consistently shows weaknesses concerning accuracy, reliability and validity. Attempts to improve the psychometric characteristics of ITA focusing on standardisation and objectivity of measurement thus far result in limited improvement of ITA-practices. PURPOSE The aim of the paper is to demonstrate that the psychometric framework may limit more meaningful educational approaches to performance assessment, because it does not take into account key issues in the mechanics of the assessment process. Based on insights from other disciplines, we propose an approach to ITA that takes a constructivist, social-psychological perspective and integrates elements of theories of cognition, motivation and decision making. A central assumption in the proposed framework is that performance assessment is a judgment and decision making process, in which rating outcomes are influenced by interactions between individuals and the social context in which assessment occurs. DISCUSSION The issues raised in the article and the proposed assessment framework bring forward a number of implications for current performance assessment practice. It is argued that focusing on the context of performance assessment may be more effective in improving ITA practices than focusing strictly on raters and rating instruments. Furthermore, the constructivist approach towards assessment has important implications for assessment procedures as well as the evaluation of assessment quality. Finally, it is argued that further research into performance assessment should contribute towards a better understanding of the factors that influence rating outcomes, such as rater motivation, assessment procedures and other contextual variables.
Affiliation(s)
- Marjan J B Govaerts
- Department of Educational Development and Research, Faculty of Medicine, Maastricht University, PO Box 616, 6200 MD, Maastricht, The Netherlands.
|
50
|
Whiting HJ, Kline TJB. Testing a model of performance appraisal fit on attitudinal outcomes. ACTA ACUST UNITED AC 2007. [DOI: 10.1080/10887150701451288] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|