1
|
Measuring teacher innovative behavior: a validated multidimensional inventory for use with public school teachers. INTERNATIONAL JOURNAL OF EDUCATIONAL MANAGEMENT 2023. [DOI: 10.1108/ijem-03-2022-0095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Abstract
PurposeThe role of Teacher Innovative Behavior (TIB), in responding to systemic problems in educational systems and promoting “intrapreneurial” behavior has been recognized in recent times. A robust instrument that can help administrators and teacher educators gauge the levels of TIB among their teachers will facilitate the promotion of innovative behavior.Design/methodology/approachThis study tested a multidimensional innovative behavior inventory (IBI), innovation support inventory (ISI) and innovation output (IO) in a developing nation (India) context with public school teachers (n = 34,754), for reliability, validity, measurement invariance and structural invariance across caste, gender and subject groups.FindingsThe IBI, ISI and IO showed good reliability and validity along with full measurement invariance at configural, metric and scalar levels. With respect to the structural parameters, the inventories exhibited invariance of factor variance and covariance, but not of factor means.Practical implicationsTeacher innovative behavior (TIB) is seen by developing country education administrators as a tool to address difficult problems. With better measurement, it will be possible to identify teachers who need training in creativity and entrepreneurial behavior, teachers who might have developed innovative practices that could be used for teacher development, and ways of promoting competition among teachers.Originality/valueThe study validates inventories, which were earlier tested in non-educational domains, for use with public school teachers of a developing country across gender, caste and subject groups.
Collapse
|
2
|
Large studies reveal how reference bias limits policy applications of self-report measures. Sci Rep 2022; 12:19189. [PMID: 36357481 PMCID: PMC9649615 DOI: 10.1038/s41598-022-23373-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 10/31/2022] [Indexed: 11/11/2022] Open
Abstract
There is growing policy interest in identifying contexts that cultivate self-regulation. Doing so often entails comparing groups of individuals (e.g., from different schools). We show that self-report questionnaires-the most prevalent modality for assessing self-regulation-are prone to reference bias, defined as systematic error arising from differences in the implicit standards by which individuals evaluate behavior. In three studies, adolescents (N = 229,685) whose peers performed better academically rated themselves lower in self-regulation and held higher standards for self-regulation. This effect was not observed for task measures of self-regulation and led to paradoxical predictions of college persistence 6 years later. These findings suggest that standards for self-regulation vary by social group, limiting the policy applications of self-report questionnaires.
Collapse
|
3
|
Comparing the reliability and predictive power of child, teacher, and guardian reports of noncognitive skills. Proc Natl Acad Sci U S A 2022; 119:2113992119. [PMID: 35131849 PMCID: PMC8833216 DOI: 10.1073/pnas.2113992119] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2021] [Indexed: 11/18/2022] Open
Abstract
Children’s noncognitive or socioemotional skills (e.g., persistence and self-control) are typically measured using surveys in which either children rate their own skills or adults rate the skills of children. For many purposes—including program evaluation and monitoring school systems—ratings are often collected from multiple perspectives about a single child (e.g., from both the child and an adult). Collecting data from multiple perspectives is costly, and there is limited evidence on the benefits of this approach. Using a longitudinal survey, this study compares children’s noncognitive skills as reported by themselves, their guardians, and their teachers. Although reports from all three types of respondents are correlated with each other, teacher reports have the highest internal consistency and are the most predictive of children’s later cognitive outcomes and behavior in school. The teacher reports add predictive power beyond baseline measures of Intelligence Quotient (IQ) for most outcomes in schools. Measures collected from children and guardians add minimal predictive power beyond the teacher reports.
Collapse
|
4
|
Coenen J, Golsteyn BHH, Stolp T, Tempelaar D. Personality traits and academic performance: Correcting self-assessed traits with vignettes. PLoS One 2021; 16:e0248629. [PMID: 33765063 PMCID: PMC7993818 DOI: 10.1371/journal.pone.0248629] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 03/03/2021] [Indexed: 11/25/2022] Open
Abstract
In this study, we investigate whether Conscientiousness, Emotional Stability and Risk Preference relate to student performance in higher education. We employ anchoring vignettes to correct for heterogeneous scale use in these non-cognitive skills. Our data are gathered among first-year students at a Dutch university. The results show that Conscientiousness is positively related to student performance, but the estimates are strongly biased upward if we use the uncorrected variables. We do not find significant relationships for Emotional Stability but find that the point estimates are larger when using the uncorrected variables. Measured Risk Preference is negatively related to student performance, yet this is fully explained by heterogeneous scale use. These results indicate the importance of using more objective measurements of personality traits.
Collapse
Affiliation(s)
- Johan Coenen
- School of Business and Economics, Maastricht University, Maastricht, The Netherlands
| | - Bart H. H. Golsteyn
- School of Business and Economics, Maastricht University, Maastricht, The Netherlands
- * E-mail:
| | - Tom Stolp
- School of Business and Economics, Maastricht University, Maastricht, The Netherlands
| | - Dirk Tempelaar
- School of Business and Economics, Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
5
|
The importance of development standards for anchoring vignettes: an illustrative example from pediatric localized scleroderma quality of life. Qual Life Res 2020; 29:3263-3272. [PMID: 32654054 DOI: 10.1007/s11136-020-02575-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/02/2020] [Indexed: 10/23/2022]
Abstract
PURPOSE Anchoring vignettes (AVs) are a promising measurement technique to reduce bias in patient-reported outcome (PRO) measures by helping researchers understand differences in how individuals and groups interpret response options. However, little attention has been paid to ensure quality development of AVs, and their performance has not been well assessed in pediatric populations. In this study, we explore the application of a rigorous development process for AVs based upon current standards for PROs, as well as feasibility of AVs when administered to children and adolescents. METHODS We developed AVs using a rigorous, patient-centered mixed methods process including three phases: (1) development, (2) a pilot study, and (3) a field test. Our proposed process included the generation of a conceptual framework based on the PRO, the Localized Scleroderma Quality of Life Instrument, and numerous vignette-specific considerations. We qualitatively explored readability and comprehension of the AVs (pilot study) and then analyzed ranking patterns within vignette sets (field test). RESULTS Four sets of four vignettes were developed. Revisions were suggested at each phase of development. The pilot study demonstrated that children ≥ 10 years had no trouble indicating understanding of the AVs. In the field test, although appropriate rankings of vignettes were generally demonstrated by participants, the percentage of tied rankings was higher than expected in this pediatric group. CONCLUSIONS This work supports the need for rigorous developmental standards for AVs, as each stage of development suggested revisions. Additionally, AVs showed initial promise for use with pediatric populations; general feasibility and understanding were supported.
Collapse
|
6
|
Associations between self-rated health and the assessments of anchoring vignettes in cardiovascular patients. Int J Clin Health Psychol 2020; 20:100-107. [PMID: 32550849 PMCID: PMC7296254 DOI: 10.1016/j.ijchp.2020.04.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 04/01/2020] [Indexed: 01/08/2023] Open
Abstract
Background/Objective Assessments of health can be biased by response shift effects. One method for detecting such effects is the use of anchoring vignettes. The aim of this study was to analyze the relationship between participants' self-assessed health state and their assessments of these vignettes. Method A total of 342 cardiovascular patients assessed their own state of health on a 0-100 visual analogue scale. The patients additionally assessed two vignettes featuring fictional persons suffering from specific complaints. A sample of the general population (N = 1,236) served as controls. Results The participants rated the health state of the vignette character featuring physical problems as being significantly better than the general population did (effect size: d = 0.53). The group difference in the assessment of the vignette featuring primarily mental health problems was lower (d = -0.17). Participants' assessments of the vignettes were positively correlated with their assessments of their own health state (r = .26 and r = .10) and with several quality of life variables. Conclusions Anchoring vignettes are a useful tool for detecting response shift effects.
Collapse
|
7
|
von Davier M, Shin HJ, Khorramdel L, Stankov L. The Effects of Vignette Scoring on Reliability and Validity of Self-Reports. APPLIED PSYCHOLOGICAL MEASUREMENT 2018; 42:291-306. [PMID: 29881126 PMCID: PMC5978608 DOI: 10.1177/0146621617730389] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
The research presented in this article combines mathematical derivations and empirical results to investigate effects of the nonparametric anchoring vignette approach proposed by King, Murray, Salomon, and Tandon on the reliability and validity of rating data. The anchoring vignette approach aims to correct rating data for response styles to improve comparability across individuals and groups. Vignettes are used to adjust self-assessment responses on the respondent level but entail significant assumptions: They are supposed to be invariant across respondents, and the responses to vignette prompts are supposed to be without error and strictly ordered. This article shows that these assumptions are not always met and that the anchoring vignette approach leads to higher Cronbach's alpha values and increased correlations among adjusted variables regardless of whether the assumptions of the approach are met or violated. Results suggest that the underlying assumptions and effects of the anchoring vignette approach should be carefully examined as the increased correlations and reliability estimates can be observed even for response variables that are independent random draws and uncorrelated with any other variable.
Collapse
Affiliation(s)
- Matthias von Davier
- National Board of Medical Examiners, Philadelphia, PA, USA
- Matthias von Davier, National Board of Medical Examiners, 3750 Market Street, Philadelphia, PA 19104, USA.
| | | | | | - Lazar Stankov
- Australian Catholic University, North Sydney, New South Wales, Australia
| |
Collapse
|
8
|
Weiss S, Roberts RD. Using Anchoring Vignettes to Adjust Self-Reported Personality: A Comparison Between Countries. Front Psychol 2018; 9:325. [PMID: 29593621 PMCID: PMC5861527 DOI: 10.3389/fpsyg.2018.00325] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Accepted: 02/26/2018] [Indexed: 01/07/2023] Open
Abstract
Data from self-report tools cannot be readily compared between cultures due to culturally specific ways of using a response scale. As such, anchoring vignettes have been proposed as a suitable methodology for correcting against this difference. We developed anchoring vignettes for the Big Five Inventory-44 (BFI-44) to supplement its Likert-type response options. Based on two samples (Rwanda: n = 423; Philippines: n = 143), we evaluated the psychometric properties of the measure both before and after applying the anchoring vignette adjustment. Results show that adjusted scores had better measurement properties, including improved reliability and a more orthogonal correlational structure, relative to scores based on the original Likert scale. Correlations of the Big Five Personality Factors with life satisfaction were essentially unchanged after the vignette-adjustment while correlations with counterproductive were noticeably lower. Overall, these changed findings suggest that the use of anchoring vignette methodology improves the cross-cultural comparability of self-reported personality, a finding of potential interest to the field of global workforce research and development as well as educational policymakers.
Collapse
Affiliation(s)
- Selina Weiss
- Department of Individual Differences and Psychological Assessment, Institute of Psychology and Education, Ulm University, Ulm, Germany
| | | |
Collapse
|
9
|
Stankov L, Lee J, von Davier M. A Note on Construct Validity of the Anchoring Method in PISA 2012. JOURNAL OF PSYCHOEDUCATIONAL ASSESSMENT 2017. [DOI: 10.1177/0734282917702270] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
We examine construct validity of the anchoring method used with 12 noncognitive scales from the Programme for International Student Assessment (PISA) 2012 project. This method combines individuals’ responses to vignettes and self-rated scores based on Likert-type items. It has been reported that the use of anchoring vignettes can reverse country-level correlations between academic achievement scores and noncognitive measures from negative to positive, and therefore align them with the typically reported individual-level correlations. Using the PISA 2012 data, we show that construct validity of this approach may be open to question because the anchored scales produce a different set of latent dimensions in comparison with nonanchored scales, even though both scales were created from the same set of individual responses. We also demonstrate that only one of three vignettes may be responsible for the resolution of the “paradox” highlighting that the choice of vignettes may be more important than what was previously reported.
Collapse
Affiliation(s)
- Lazar Stankov
- The University of Sydney, Sydney, Australia
- University of Southern Queensland, Toowoomba, QLD, Australia
| | - Jihyun Lee
- University of New South Wales, Sydney, Australia
| | | |
Collapse
|
10
|
He J, Buchholz J, Klieme E. Effects of Anchoring Vignettes on Comparability and Predictive Validity of Student Self-Reports in 64 Cultures. JOURNAL OF CROSS-CULTURAL PSYCHOLOGY 2017. [DOI: 10.1177/0022022116687395] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Anchoring vignettes are item batteries especially designed for correcting responses that might be affected by incomparability. This article investigates the effects of anchoring vignettes on the validity of student self-report data in 64 cultures. Using secondary data analysis from the 2012 Programme for International Student Assessment (PISA), we checked the validity of ratings on vignette questions, and investigated how rescaled item responses of two student scales, Teacher Support and Classroom Management, enhanced comparability and predictive validity. The main findings include that (a) responses to vignette questions represent valid individual and cultural differences; in particular, violations in these responses (i.e., misorderings) are related to low socioeconomic status and low cognitive sophistication; (b) the rescaled responses tend to show higher levels of comparability; and (c) the associations of rescaled Teacher Support and Classroom Management with math achievement, Student-Oriented Instruction, and Teacher-Directed Instruction are slightly different from raw scores of the two target constructs, and the associations with rescaled scores seem to be more in line with the literature. Namely, the associations among all self-report Likert-type scales are weaker with rescaled scores, presumably reducing common method variance, and both rescaled scale scores are more positively related to math achievement. The country ranking also changes substantially; in particular, Asian cultures top the ranking on Teacher Support after rescaling. However, anchoring vignettes are not a cure-all in solving measurement bias in cross-cultural surveys; we discuss the technicality and directions for further research on this technique.
Collapse
Affiliation(s)
- Jia He
- Tilburg University, The Netherlands
- German Institute for International Educational Research, Frankfurt, Germany
| | - Janine Buchholz
- German Institute for International Educational Research, Frankfurt, Germany
| | - Eckhard Klieme
- German Institute for International Educational Research, Frankfurt, Germany
| |
Collapse
|
11
|
Danner D, Blasius J, Breyer B, Eifler S, Menold N, Paulhus DL, Rammstedt B, Roberts RD, Schmitt M, Ziegler M. Current Challenges, New Developments, and Future Directions in Scale Construction. EUROPEAN JOURNAL OF PSYCHOLOGICAL ASSESSMENT 2016. [DOI: 10.1027/1015-5759/a000375] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- Daniel Danner
- GESIS – Leibniz-Institute for the Social Sciences, Survey Design and Methodology, Mannheim, Germany
| | - Jörg Blasius
- University of Bonn, Institut für Politische Wissenschaft und Soziologie, Germany
| | - Bianka Breyer
- GESIS – Leibniz-Institute for the Social Sciences, Survey Design and Methodology, Mannheim, Germany
| | - Stefanie Eifler
- Catholic University of Eichstätt-Ingolstadt, Sociology Institute, Germany
| | - Natalja Menold
- GESIS – Leibniz-Institute for the Social Sciences, Survey Design and Methodology, Mannheim, Germany
| | - Delroy L. Paulhus
- University of British Columbia, Department of Psychology, Vancouver, Canada
| | - Beatrice Rammstedt
- GESIS – Leibniz-Institute for the Social Sciences, Survey Design and Methodology, Mannheim, Germany
| | - Richard D. Roberts
- Center for Innovative Assessments, Professional Examination Service, New York, NY, USA
| | - Manfred Schmitt
- University of Koblenz-Landau, Fachbereich Psychologie, Germany
| | - Matthias Ziegler
- Institut für Psychologie, Humboldt Universität zu Berlin, Germany
| |
Collapse
|