1. Jović M, Amir-Haeri M, Rimfeld K, Ensink JBM, Lindauer RJL, Vrijkotte TGM, Whitehouse A, van den Berg SM. Harmonization of SDQ and ASEBA Phenotypes: Measurement Variance Across Cohorts. Journal of Psychopathology and Behavioral Assessment 2025; 47:27. [PMID: 40062209] [PMCID: PMC11889055] [DOI: 10.1007/s10862-025-10204-0]
Abstract
Harmonizing the scores obtained by different instruments that measure the same construct enables researchers to combine them in a single analysis. An important step in harmonization is checking whether there is measurement invariance across populations. This study examined whether harmonized scores for anxiety/depression and ADHD obtained with two different instruments, the Child Behavior Checklist (CBCL) and the Strengths and Difficulties Questionnaire (SDQ), are measurement invariant across countries, languages, and age groups. We used cohorts from Australia (1330 children aged 10-11.5 years), the Netherlands (943 children aged 11-13.5 years), and the United Kingdom (4504 children aged 14-19 years). We used the Bayesian method for modeling measurement non-invariance proposed by Verhagen and Fox (2013a), which we adapted for use with polytomous items and a relatively small number of groups (cohorts). Results showed hardly any differential functioning of the harmonized anxiety/depression and ADHD scores obtained with the CBCL and SDQ across cohorts. The same model that harmonizes measures in Australian 10-year-old children can also be used in the cohorts from the UK and the Netherlands. Supplementary information: the online version contains supplementary material available at 10.1007/s10862-025-10204-0.
Affiliation(s)
- Miljan Jović: Department of Learning, Data Analytics and Technology, Faculty of Behavioural, Management and Social Sciences, University of Twente, PO Box 217, 7500 AE Enschede, The Netherlands
- Maryam Amir-Haeri: Department of Learning, Data Analytics and Technology, Faculty of Behavioural, Management and Social Sciences, University of Twente, PO Box 217, 7500 AE Enschede, The Netherlands
- Kaili Rimfeld: Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK; Royal Holloway University of London, London, UK
- Judith B. M. Ensink: Department of Child and Adolescent Psychiatry, Amsterdam University Medical Center, Location AMC, Amsterdam, The Netherlands; Genome Diagnostics Laboratory, Department of Clinical Genetics, Amsterdam University Medical Center, Location AMC, Amsterdam, The Netherlands; Academic Centre for Child and Adolescent Psychiatry, Amsterdam, The Netherlands
- Ramon J. L. Lindauer: Department of Child and Adolescent Psychiatry, Amsterdam University Medical Center, Location AMC, Amsterdam, The Netherlands; Academic Centre for Child and Adolescent Psychiatry, Amsterdam, The Netherlands
- Tanja G. M. Vrijkotte: Department of Public and Occupational Health, Amsterdam University Medical Center, University of Amsterdam, Amsterdam, The Netherlands; Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
- Andrew Whitehouse: Telethon Kids Institute, University of Western Australia, Perth, Australia
- Stéphanie M. van den Berg: Department of Learning, Data Analytics and Technology, Faculty of Behavioural, Management and Social Sciences, University of Twente, PO Box 217, 7500 AE Enschede, The Netherlands

2. Buchanan EM. Visualizemi: Visualization, Effect Size, and Replication of Measurement Invariance for Registered Reports. Assessment 2025; 32:190-205. [PMID: 39473061] [DOI: 10.1177/10731911241280763]
Abstract
Latent variable modeling, as a lens for psychometric theory, is a popular tool for social scientists examining the measurement of constructs. Journals such as Assessment regularly publish articles supporting measures of latent constructs wherein a measurement model is established. Confirmatory factor analysis can be used to investigate the replicability and generalizability of the measurement model in new samples, while multigroup confirmatory factor analysis is used to examine the measurement model across groups within samples. With the rise of the replication crisis and "psychology's renaissance," interest in divergence in measurement has increased, often focused on small parameter differences within the latent model. This article presents visualizemi, an R package that provides functionality to calculate multigroup models and partial invariance, visualizations of (non-)invariance, effect sizes for models and parameters, and potential replication rates compared with random models. Readers will learn how to interpret the impact and size of the proposed non-invariance in models with a focus on potential replication, and how to plan for registered reports.

3. Ozcan M, Lai MHC. Exploring the Impact of Deleting (or Retaining) a Biased Item: A Procedure Based on Classification Accuracy. Assessment 2024:10731911241298081. [PMID: 39655755] [DOI: 10.1177/10731911241298081]
Abstract
Psychological test scores are commonly used in high-stakes settings to classify individuals. While measurement invariance across groups is necessary for valid and meaningful inferences of group differences, full measurement invariance rarely holds in practice. The classification accuracy analysis framework aims to quantify the degree and practical impact of noninvariance. However, how to best navigate the next steps remains unclear, and methods devised to account for noninvariance at the group level may be insufficient when the goal is classification. Furthermore, deleting a biased item may improve fairness but negatively affect performance, and replacing the test can be costly. We propose item-level effect size indices that allow test users to make more informed decisions by quantifying the impact of deleting (or retaining) an item on test performance and fairness, provide an illustrative example, and introduce unbiasr, an R package implementing the proposed methods.
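The item-deletion trade-off the abstract describes can be made concrete with a small simulation. The sketch below is illustrative only and is not the unbiasr implementation; the 2PL item parameters, the DIF shift on the last item, the sum-score cutoff, and the θ ≥ 0 "true" classification rule are all invented for the example.

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical 2PL parameters for a 5-item screener (a = discrimination, b = difficulty).
a = np.array([1.2, 1.0, 1.5, 0.8, 1.1])
b = np.array([-0.5, 0.0, 0.3, -0.2, 0.4])
dif_shift = 0.6  # item 5 (index 4) is harder for the focal group: uniform DIF

def simulate(theta, focal):
    bb = b.copy()
    if focal:
        bb[4] += dif_shift                         # the biased item
    p = 1 / (1 + np.exp(-a * (theta[:, None] - bb)))
    return (rng.random(p.shape) < p).astype(int)

n = 20000
theta_r = rng.normal(0, 1, n)                      # reference group
theta_f = rng.normal(0, 1, n)                      # focal group: same trait distribution
x_r, x_f = simulate(theta_r, False), simulate(theta_f, True)

def accuracy(x, theta, keep):
    observed = x[:, keep].sum(axis=1) >= len(keep) // 2 + 1  # sum-score cutoff
    truth = theta >= 0                                       # simulated "true" status
    return (observed == truth).mean()

all_items = np.arange(5)
no_biased = np.array([0, 1, 2, 3])                 # drop the DIF item

for name, keep in [("all items", all_items), ("biased item deleted", no_biased)]:
    print(f"{name:22s} ref={accuracy(x_r, theta_r, keep):.3f} "
          f"focal={accuracy(x_f, theta_f, keep):.3f}")
```

Comparing the group-wise accuracies with and without the flagged item is the kind of fairness-versus-performance evidence the proposed effect size indices are meant to summarize.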
Affiliation(s)
- Meltem Ozcan: University of Southern California, Los Angeles, USA
- Mark H C Lai: University of Southern California, Los Angeles, USA

4. Oeffinger DJ, Iwinski H, Talwalkar V, Dueber DM. Psychometric analysis and the implications for the use of the Scoliosis Research Society questionnaire (SRS-22r English) for individuals with adolescent idiopathic scoliosis. North American Spine Society Journal 2024; 19:100545. [PMID: 39290847] [PMCID: PMC11405851] [DOI: 10.1016/j.xnsj.2024.100545]
Abstract
Background: Despite widespread usage of the SRS-22r questionnaire (Scoliosis Research Society Questionnaire-22r), the English version has only sparingly been subjected to analysis using modern psychometric techniques for patients with adolescent idiopathic scoliosis (AIS). The study purpose was to improve interpretation and clinical utility of the SRS-22r for adolescents with AIS by generating additional robust evidence using modern statistical techniques. Questions about (1) structure and (2) item and scale functioning are addressed and interpreted for clinicians and researchers. Methods: This retrospective case review analyzed SRS-22r data collected from 1823 patients (mean age 14.9 ± 2.2 years) with a primary diagnosis of AIS who completed an SRS-22r questionnaire in clinic. Individual SRS-22r questions and domain scores were retrieved through data queries. Patient information collected through chart review included diagnosis, age at assessment, sex, race, and radiographic parameters. From 6044 SRS-22r assessments, one assessment per patient was randomly selected. Exploratory structural equation modeling (ESEM) and item response theory (IRT) techniques were used for data modeling, item calibration, and reliability assessment. Results: ESEM demonstrated acceptable fit to the data: χ2(130) = 343.73, p < .001; RMSEA = 0.035; CFI = 0.98; TLI = 0.96; SRMR = 0.02. Several items failed to adequately load onto their assigned factor. Item fit was adequate for all items except SRSq10 (Self-Image), SRSq16 (Mental Health), and SRSq20 (Mental Health). IRT models found that item discriminations were within normal levels for items in psychological measures, except items SRSq1 (Pain), SRSq2 (Pain), and SRSq16 (Mental Health). Estimated reliability of the Function domain (ρ = 0.69) was low; however, the Pain, Self-Image, and Mental Health domains exhibited high (ρ > 0.80) reliability. Conclusions: Modern psychometric assessments of the SRS-22r in adolescent patients with AIS are presented and interpreted to assist clinicians and researchers in understanding its strengths and limitations. Overall, the SRS-22r demonstrated good psychometric properties in all domains except Function. Cautious interpretation of the total score is suggested, as it does not reflect a single HRQoL construct.
Affiliation(s)
- Donna J Oeffinger: Shriners Children's Lexington, 110 Conn Terrace, Lexington, KY 40508, United States
- Henry Iwinski: Shriners Children's Lexington, 110 Conn Terrace, Lexington, KY 40508, United States
- Vishwas Talwalkar: Shriners Children's Lexington, 110 Conn Terrace, Lexington, KY 40508, United States
- David M Dueber: The Herb Innovation Center, University of Toledo, 3100 Gillham Hall, Toledo, OH 43606, United States

5. Widaman KF, Revelle W. Thinking About Sum Scores Yet Again, Maybe the Last Time, We Don't Know, Oh No . . .: A Comment on McNeish (2023). Educational and Psychological Measurement 2024; 84:637-659. [PMID: 39055096] [PMCID: PMC11268387] [DOI: 10.1177/00131644231205310]
Abstract
The relative advantages and disadvantages of sum scores and estimated factor scores are issues of concern for substantive research in psychology. Recently, while championing estimated factor scores over sum scores, McNeish offered a trenchant rejoinder to an article by Widaman and Revelle, which had critiqued an earlier paper by McNeish and Wolf. In the recent contribution, McNeish misrepresented a number of claims by Widaman and Revelle, rendering moot his criticisms of Widaman and Revelle. Notably, McNeish chose to avoid confronting a key strength of sum scores stressed by Widaman and Revelle: the greater comparability of results across studies if sum scores are used. Instead, McNeish pivoted to present a host of simulation studies to identify relative strengths of estimated factor scores. Here, we review our prior claims and, in the process, deflect purported criticisms by McNeish. We briefly discuss issues related to simulated data and empirical data that provide evidence of the strengths of each type of score. In doing so, we identified a second strength of sum scores: superior cross-validation of results across independent samples of empirical data, at least for samples of moderate size. We close with consideration of four general issues concerning sum scores and estimated factor scores that highlight the contrasts between the positions offered by McNeish and by us, issues of importance when pursuing applied research in our field.

6. Black L, Humphrey N, Panayiotou M, Marquez J. Mental Health and Well-being Measures for Mean Comparison and Screening in Adolescents: An Assessment of Unidimensionality and Sex and Age Measurement Invariance. Assessment 2024; 31:219-236. [PMID: 36864693] [PMCID: PMC10822075] [DOI: 10.1177/10731911231158623]
Abstract
Adolescence is a period of increased vulnerability for low well-being and mental health problems, particularly for girls and older adolescents. Accurate measurement via brief self-report is therefore vital to understanding prevalence, group trends, screening efforts, and response to intervention. We drew on data from the #BeeWell study (N = 37,149, aged 12-15) to consider whether sum-scoring, mean comparisons, and deployment for screening were likely to show bias for eight such measures. Evidence for unidimensionality, considering dynamic fit confirmatory factor models, exploratory graph analysis, and bifactor modeling, was found for five measures. Of these five, most showed a degree of non-invariance across sex and age likely incompatible with mean comparison. Effects on selection were minimal, except sensitivity was substantially lower in boys for the internalizing symptoms measure. Measure-specific insights are discussed, as are general issues highlighted by our analysis, such as item reversals and measurement invariance.

7. DeCarlo M, Bean G. Assessing Measurement Invariance in ASWB Exams: Regulatory Research Proposal to Advance Equity. Journal of Evidence-Based Social Work (2019) 2024; 21:214-235. [PMID: 38345106] [DOI: 10.1080/26408066.2024.2308814]
Abstract
PURPOSE: Social workers from minoritized racial, ethnic, linguistic, and age groups are far less likely to pass the licensing examinations required to practice. Using a simulated data set, our study investigates measurement equivalence, or invariance, of social work licensing exams. MATERIALS: For this analysis, we simulated responses to 15 multiple-choice questions, scored as either correct or incorrect, using the R mirt package, and used mirt to fit a two-parameter logistic (2PL) model to the response data. We generated the data so that five items could demonstrate DIF and calculated their impact on the test characteristic curves and item characteristic curves. RESULTS: Small amounts of differential item functioning added up into differential test functioning, but the effect size was small. This result is one potential outcome of an analysis of ASWB exams. DISCUSSION: Most studies evaluating test characteristic curves demonstrate small effect sizes. Measuring the test characteristic curve and the test information curve will help to investigate content-irrelevant sources of variance in the exams, including unfairness, unreliability, and invalid pass scores. CONCLUSION: Differential test functioning is a core part of measurement invariance studies. Psychometric standards require test developers to assess measurement invariance at both the item level and the test level to protect themselves from accusations of bias.
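For readers who want to see the mechanics, here is a rough Python analogue of the simulation design described above (the original analysis used the R mirt package). The parameter values, the 0.4 DIF shift on five items, and the standard-normal weighting are placeholders chosen for illustration.

```python
import numpy as np

# Hypothetical parameters for a 15-item dichotomous test (2PL), echoing the
# abstract's design: five items carry uniform DIF against the focal group.
rng = np.random.default_rng(42)
a = rng.uniform(0.8, 2.0, 15)            # discriminations
b_ref = rng.normal(0.0, 1.0, 15)         # reference-group difficulties
b_foc = b_ref.copy()
b_foc[:5] += 0.4                          # small uniform DIF in five items

def tcc(theta, a, b):
    """Test characteristic curve: expected total score at each theta."""
    p = 1 / (1 + np.exp(-a * (theta[:, None] - b)))
    return p.sum(axis=1)

theta = np.linspace(-3, 3, 121)
dtf = tcc(theta, a, b_foc) - tcc(theta, a, b_ref)   # differential test functioning

# Signed DTF averaged over a standard-normal trait density, as one effect size.
w = np.exp(-theta**2 / 2)
w /= w.sum()
print(f"max |DTF| = {np.abs(dtf).max():.3f} points; "
      f"weighted signed DTF = {(dtf * w).sum():.3f} points")
```

Several small item-level shifts in the same direction accumulate in the test characteristic curve, which is exactly the item-to-test aggregation the abstract highlights.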
Affiliation(s)
- Matthew DeCarlo: College of Education and Human Development, Saint Joseph's University, Philadelphia, Pennsylvania, USA
- Gerald Bean: College of Social Work, College of Education & Human Ecology, The Ohio State University, Columbus, Ohio, USA

8. Goldammer P, Annen H, Lienhard C, Jonas K. An examination of model fit and measurement invariance of general mental ability and personality measures used in the multilingual context of the Swiss Armed Forces: A Bayesian structural equation modeling approach. Military Psychology 2024; 36:96-113. [PMID: 38193872] [PMCID: PMC10790799] [DOI: 10.1080/08995605.2021.1963632]
Abstract
Measurement invariance of psychological test batteries is an essential quality criterion when the test batteries are administered in different cultural and language contexts. The purpose of this study was to examine to what extent measurement model fit and measurement invariance across the two largest language groups in Switzerland (i.e., German and French speakers) can be assumed for selected general mental ability and personality tests used in the Swiss Armed Forces' cadre selection process. For the model fit and invariance testing, we used Bayesian structural equation modeling (BSEM). Because the sizes of the language group samples were unbalanced, we reran the invariance testing with a subsampling procedure as a robustness check. The results showed that at least partial approximate scalar invariance can be assumed for the constructs. However, comparisons in the full sample and subsamples also showed that certain test items function differently across the language groups. The results are discussed with regard to three issues. First, we critically discuss the applied criterion and alternative effect size measures for assessing the practical importance of non-invariances. Second, we highlight potential remedies and further testing options that can be applied once certain items have been detected to function differently. Third, we discuss alternative modeling and invariance testing approaches to BSEM and outline future research avenues.
Affiliation(s)
- Philippe Goldammer: Department of Military Psychology and Pedagogics, Military Academy at ETH Zurich, Birmensdorf, Switzerland
- Hubert Annen: Department of Military Psychology and Pedagogics, Military Academy at ETH Zurich, Birmensdorf, Switzerland
- Klaus Jonas: Department of Psychology, University of Zurich, Zurich, Switzerland

9. Richson BN, Hazzard VM, Christensen KA, Hagan KE. Do the SCOFF items function differently by food-security status in U.S. college students? Statistically, but not practically, significant differences. Eat Behav 2023; 49:101743. [PMID: 37209568] [PMCID: PMC10681748] [DOI: 10.1016/j.eatbeh.2023.101743]
Abstract
Despite food insecurity (FI) being associated with eating disorders (EDs), little research has examined whether ED screening measures perform differently in individuals with FI. This study tested whether items on the SCOFF performed differently as a function of FI. Because many people with FI hold multiple marginalized identities, this study also tested whether the SCOFF performs differently as a function of food-security status in individuals with different gender identities and different perceived weight statuses. Data were from the 2020/2021 Healthy Minds Study (N = 122,269). Past-year FI was established using the two-item Hunger Vital Sign. Differential item functioning (DIF) assessed whether SCOFF items performed differently (i.e., had different probabilities of endorsement) in groups of individuals with versus without FI. Both uniform DIF (a constant between-group difference in item-endorsement probability across levels of ED pathology) and non-uniform DIF (a variable between-group difference in item-endorsement probability across levels of ED pathology) were examined. Several SCOFF items demonstrated both statistically significant uniform and non-uniform DIF (ps < .001), but no instance of DIF reached practical significance (indicated by effect sizes of pseudo ΔR² ≥ .035; all pseudo ΔR² ≤ .006). When stratifying by gender identity and weight status, although most items demonstrated statistically significant DIF, only the SCOFF item measuring body-size perception showed practically significant non-uniform DIF for perceived weight status. Findings suggest the SCOFF is an appropriate screening measure for ED pathology among college students with FI and provide preliminary support for using the SCOFF in individuals with FI and certain marginalized identities.
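The logistic-regression DIF procedure with McFadden's pseudo ΔR², which the abstract's ≥ .035 benchmark refers to, is compact enough to sketch. Everything below is simulated for illustration (a rest-score stand-in for ED pathology and an invented 0.3 uniform-DIF shift), not the Healthy Minds data.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 5000
group = rng.integers(0, 2, n)                 # 0 = food secure, 1 = food insecure
trait = rng.normal(0, 1, n)                   # stand-in for ED pathology (rest score)
# Hypothetical item with mild uniform DIF: food-insecure respondents endorse
# the item slightly more often at the same trait level.
logit = 1.3 * trait - 0.5 + 0.3 * group
y = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

def mcfadden(y, X):
    fit = sm.Logit(y, sm.add_constant(X)).fit(disp=0)
    return 1 - fit.llf / fit.llnull

r2_base = mcfadden(y, np.column_stack([trait]))                        # trait only
r2_unif = mcfadden(y, np.column_stack([trait, group]))                 # + group
r2_nonu = mcfadden(y, np.column_stack([trait, group, trait * group]))  # + interaction

print(f"uniform DIF pseudo dR2 = {r2_unif - r2_base:.4f}")
print(f"non-uniform DIF pseudo dR2 = {r2_nonu - r2_unif:.4f}")
# Values below the >= .035 benchmark cited in the abstract would be judged
# statistically detectable but practically negligible.
```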
Affiliation(s)
- Brianne N Richson: Department of Psychology, University of Kansas, Lawrence, KS, USA; Department of Psychiatry, University of California San Diego Eating Disorders Center for Treatment and Research, San Diego, CA, USA
- Vivienne M Hazzard: Division of Epidemiology & Community Health, University of Minnesota School of Public Health, Minneapolis, MN, USA
- Kara A Christensen: Department of Psychology, University of Nevada, Las Vegas, Las Vegas, NV, USA
- Kelsey E Hagan: Department of Psychiatry, Columbia University Irving Medical Center, New York, NY, USA; New York State Psychiatric Institute, New York, NY, USA

10. Chalmers RP. A Unified Comparison of IRT-Based Effect Sizes for DIF Investigations. Journal of Educational Measurement 2022. [DOI: 10.1111/jedm.12347]

11. Joo S, Lee P. Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics. Journal of Educational Measurement 2022. [DOI: 10.1111/jedm.12316]

12. Taple BJ, Chapman R, Schalet BD, Brower R, Griffith JW. The Impact of Education on Depression Assessment: Differential Item Functioning Analysis. Assessment 2022; 29:272-284. [PMID: 33218257] [PMCID: PMC9060911] [DOI: 10.1177/1073191120971357]
Abstract
A person's level of education can affect their access to health care and their health outcomes. Increasing rates of depression are another looming public health concern. Vulnerability is therefore compounded for individuals who have both a lower level of education and depression. Assessment of depressive symptoms is integral to many domains of health care, including primary care and mental health specialty care. This investigation examined the degree to which education influences the psychometric properties of self-report items that measure depressive symptoms. This study was a secondary data analysis of three large internet panel studies. Together, the studies included the Beck Depression Inventory-II, the Center for Epidemiologic Studies Depression Scale, the Patient Health Questionnaire, and the Patient-Reported Outcomes Measurement Information System measures of depression. Using a differential item functioning (DIF) approach, we found that some items on each of the questionnaires were flagged for DIF, with effect sizes ranging from McFadden's pseudo R² = .005 to .022. For example, several double-barreled questions were flagged for DIF. Overall, questionnaires assessing depression vary in level of complexity, which interacts with the respondent's level of education. Measurement of depression should include consideration of possible educational disparities, to identify people who may struggle with a written questionnaire or may be subject to subtle psychometric biases associated with education.
Affiliation(s)
- Bayley J Taple: Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Robert Chapman: Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Rylee Brower: Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- James W Griffith: Northwestern University Feinberg School of Medicine, Chicago, IL, USA

13. Tay L, Woo SE, Hickman L, Booth BM, D’Mello S. A Conceptual Framework for Investigating and Mitigating Machine-Learning Measurement Bias (MLMB) in Psychological Assessment. Advances in Methods and Practices in Psychological Science 2022. [DOI: 10.1177/25152459211061337]
Abstract
Given significant concerns about fairness and bias in the use of artificial intelligence (AI) and machine learning (ML) for psychological assessment, we provide a conceptual framework for investigating and mitigating machine-learning measurement bias (MLMB) from a psychometric perspective. MLMB is defined as differential functioning of the trained ML model between subgroups. MLMB manifests empirically when a trained ML model produces different predicted score levels for different subgroups (e.g., race, gender) despite them having the same ground-truth levels for the underlying construct of interest (e.g., personality) and/or when the model yields differential predictive accuracies across the subgroups. Because the development of ML models involves both data and algorithms, both biased data and algorithm-training bias are potential sources of MLMB. Data bias can occur in the form of nonequivalence between subgroups in the ground truth, platform-based construct, behavioral expression, and/or feature computing. Algorithm-training bias can occur when algorithms are developed with nonequivalence in the relation between extracted features and ground truth (i.e., algorithm features are differentially used, weighted, or transformed between subgroups). We explain how these potential sources of bias may manifest during ML model development and share initial ideas for mitigating them, including recognizing that new statistical and algorithmic procedures need to be developed. We also discuss how this framework clarifies MLMB but does not reduce the complexity of the issue.
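The abstract's definition of MLMB suggests a direct empirical check: condition on ground truth and compare predicted-score levels and predictive accuracies by subgroup. A toy sketch under invented data assumptions (not a procedure from the paper itself):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 4000
group = rng.integers(0, 2, n)             # two hypothetical demographic subgroups
truth = rng.normal(0, 1, n)               # ground-truth construct level

# Hypothetical trained-model scores: same construct, but the model under-scores
# group 1 (a data/algorithm bias) and is noisier for that group.
pred = truth - 0.25 * group + rng.normal(0, 0.5 + 0.2 * group, n)

for g in (0, 1):
    m = group == g
    bias = (pred[m] - truth[m]).mean()            # mean predicted-minus-true gap
    rmse = np.sqrt(((pred[m] - truth[m]) ** 2).mean())
    r = np.corrcoef(pred[m], truth[m])[0, 1]      # predictive accuracy
    print(f"group {g}: bias={bias:+.3f}  rmse={rmse:.3f}  r={r:.3f}")
# Nonzero bias gaps and unequal accuracies across groups are the two empirical
# signatures of MLMB described in the abstract.
```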
Affiliation(s)
- Louis Tay: Department of Psychological Sciences, Purdue University, West Lafayette, Indiana
- Sang Eun Woo: Department of Psychological Sciences, Purdue University, West Lafayette, Indiana
- Louis Hickman: The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania
- Brandon M. Booth: Institute of Cognitive Science, University of Colorado Boulder, Boulder, Colorado
- Sidney D’Mello: Institute of Cognitive Science, University of Colorado Boulder, Boulder, Colorado

14. Lutz PK, O'Connor BP, Folk D. Dimensionality, Item Response Theory, Effect Size Attenuation, and Test Bias Analyses of the Self-Importance of Moral Identity Scale (SIMIS). J Pers Assess 2021; 104:586-598. [PMID: 34704515] [DOI: 10.1080/00223891.2021.1991359]
Abstract
The extent to which morality and being a moral person are important to one's identity is most commonly assessed using Aquino and Reed's (2002) Self-Importance of Moral Identity Scale (SIMIS). This study provided detailed psychometric examinations of the structure and discrimination levels of the SIMIS in a large (N = 2108) and heterogeneous sample. Results indicated that the SIMIS is clearly 2-dimensional, as expected. The Internalization and Symbolization subscales provided sufficient, and sometimes high levels of test information across the latent trait continuums. There were no redundant items and no bias based on gender. The most notable, albeit minor, shortcomings were that there are too many response options and that test information (discrimination power) was diminished at high levels of the Internalization latent trait continuum, apparently due to skewness. The fluctuating levels of measurement precision resulted in slightly greater attenuations in effect sizes for Internalization than for Symbolization across data for 31 other measures. The present findings from a large dataset and a variety of modern, revealing statistical methods provided relatively consistent, favorable findings for the measure.
Affiliation(s)
- Brian P O'Connor: Department of Psychology, University of British Columbia, Okanagan

15. Teresi JA, Wang C, Kleinman M, Jones RN, Weiss DJ. Differential Item Functioning Analyses of the Patient-Reported Outcomes Measurement Information System (PROMIS®) Measures: Methods, Challenges, Advances, and Future Directions. Psychometrika 2021; 86:674-711. [PMID: 34251615] [PMCID: PMC8889890] [DOI: 10.1007/s11336-021-09775-0]
Abstract
Several methods used to examine differential item functioning (DIF) in Patient-Reported Outcomes Measurement Information System (PROMIS®) measures are presented, including effect size estimation. A summary of factors that may affect DIF detection and challenges encountered in PROMIS DIF analyses, e.g., anchor item selection, is provided. An issue in PROMIS was the potential for inadequately modeled multidimensionality to result in false DIF detection. Section 1 is a presentation of the unidimensional models used by most PROMIS investigators for DIF detection, as well as their multidimensional expansions. Section 2 is an illustration that builds on previous unidimensional analyses of depression and anxiety short-forms to examine DIF detection using a multidimensional item response theory (MIRT) model. The Item Response Theory-Log-likelihood Ratio Test (IRT-LRT) method was used for a real data illustration with gender as the grouping variable. The IRT-LRT DIF detection method is a flexible approach to handle group differences in trait distributions, known as impact in the DIF literature, and was studied with both real data and in simulations to compare the performance of the IRT-LRT method within the unidimensional IRT (UIRT) and MIRT contexts. Additionally, different effect size measures were compared for the data presented in Section 2. A finding from the real data illustration was that using the IRT-LRT method within a MIRT context resulted in more flagged items as compared to using the IRT-LRT method within a UIRT context. The simulations provided some evidence that while unidimensional and multidimensional approaches were similar in terms of Type I error rates, power for DIF detection was greater for the multidimensional approach. Effect size measures presented in Section 1 and applied in Section 2 varied in terms of estimation methods, choice of density function, methods of equating, and anchor item selection. Despite these differences, there was considerable consistency in results, especially for the items showing the largest values. Future work is needed to examine DIF detection in the context of polytomous, multidimensional data. PROMIS standards included incorporation of effect size measures in determining salient DIF. Integrated methods for examining effect size measures in the context of IRT-based DIF detection procedures are still in early stages of development.
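The IRT-LRT comparison at the heart of the illustration (a model constraining a studied item's parameters to be equal across groups against a model freeing them) can be schematized as below. This is a deliberately simplified logistic stand-in using a rest score as the conditioning variable, not the unidimensional or multidimensional IRT machinery evaluated in the paper; all data and parameter values are simulated.

```python
import numpy as np
from scipy import stats
import statsmodels.api as sm

rng = np.random.default_rng(3)
n, k = 3000, 10
group = rng.integers(0, 2, n)
theta = rng.normal(0, 1, n) + 0.3 * group         # group trait difference = impact
b = rng.normal(0, 1, k)
p = 1 / (1 + np.exp(-(theta[:, None] - b)))       # Rasch-like response probabilities
x = (rng.random((n, k)) < p).astype(int)
m = group == 1                                    # inject uniform DIF into item 0
x[m, 0] = (rng.random(m.sum()) <
           1 / (1 + np.exp(-(theta[m] - b[0] - 0.5)))).astype(int)

rest = x[:, 1:].sum(axis=1)        # anchor items: rest score as a crude theta proxy

def loglik(X):
    return sm.Logit(x[:, 0], sm.add_constant(X)).fit(disp=0).llf

ll_constrained = loglik(rest[:, None])                           # parameters equal
ll_free = loglik(np.column_stack([rest, group, rest * group]))   # group-specific
g2 = 2 * (ll_free - ll_constrained)
print(f"G2 = {g2:.2f}, df = 2, p = {stats.chi2.sf(g2, 2):.4g}")  # flags item 0
```

Because the free model absorbs trait-distribution differences through the conditioning variable, the test targets the item, which is why the IRT-LRT approach handles impact gracefully.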
Affiliation(s)
- Jeanne A Teresi: Columbia University Stroud Center, New York, NY, USA; Research Division, Hebrew Home at Riverdale, RiverSpring Health, Bronx, NY, USA; Department of Geriatrics and Palliative Medicine, Weill Cornell Medical Center, New York, NY, USA; New York State Psychiatric Institute, New York, NY, USA
- Chun Wang: Center for Statistics and the Social Sciences (Affiliate), University of Washington College of Education, Seattle, WA, USA
- Richard N Jones: Department of Psychiatry and Human Behavior, Warren Alpert Medical School, Brown University, Providence, RI, USA

16. Lee P, Joo SH, Stark S. Detecting DIF in Multidimensional Forced Choice Measures Using the Thurstonian Item Response Theory Model. Organizational Research Methods 2020. [DOI: 10.1177/1094428120959822]
Abstract
Although modern item response theory (IRT) methods of test construction and scoring have overcome the ipsativity problems historically associated with multidimensional forced choice (MFC) formats, there has been little research on MFC differential item functioning (DIF) detection, where item refers to a block, or group, of statements presented for an examinee’s consideration. This research investigated DIF detection with three-alternative MFC items based on the Thurstonian IRT (TIRT) model, using omnibus Wald tests on loadings and thresholds. We examined constrained and free baseline model comparison strategies with different types and magnitudes of DIF, latent trait correlations, sample sizes, and levels of impact in an extensive Monte Carlo study. Results indicated the free baseline strategy was highly effective in detecting DIF, with power approaching 1.0 in the large sample size and large magnitude of DIF conditions, and similar effectiveness in the impact and no-impact conditions. This research also included an empirical example to demonstrate the viability of the best performing method with real examinees and showed how DIF and DTF effect size measures can be used to assess the practical significance of MFC DIF findings.

17. Dong Y, Dumas D. Are personality measures valid for different populations? A systematic review of measurement invariance across cultures, gender, and age. Personality and Individual Differences 2020. [DOI: 10.1016/j.paid.2020.109956]

18. Lineberry M, Park YS, Hennessy SA, Ritter EM. The Fundamentals of Endoscopic Surgery (FES) skills test: factors associated with first-attempt scores and pass rate. Surg Endosc 2020; 34:3633-3643. [PMID: 32519273] [DOI: 10.1007/s00464-020-07690-6]
Abstract
BACKGROUND: The Fundamentals of Endoscopic Surgery (FES) program became required for American Board of Surgery certification as part of the Flexible Endoscopy Curriculum (FEC) for residents graduating in 2018. This study expands prior psychometric investigation of the FES skills test. METHODS: We analyzed de-identified first-attempt skills test scores and self-reported demographic characteristics of 2023 general surgery residents who were required to pass FES. RESULTS: The overall pass rate was 83%. "Loop Reduction" was the most difficult subtask. Subtasks related to one another only modestly (Spearman's ρ ranging from 0.11 to 0.42; coefficient α = .55). Both upper and lower endoscopic procedural experience had modest positive associations with scores (ρ = 0.14 and 0.15) and passing. Examinees who tested on the GI Mentor Express simulator had lower total scores and a lower pass rate than those tested on the GI Mentor II (pass rates = 73% vs. 85%). Removing an Express-specific scoring rule that had been applied eliminated these differences. Gender, glove size, and height were closely related. Women scored lower than men (408- vs. 489-point averages) and had a lower first-attempt pass rate (71% vs. 92%). Glove size correlated positively with score (ρ = 0.31) and pass rate. Finally, height correlated positively with score (r = 0.27) and pass rate. Statistically controlling for glove size and height did not eliminate gender differences, with men still having 3.2 times greater odds of passing. CONCLUSIONS: FES skills test scores show both consistencies with the assessment's validity argument and several remarkable findings. Subtasks reflect distinct skills, so passing standards should perhaps be set for each subtask. The Express simulator-specific scoring penalty should be removed. The differences seen by gender are concerning. We argue those differences do not reflect measurement bias, but rather highlight equity concerns in surgical technology, training, and practice.
Affiliation(s)
- Matthew Lineberry: Zamierowski Institute for Experiential Learning and Department of Population Health, University of Kansas Medical Center and Health System, 3901 Rainbow Boulevard, Sudler Hall G005, Kansas City, KS 66160, USA
- Yoon Soo Park: Department of Medical Education, University of Illinois at Chicago, Chicago, IL, USA
- Sara A Hennessy: Department of Surgery, UT Southwestern Medical Center, Dallas, TX, USA
- E Matthew Ritter: Division of General Surgery, Department of Surgery, Uniformed Services University/Walter Reed National Military Medical Center, Bethesda, MD, USA

19. Terluin B, van der Wouden JC, de Vet HCW. Measurement equivalence of the Four-Dimensional Symptom Questionnaire (4DSQ) in adolescents and emerging adults. PLoS One 2019; 14:e0221904. [PMID: 31465490] [PMCID: PMC6715201] [DOI: 10.1371/journal.pone.0221904]
Abstract
The Four-Dimensional Symptom Questionnaire (4DSQ) is a self-report instrument measuring distress, depression, anxiety and somatization. The questionnaire has been developed and validated in adult samples. It is unknown whether adolescents and emerging adults respond to the 4DSQ items in the same way as adults do. The objective of the study was to examine measurement equivalence of the 4DSQ across adolescents, emerging adults and adults. 4DSQ data were collected in a primary care psychotherapy practice (N = 1349). Measurement equivalence was assessed using differential item and test functioning (DIF and DTF) analysis in an item response theory framework. DIF was compared across the following groups: adolescents (age 10–17), emerging adults (age 18–25), and adults (age 26–40). DIF was found in 9 items (out of 50) across adolescents and adults, and in 4 items across emerging adults and adults. The item with the largest DIF was ‘difficulty getting to sleep’, which was less severe for adolescents compared to adults. A likely explanation is that adolescents have a high base rate for problems with sleep initiation. The effect of DIF on the scale scores (DTF) was negligible. Adolescents and emerging adults score some 4DSQ items differently compared to adults but this had practically no effect on 4DSQ scale scores. 4DSQ scale scores from adolescents and emerging adults can be interpreted in the same way as 4DSQ scores from adults.
Affiliation(s)
- Berend Terluin: Department of General Practice and Elderly Care Medicine, Amsterdam Public Health research institute, Amsterdam UMC–Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Johannes C. van der Wouden: Department of General Practice and Elderly Care Medicine, Amsterdam Public Health research institute, Amsterdam UMC–Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Henrica C. W. de Vet: Department of Epidemiology and Biostatistics, Amsterdam Public Health research institute, Amsterdam UMC–Vrije Universiteit Amsterdam, Amsterdam, The Netherlands

20. Quantifying the impact of partial measurement invariance in diagnostic research: An application to addiction research. Addict Behav 2019; 94:50-56. [PMID: 30502928] [DOI: 10.1016/j.addbeh.2018.11.029]
Abstract
Establishing measurement invariance, or that an instrument measures the same construct(s) in the same way across subgroups of respondents, is crucial in efforts to validate social and behavioral instruments. Although substantial previous research has focused on detecting the presence of noninvariance, less attention has been devoted to its practical significance and even less has been paid to its possible impact on diagnostic accuracy. In this article, we draw additional attention to the importance of measurement invariance and advance diagnostic research by introducing a novel approach for quantifying the impact of noninvariance with binary items (e.g., the presence or absence of symptoms). We illustrate this approach by testing measurement invariance and evaluating diagnostic accuracy across age groups using DSM alcohol use disorder items from a public national data set. By providing researchers with an easy-to-implement R program for examining diagnostic accuracy with binary items, this article sets the stage for future evaluations of the practical significance of partial invariance. Future work can extend our framework to include ordinal and categorical indicators, other measurement models in item response theory, settings with three or more groups, and via comparison to an external, "gold-standard" validator.
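One way to read "impact on diagnostic accuracy" operationally: score binary symptom items under partial invariance and compare sensitivity and specificity across groups against a known simulated diagnostic status. The sketch below uses invented item parameters, an invented symptom-count rule, and a single noninvariant item; it is not the R program the article supplies.

```python
import numpy as np

rng = np.random.default_rng(5)
n = 30000
age_old = rng.integers(0, 2, n)            # 0 = younger, 1 = older group
severity = rng.normal(0, 1, n)             # latent disorder severity
disorder = severity > 0.75                 # hypothetical "true" diagnostic status

# Eleven binary symptom items; one item is "easier" to endorse for the older
# group at the same severity level (partial invariance).
b = np.linspace(-0.5, 1.5, 11)
b_grp = np.tile(b, (n, 1))
b_grp[age_old == 1, 0] -= 0.7
p = 1 / (1 + np.exp(-1.4 * (severity[:, None] - b_grp)))
symptoms = (rng.random(p.shape) < p).astype(int)
diagnosed = symptoms.sum(axis=1) >= 6      # DSM-style symptom-count rule

for g, label in [(0, "younger"), (1, "older")]:
    m = age_old == g
    sens = (diagnosed[m] & disorder[m]).sum() / disorder[m].sum()
    spec = (~diagnosed[m] & ~disorder[m]).sum() / (~disorder[m]).sum()
    print(f"{label}: sensitivity={sens:.3f} specificity={spec:.3f}")
```

Diverging sensitivity or specificity between the groups, despite identical true prevalence mechanics, is the practical cost of noninvariance that the article's framework is designed to quantify.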

21. Nye CD, Joo SH, Zhang B, Stark S. Advancing and Evaluating IRT Model Data Fit Indices in Organizational Research. Organizational Research Methods 2019. [DOI: 10.1177/1094428119833158]
Abstract
Item response theory (IRT) models have a number of advantages for developing and evaluating scales in organizational research. However, these advantages can be obtained only when the IRT model used to estimate the parameters fits the data well. Therefore, examining IRT model fit is important before drawing conclusions from the data. To test model fit, a wide range of indices are available in the IRT literature and have demonstrated utility in past research. Nevertheless, the performance of many of these indices for detecting misfit has not been directly compared in simulations. The current study evaluates a number of these indices to determine their utility for detecting various types of misfit in both dominance and ideal point IRT models. Results indicate that some indices are more effective than others but that none of the indices accurately detected misfit due to multidimensionality in the data. The implications of these results for future organizational research are discussed.
Affiliation(s)
- Bo Zhang: University of Illinois at Urbana-Champaign, Champaign, IL, USA

22. Shi D, Song H, DiStefano C, Maydeu-Olivares A, McDaniel HL, Jiang Z. Evaluating Factorial Invariance: An Interval Estimation Approach Using Bayesian Structural Equation Modeling. Multivariate Behavioral Research 2019; 54:224-245. [PMID: 30569738] [DOI: 10.1080/00273171.2018.1514484]
Abstract
In this study, we introduce an interval estimation approach based on Bayesian structural equation modeling to evaluate factorial invariance. For each tested parameter, the size of noninvariance with an uncertainty interval (i.e., a highest density interval [HDI]) is assessed via Bayesian parameter estimation. By comparing the most credible values (i.e., the 95% HDI) with a region of practical equivalence (ROPE), the Bayesian approach allows researchers to (1) support the null hypothesis of practical invariance, and (2) examine the practical importance of a noninvariant parameter. Compared to the traditional likelihood ratio test, simulation results suggested that the proposed Bayesian approach can offer additional insight into evaluating factorial invariance, thus leading to more informative conclusions. We provide an empirical example to demonstrate the procedures necessary to implement the proposed method in applied research. The importance of and influences on the choice of an appropriate ROPE are discussed.
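The HDI-versus-ROPE decision rule described here is easy to state in code. A minimal sketch, assuming posterior draws of a group difference in a loading are already available; the ±0.1 ROPE bounds and the stand-in normal "posteriors" are arbitrary choices for illustration.

```python
import numpy as np

def hdi(draws, mass=0.95):
    """Narrowest interval containing `mass` of the posterior draws."""
    d = np.sort(np.asarray(draws))
    k = int(np.ceil(mass * len(d)))
    widths = d[k - 1:] - d[:len(d) - k + 1]
    i = int(np.argmin(widths))
    return d[i], d[i + k - 1]

def rope_decision(draws, rope=(-0.1, 0.1)):
    lo, hi = hdi(draws)
    if rope[0] <= lo and hi <= rope[1]:
        return (lo, hi), "practically invariant (HDI inside ROPE)"
    if hi < rope[0] or lo > rope[1]:
        return (lo, hi), "noninvariant (HDI outside ROPE)"
    return (lo, hi), "undecided (HDI overlaps ROPE)"

# Posterior draws of a loading difference between two groups (stand-in values).
rng = np.random.default_rng(11)
for d in (rng.normal(0.02, 0.03, 8000), rng.normal(0.25, 0.05, 8000)):
    (lo, hi), verdict = rope_decision(d)
    print(f"95% HDI = [{lo:+.3f}, {hi:+.3f}] -> {verdict}")
```

The first stand-in posterior supports practical invariance outright, which a point-null likelihood ratio test can never do; that asymmetry is the approach's main selling point.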

23. Sommer M, Arendasy ME, Punter JF, Feldhammer-Kahr M, Rieder A. Do individual differences in test-takers' appraisal of admission testing compromise measurement fairness? Intelligence 2019. [DOI: 10.1016/j.intell.2019.01.006]

24. Herde CN, Lievens F, Solberg EG, Harbaugh JL, Strong MH, Burkholder GJ. Situational Judgment Tests as Measures of 21st Century Skills: Evidence across Europe and Latin America. Revista de Psicología del Trabajo y de las Organizaciones 2019. [DOI: 10.5093/jwop2019a8]

25. Chalmers RP. Model-Based Measures for Detecting and Quantifying Response Bias. Psychometrika 2018; 83:696-732. [PMID: 29907891] [DOI: 10.1007/s11336-018-9626-9]
Abstract
This paper proposes a model-based family of detection and quantification statistics to evaluate response bias in item bundles of any size. Compensatory (CDRF) and non-compensatory (NCDRF) response bias measures are proposed, along with their sample realizations and large-sample variability when models are fitted using multiple-group estimation. Based on the underlying connection to item response theory estimation methodology, it is argued that these new statistics provide a powerful and flexible approach to studying response bias for categorical response data over and above methods that have previously appeared in the literature. To evaluate their practical utility, CDRF and NCDRF are compared to the closely related SIBTEST family of statistics and likelihood-based detection methods through a series of Monte Carlo simulations. Results indicate that the new statistics are more optimal effect size estimates of marginal response bias than the SIBTEST family, are competitive with a selection of likelihood-based methods when studying item-level bias, and are the most optimal when studying differential bundle and test bias.
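In spirit, CDRF integrates the signed difference between the groups' expected bundle scores over the focal-group trait density, while NCDRF integrates the unsigned difference, so opposite-direction DIF cancels in the former but not the latter. A numeric sketch with invented 2PL parameters, not the estimation machinery the paper develops:

```python
import numpy as np

# Hypothetical 2PL parameters for a three-item bundle in two groups.
a = np.array([1.2, 0.9, 1.5])
b_ref = np.array([-0.3, 0.2, 0.8])
b_foc = b_ref + np.array([0.3, 0.0, -0.2])      # mixed-direction DIF

def expected_score(theta, b):
    p = 1 / (1 + np.exp(-a * (theta[:, None] - b)))
    return p.sum(axis=1)                         # bundle "true score" at theta

theta = np.linspace(-4, 4, 401)
f = np.exp(-theta**2 / 2) / np.sqrt(2 * np.pi)   # focal-group density, here N(0,1)
diff = expected_score(theta, b_foc) - expected_score(theta, b_ref)

cdrf = np.trapz(diff * f, theta)                 # signed/compensatory: can cancel
ncdrf = np.trapz(np.abs(diff) * f, theta)        # unsigned/non-compensatory
print(f"CDRF  (signed)   = {cdrf:+.4f} score points")
print(f"NCDRF (unsigned) = {ncdrf:.4f} score points")
```

With the mixed-direction shifts above, CDRF sits near zero while NCDRF stays clearly positive, which is exactly the compensatory/non-compensatory distinction the statistics are named for.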
Affiliation(s)
- R Philip Chalmers: Department of Educational Psychology, The University of Georgia, 323 Aderhold Hall, Athens, GA 30602, USA

26. Rome L, Zhang B. Investigating the Effects of Differential Item Functioning on Proficiency Classification. Applied Psychological Measurement 2018; 42:259-274. [PMID: 29881124] [PMCID: PMC5978605] [DOI: 10.1177/0146621617726789]
Abstract
This study provides a comprehensive evaluation of the effects of differential item functioning (DIF) on proficiency classification. Using Monte Carlo simulation, item- and test-level DIF magnitudes were varied systematically to investigate their impact on proficiency classification at multiple decision points. Findings from this study clearly show that the presence of DIF affects proficiency classification not by lowering the overall correct classification rates but by affecting classification error rates differently for reference and focal group members. The study also reveals that multiple items with low levels of DIF can be particularly problematic. They can do similar damage to proficiency classification as high-level DIF items with the same cumulative magnitudes but are much harder to detect with current DIF and differential bundle functioning (DBF) techniques. Finally, how DIF affects proficiency classification errors at multiple cut scores is fully described and discussed.
Affiliation(s)
- Logan Rome: University of Wisconsin–Milwaukee, WI, USA
- Bo Zhang: University of Wisconsin–Milwaukee, WI, USA

27. Nye CD, Bradburn J, Olenick J, Bialko C, Drasgow F. How Big Are My Effects? Examining the Magnitude of Effect Sizes in Studies of Measurement Equivalence. Organizational Research Methods 2018. [DOI: 10.1177/1094428118761122]
Affiliation(s)
- Jacob Bradburn: Department of Psychology, Michigan State University, MI, USA
- Jeffrey Olenick: Department of Psychology, Michigan State University, MI, USA
- Fritz Drasgow: Department of Psychology and School of Labor and Employment Relations, University of Illinois, Champaign, IL, USA

28. Adroher ND, Prodinger B, Fellinghauer CS, Tennant A. All metrics are equal, but some metrics are more equal than others: A systematic search and review on the use of the term 'metric'. PLoS One 2018; 13:e0193861. [PMID: 29509813] [PMCID: PMC5839589] [DOI: 10.1371/journal.pone.0193861]
Abstract
OBJECTIVE: To examine the use of the term 'metric' in the health and social sciences literature, focusing on the interval-scale implication of the term in Modern Test Theory (MTT). MATERIALS AND METHODS: A systematic search and review of MTT studies including 'metric' or 'interval scale' was performed in the health and social sciences literature. The search was restricted to 2001-2005 and 2011-2015. A text mining algorithm was employed to operationalize the eligibility criteria and to explore the uses of 'metric'. The paradigm of each included article (Rasch Measurement Theory (RMT), Item Response Theory (IRT), or both), as well as its type (Theoretical, Methodological, Teaching, Application, Miscellaneous), was determined. An inductive thematic analysis of the first three types was performed. RESULTS: 70.6% of the 1337 included articles were allocated to RMT, and 68.4% were application papers. Among the uses of 'metric', it was predominantly a synonym of 'scale'; as an adjective, it referred to measurement or quantification. Three incompatible themes ('only RMT/all MTT/no MTT models can provide interval measures') were identified, but 'interval scale' was mentioned considerably more often in RMT than in IRT. CONCLUSION: 'Metric' is used in many different ways, and there is no consensus on which MTT metric has interval-scale properties. Nevertheless, when using the term 'metric', authors should specify the level of the metric being used (ordinal, ordered, interval, ratio) and justify why, in their view, the metric is at that level.
Affiliation(s)
- Núria Duran Adroher: Swiss Paraplegic Research, Nottwil, Switzerland; Department of Health Sciences and Health Policy, University of Lucerne, Lucerne, Switzerland
- Birgit Prodinger: Swiss Paraplegic Research, Nottwil, Switzerland; Department of Health Sciences and Health Policy, University of Lucerne, Lucerne, Switzerland; Faculty of Applied Health and Social Sciences, University of Applied Sciences Rosenheim, Rosenheim, Germany
- Carolina Saskia Fellinghauer: Swiss Paraplegic Research, Nottwil, Switzerland; Department of Health Sciences and Health Policy, University of Lucerne, Lucerne, Switzerland
- Alan Tennant: Swiss Paraplegic Research, Nottwil, Switzerland; Department of Health Sciences and Health Policy, University of Lucerne, Lucerne, Switzerland

29. Does the 15-item Geriatric Depression Scale function differently in old people with different levels of cognitive functioning? J Affect Disord 2018; 227:471-476. [PMID: 29156360] [DOI: 10.1016/j.jad.2017.11.045]
Abstract
BACKGROUND: The 15-item version of the Geriatric Depression Scale (GDS-15) is widely employed to screen for depression among the elderly, but little is known about how the scale functions in cognitively impaired individuals compared with normal ones. The aim of the current study was to investigate Differential Item Functioning (DIF) across groups of older people that differ in terms of cognitive functioning, applying Item Response Theory (IRT)-based analyses. METHODS: Data from an Italian multi-centric clinical study on cognitive impairment and dementia in old people were employed (N = 1903; age: M = 77.33, SD = 7.05; 62% women). All participants underwent a comprehensive evaluation (including clinical examination, laboratory screening, neuroimaging, and cognitive and behavioral assessments) and were assigned to three different groups on the basis of their cognitive functioning (normal, mild cognitive impairment, cognitive impairment). RESULTS: Two items showed uniform DIF, but their differential functioning does not propagate to the GDS-15 total scores in such a way that a differential interpretation is needed. LIMITATIONS: Whereas an advantage of the study is the large sample size, the relatively small size of the mild cognitive impairment group might reduce the stability of the present results. CONCLUSIONS: Since a screening tool for the elderly is intended to apply to everyone in the target population, the current findings support the clinical utility of the GDS-15 as a screening tool for depression.

30. Chiesi F, Primi C, Pigliautile M, Baroni M, Ercolani S, Boccardi V, Ruggiero C, Mecocci P. Is the 15-item Geriatric Depression Scale a Fair Screening Tool? A Differential Item Functioning Analysis Across Gender and Age. Psychol Rep 2017; 121:1167-1182. [PMID: 29298589] [DOI: 10.1177/0033294117745561]
Abstract
The 15-item version of the Geriatric Depression Scale (GDS-15) is widely employed to assess depression in old people, but it is unclear whether there are biases in the total score depending on respondents' gender and age. In the current study, we investigated the measurement equivalence of the GDS-15 to provide evidence that the test is a fair screening tool when administered to young-old, old-old, and oldest-old men and women. Item Response Theory-based Differential Item Functioning analyses were applied to a large sample of Italian old people. One item exhibited Differential Item Functioning when comparing men and women, and one item showed Differential Item Functioning across age groups. Nonetheless, the magnitude of Differential Item Functioning was small and did not produce any differential test functioning. The gender and age measurement equivalence of the GDS-15 confirms that the test can be used for clinical and research screening purposes.
Affiliation(s)
- Francesca Chiesi, Department of Neuroscience, Psychology, Drug, and Child's Health (NEUROFARBA), Section of Psychology, University of Florence, Italy
- Caterina Primi, Department of Neuroscience, Psychology, Drug, and Child's Health (NEUROFARBA), Section of Psychology, University of Florence, Italy
- Martina Pigliautile, Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Italy
- Marta Baroni, Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Italy
- Sara Ercolani, Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Italy
- Virginia Boccardi, Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Italy
- Carmelinda Ruggiero, Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Italy
- Patrizia Mecocci, Department of Medicine, Institute of Gerontology and Geriatrics, University of Perugia, Italy
31
Bowe AG. Moving Toward More Conclusive Measures of Sociocultural Adaptation for Ethnically Diverse Adolescents in England. CANADIAN JOURNAL OF SCHOOL PSYCHOLOGY 2017. [DOI: 10.1177/0829573517739392]
Abstract
This study is part of a larger initiative toward understanding the acculturation of immigrant adolescents using the Longitudinal Study of Young People in England 2004-2010 database. A necessary first step in using a database for cross-ethnic comparisons is to verify whether its items and scales are equivalent. I examined item- and scale-level differential functioning (DF; n = 4,663, six ethnic minority groups) on four of the database's sociocultural scales: Feelings About School (11 items), Relational Family Efficacy (four items), Being Bullied (five items), and Perceived Teacher Discrimination (four items), using an item response theory (IRT)-based framework. First, findings demonstrated no meaningful DF on items and, in most cases, on scales as well. Second, distinct ethnic group patterns were present. Third, the Perceived Teacher Discrimination scale did not function for the majority of the ethnic minority groups, which is of grave concern. Implications for future comparative studies and immigration policy makers are discussed.
32
Foster GC, Min H, Zickar MJ. Review of Item Response Theory Practices in Organizational Research. ORGANIZATIONAL RESEARCH METHODS 2017. [DOI: 10.1177/1094428116689708]
Abstract
In this article, we review recent psychometric practices to determine how item response theory (IRT) has been used in organizational research. We identified and coded 63 articles that used IRT on empirical data published in industrial-organizational and organizational behavior journals since 2000. Results show that typical IRT usage conforms to best practices in several ways; however, in other ways, such as testing for and reporting model fit, there is still significant room for improvement. Next, we surveyed academic and practitioner members of the Society for Industrial-Organizational Psychology (SIOP) on their experiences with and attitudes toward IRT. We conclude that IRT is one area where practice outpaces science. There is a cadre of practitioners who consider IRT essential to their professional life; for others, however, IRT is seen as less relevant. Based on our coding analyses and survey results, we provide suggestions on how to better incorporate IRT into organizational research and practice.
Affiliation(s)
- Hanyi Min, Bowling Green State University, Bowling Green, OH, USA
33
The Four-Dimensional Symptom Questionnaire (4DSQ) in the general population: scale structure, reliability, measurement invariance and normative data: a cross-sectional survey. Health Qual Life Outcomes 2016; 14:130. [PMID: 27629535] [PMCID: PMC5024427] [DOI: 10.1186/s12955-016-0533-4]
Abstract
BACKGROUND The Four-Dimensional Symptom Questionnaire (4DSQ) is a self-report questionnaire measuring distress, depression, anxiety and somatization with separate scales. The 4DSQ has been extensively validated in clinical samples, especially from primary care settings, but information about measurement properties and normative data in the general population was lacking. In a Dutch general population sample we examined the 4DSQ scales' structure, the scales' reliability and measurement invariance with respect to gender, age and education, the scales' score distributions across demographic categories, and normative data. METHODS 4DSQ data were collected in a representative Dutch Internet panel. Confirmatory factor analysis was used to examine the scales' structure. Reliability was examined by Cronbach's alpha and by coefficients omega-total and omega-hierarchical. Differential item functioning (DIF) analysis was used to evaluate measurement invariance across gender, age and education. RESULTS The total response rate was 82.4% (n = 5273/6399). The depression scale proved to be unidimensional. The other scales were best represented as bifactor models consisting of a large general factor and one or more smaller specific factors. The general factors accounted for more than 95% of the reliable variance of the scales. Reliability was high (≥0.85) by all estimates. The distress, depression and anxiety scales were invariant across gender, age and education. The somatization scale demonstrated some lack of measurement invariance as a result of decreased thresholds for some of the items in young people (16-24 years) and increased thresholds in elderly people (65+ years); it was invariant with regard to gender and education. The 4DSQ scores varied significantly across demographic categories, but the explained variance was small (<6%). Normative data were generated for gender and age categories. Approximately 17% of the participants scored above average on the distress scale, whereas 12% scored above average on the somatization scale. The percentages of people scoring high enough on depression or anxiety to suspect the presence of a depressive or anxiety disorder were 4.1% and 2.5%, respectively. CONCLUSIONS Evidence supports the reliability and measurement invariance of the 4DSQ in the general Dutch population. The normative data provided in this study can be used to compare a subject's 4DSQ scores with a general population reference group.
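As an illustration of the omega coefficients used above, the sketch below computes omega-total and omega-hierarchical from the standardized loadings of a fitted bifactor model; the loadings are invented, and the formulas assume orthogonal general and specific factors.

```python
# Sketch: omega-total and omega-hierarchical from standardized bifactor
# loadings (invented values; general and specific factors assumed orthogonal).
import numpy as np

lam_g = np.array([0.70, 0.65, 0.60, 0.70, 0.55, 0.60])   # general factor
lam_s1 = np.array([0.30, 0.35, 0.30, 0.00, 0.00, 0.00])  # specific factor 1
lam_s2 = np.array([0.00, 0.00, 0.00, 0.20, 0.25, 0.30])  # specific factor 2
unique = 1.0 - lam_g**2 - lam_s1**2 - lam_s2**2          # unique variances

common = lam_g.sum()**2 + lam_s1.sum()**2 + lam_s2.sum()**2
var_total = common + unique.sum()
omega_total = common / var_total
omega_h = lam_g.sum()**2 / var_total   # reliable variance due to the general factor

print(f"omega-total = {omega_total:.3f}, omega-hierarchical = {omega_h:.3f}")
```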
34
Nye CD, Sackett PR. New Effect Sizes for Tests of Categorical Moderation and Differential Prediction. ORGANIZATIONAL RESEARCH METHODS 2016. [DOI: 10.1177/1094428116644505]
Abstract
Moderator hypotheses involving categorical variables are prevalent in organizational and psychological research. Despite their importance, current methods of identifying and interpreting these moderation effects have several limitations that may result in misleading conclusions about their implications. This issue has been particularly salient in the literature on differential prediction, where recent research has suggested that these limitations have had a significant impact on past research. To help address these issues, we propose several new effect size indices that provide additional information about categorical moderation analyses. We then illustrate the advantages of these indices in two large databases of respondents by examining categorical moderation in the prediction of psychological well-being and the extent of differential prediction in a large sample of job incumbents.
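A categorical moderation (differential prediction) analysis of the kind the article builds on typically starts from a regression containing a group main effect (intercept difference) and a group-by-predictor interaction (slope difference). The sketch below shows that baseline analysis on simulated data plus a crude standardized effect size; it is not an implementation of the indices proposed in the article.

```python
# Sketch: baseline differential-prediction analysis (intercept and slope
# differences) on simulated data; requires numpy and statsmodels.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 1000
group = rng.integers(0, 2, n)                        # hypothetical subgroup flag
x = rng.normal(0.0, 1.0, n)                          # predictor (e.g., test score)
y = 0.5 * x + 0.2 * group + rng.normal(0.0, 1.0, n)  # criterion with intercept gap

X = sm.add_constant(np.column_stack([x, group, x * group]))
fit = sm.OLS(y, X).fit()
print(fit.params)       # [intercept, slope, intercept difference, slope difference]
print(fit.pvalues[2:])  # significance tests for the two difference terms

# Crude standardized effect size: intercept gap in criterion SD units.
d_intercept = fit.params[2] / y.std(ddof=1)
print(f"intercept difference (criterion SD units): {d_intercept:.3f}")
```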
Affiliation(s)
- Christopher D. Nye, Department of Psychology, Michigan State University, East Lansing, MI, USA
- Paul R. Sackett, Department of Psychology, University of Minnesota, Minneapolis, MN, USA
35
Do individual differences in test preparation compromise the measurement fairness of admission tests? INTELLIGENCE 2016. [DOI: 10.1016/j.intell.2016.01.004]
36
Teresi JA, Jones RN. Methodological Issues in Examining Measurement Equivalence in Patient Reported Outcomes Measures: Methods Overview to the Two-Part Series, "Measurement Equivalence of the Patient Reported Outcomes Measurement Information System® (PROMIS®) Short Forms". PSYCHOLOGICAL TEST AND ASSESSMENT MODELING 2016; 58:37-78. [PMID: 28983448] [PMCID: PMC5625814]
Abstract
The purpose of this article is to introduce the methods used and challenges confronted by the authors of this two-part series of articles describing the results of analyses of measurement equivalence of the short form scales from the Patient Reported Outcomes Measurement Information System® (PROMIS®). Qualitative and quantitative approaches used to examine differential item functioning (DIF) are reviewed briefly. Qualitative methods focused on the generation of DIF hypotheses. The basic quantitative approaches all rely on a latent variable model and examine parameters derived either directly from item response theory (IRT) or from structural equation models (SEM). A key methodological focus of these articles is to describe state-of-the-art approaches to the examination of measurement equivalence in eight domains: physical health, pain, fatigue, sleep, depression, anxiety, cognition, and social function. These articles represent the first time that DIF has been examined systematically in the PROMIS short form measures, particularly among ethnically diverse groups. This is also the first set of analyses to examine the performance of PROMIS short forms in patients with cancer. Latent variable model state-of-the-art methods for examining measurement equivalence are introduced briefly in this paper to orient readers to the approaches adopted in this set of papers. Several methodological challenges underlying (DIF-free) anchor item selection and model assumption violations are presented as a backdrop for the articles in this two-part series on measurement equivalence of PROMIS measures.
Affiliation(s)
- Jeanne A. Teresi, Weill Cornell Medical College, Division of Geriatrics and Palliative Medicine; Research Division, Hebrew Home at Riverdale; RiverSpring Health
- Richard N. Jones, Department of Psychiatry and Human Behavior, Department of Neurology, Warren Alpert Medical School, Brown University
37
Kleinman M, Teresi JA. Differential item functioning magnitude and impact measures from item response theory models. PSYCHOLOGICAL TEST AND ASSESSMENT MODELING 2016; 58:79-98. [PMID: 28706769] [PMCID: PMC5505278]
Abstract
Measures of the magnitude and impact of differential item functioning (DIF), at the item and scale level respectively, are presented and reviewed in this paper. Most measures are based on item response theory models. Magnitude refers to item-level effect sizes, whereas impact refers to differences between groups at the scale score level. Reviewed are magnitude measures based on group differences in expected item scores and impact measures based on differences in expected scale scores. The similarities among these indices are demonstrated. Various software packages that provide magnitude and impact measures are described, and new software is presented that computes all of the available statistics conveniently in one program, with explanations of their relationships to one another.
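The distinction between item-level magnitude (differences in expected item scores) and scale-level impact (differences in expected scale scores) can be made concrete in a few lines. The sketch below uses invented 2PL parameters, one uniform and one non-uniform DIF item, and averages over a simulated focal-group theta distribution; it illustrates the general idea rather than the specific statistics the software computes.

```python
# Sketch: item-level DIF magnitude vs. scale-level impact for a 10-item 2PL
# scale; parameters are invented (one uniform, one non-uniform DIF item).
import numpy as np

def p2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

a_ref = np.full(10, 1.3)
b_ref = np.linspace(-1.5, 1.5, 10)
a_foc, b_foc = a_ref.copy(), b_ref.copy()
b_foc[2] += 0.6                      # uniform DIF
a_foc[5] *= 0.7                      # non-uniform DIF

theta_foc = np.random.default_rng(2).normal(-0.2, 1.0, 5000)  # focal thetas
diff = (p2pl(theta_foc[:, None], a_foc, b_foc)
        - p2pl(theta_foc[:, None], a_ref, b_ref))

signed = diff.mean(axis=0)            # signed magnitude per item
unsigned = np.abs(diff).mean(axis=0)  # unsigned magnitude per item
impact = diff.sum(axis=1).mean()      # expected total-score difference

print("signed item magnitudes:", np.round(signed, 3))
print("unsigned item magnitudes:", np.round(unsigned, 3))
print(f"scale-level impact: {impact:.3f} raw-score points")
```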
Affiliation(s)
- Marjorie Kleinman, New York State Psychiatric Institute, 1051 Riverside Drive, Unit 72, New York, NY 10032, USA
- Jeanne A. Teresi, New York State Psychiatric Institute; Columbia University Stroud Center; Research Division, Hebrew Home at Riverdale; RiverSpring Health; Department of Geriatrics and Palliative Medicine, Weill Cornell Medical Center
38
Sommer M, Arendasy ME. Further evidence for the deficit account of the test anxiety–test performance relationship from a high-stakes admission testing setting. INTELLIGENCE 2015. [DOI: 10.1016/j.intell.2015.08.007]
39
Wright NA, Kutschenko K, Bush BA, Hannum KM, Braddy PW. Measurement and Predictive Invariance of a Work-Life Boundary Measure Across Gender. INTERNATIONAL JOURNAL OF SELECTION AND ASSESSMENT 2015. [DOI: 10.1111/ijsa.12102]
Affiliation(s)
- Natalie A. Wright, Department of Psychology & Counseling, Valdosta State University, 1500 N. Patterson St, Valdosta, GA 31698, USA
- Bryant A. Bush, Department of Psychology & Counseling, Valdosta State University, 1500 N. Patterson St, Valdosta, GA 31698, USA
40
Nye CD, Allemand M, Gosling SD, Potter J, Roberts BW. Personality Trait Differences Between Young and Middle-Aged Adults: Measurement Artifacts or Actual Trends? J Pers 2015; 84:473-92. [DOI: 10.1111/jopy.12173]
41
Egberink IJL, Meijer RR, Tendeiro JN. Investigating Measurement Invariance in Computer-Based Personality Testing: The Impact of Using Anchor Items on Effect Size Indices. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 2015; 75:126-145. [PMID: 29795815] [PMCID: PMC5965504] [DOI: 10.1177/0013164414520965]
Abstract
A popular method to assess measurement invariance of a particular item is based on likelihood ratio tests with all other items as anchor items. The results of this method are often reported only in terms of statistical significance, and researchers have proposed different methods to empirically select anchor items. It is unclear, however, how many anchor items should be selected and which method provides the "best" results with empirical data. In the present study, we examined the impact of using different numbers of anchor items on effect size indices when investigating measurement invariance on a personality questionnaire in two different assessment situations. Results suggested that the effect size indices were not influenced by the number of anchor items used: the values were comparable across different numbers of anchor items and were small, which indicates that the effect of differential functioning at the item and test level is very small, if not negligible. We conclude with practical implications for the use of anchor items and effect size indices in practice.
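The anchor-based likelihood-ratio procedure referred to above compares a compact model, in which the studied item's parameters are constrained equal across groups (with the anchors constrained throughout), against an augmented model that frees the studied item. Given the two maximized log-likelihoods from whatever IRT package is in use, the test itself is a one-liner; the log-likelihood values below are placeholders, not real output.

```python
# Sketch: likelihood-ratio DIF test given the maximized log-likelihoods of a
# compact model (studied item constrained) and an augmented model (studied
# item freed). Values are placeholders from a hypothetical fit.
from scipy.stats import chi2

loglik_compact = -10234.7     # studied item's parameters equal across groups
loglik_augmented = -10228.1   # a and b freed for the studied item
n_freed = 2                   # number of parameters freed

lr = 2.0 * (loglik_augmented - loglik_compact)
print(f"LR = {lr:.2f} on {n_freed} df, p = {chi2.sf(lr, n_freed):.4f}")
```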
42
Do BR. Research on Unproctored Internet Testing. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE 2015. [DOI: 10.1111/j.1754-9434.2008.01107.x]
43
Tay L, Meade AW, Cao M. An Overview and Practical Guide to IRT Measurement Equivalence Analysis. ORGANIZATIONAL RESEARCH METHODS 2014. [DOI: 10.1177/1094428114553062]
Abstract
This article provides an overview of and guide to implementing item response theory (IRT) measurement equivalence (ME) or differential item functioning (DIF) analysis. We (a) present the need for establishing IRT ME/DIF analysis, (b) discuss the similarities and differences between factor-analytic and IRT ME/DIF analysis, (c) review commonly used IRT ME/DIF indices and procedures, (d) provide three illustrations of two recommended IRT procedures, and (e) furnish recommendations for conducting IRT ME/DIF analysis. We conclude by discussing future directions for IRT ME/DIF research.
Affiliation(s)
- Louis Tay, Purdue University, West Lafayette, IN, USA
- Adam W Meade, North Carolina State University, Raleigh, NC, USA
- Mengyang Cao, University of Illinois at Urbana-Champaign, Champaign, IL, USA
44
Terluin B, Smits N, Miedema B. The English version of the four-dimensional symptom questionnaire (4DSQ) measures the same as the original Dutch questionnaire: a validation study. Eur J Gen Pract 2014; 20:320-6. [PMID: 24779532] [DOI: 10.3109/13814788.2014.905826]
Abstract
BACKGROUND Translations of questionnaires need to be carefully validated to ensure that the translation measures the same construct(s) as the original questionnaire. The four-dimensional symptom questionnaire (4DSQ) is a Dutch self-report questionnaire measuring distress, depression, anxiety and somatization. OBJECTIVE To evaluate the equivalence of the English version of the 4DSQ. METHODS 4DSQ data from English- and Dutch-speaking general practice attendees were analysed and compared. The English-speaking group consisted of 205 general practice attendees in Canada, aged 18-64 years, whereas the Dutch group consisted of 302 general practice attendees in the Netherlands. Differential item functioning (DIF) analysis was conducted using the Mantel-Haenszel method and ordinal logistic regression. Differential test functioning (DTF; i.e., the scale-level impact of DIF) was evaluated using linear regression analysis. RESULTS DIF was detected in 2/16 distress items, 2/6 depression items, 2/12 anxiety items, and 1/16 somatization items. With respect to mean scale scores, the impact of DIF at the scale level was negligible for all scales. On the anxiety scale, DIF caused English-speaking patients with moderate to severe anxiety to score about one point lower than Dutch patients with the same anxiety level. CONCLUSION The English 4DSQ measures the same constructs as the original Dutch 4DSQ. The distress, depression and somatization scales can employ the same cut-off points as the corresponding Dutch scales. However, the cut-off points of the English 4DSQ anxiety scale should be lowered by one point to retain the same meaning as the Dutch anxiety cut-off points.
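Of the two DIF methods used in this study, the Mantel-Haenszel procedure is the easier to sketch: stratify respondents on a matching score, pool the per-stratum 2x2 odds ratios, and convert to the ETS delta metric. A minimal version for one dichotomous item on simulated data follows; a real analysis would also handle sparse strata and polytomous items.

```python
# Sketch: Mantel-Haenszel DIF for one dichotomous item, stratified on a
# matching score; alpha_MH is the pooled odds ratio, delta_MH the ETS metric.
# All data are simulated.
import numpy as np

rng = np.random.default_rng(3)
n = 800
group = rng.integers(0, 2, n)                     # 0 = reference, 1 = focal
theta = rng.normal(0.0, 1.0, n)
strata = np.clip(np.round(theta * 3 + 8), 0, 15)  # hypothetical matching score
item = (theta - 0.3 * group + rng.logistic(0.0, 1.0, n) > 0).astype(int)

num = den = 0.0
for s in np.unique(strata):                       # one 2x2 table per stratum
    m = strata == s
    a = np.sum((group[m] == 0) & (item[m] == 1))  # reference, endorsed
    b = np.sum((group[m] == 0) & (item[m] == 0))  # reference, not endorsed
    c = np.sum((group[m] == 1) & (item[m] == 1))  # focal, endorsed
    d = np.sum((group[m] == 1) & (item[m] == 0))  # focal, not endorsed
    t = a + b + c + d
    if t > 0:
        num += a * d / t
        den += b * c / t

alpha_mh = num / den
delta_mh = -2.35 * np.log(alpha_mh)               # ETS delta scale
print(f"alpha_MH = {alpha_mh:.3f}, delta_MH = {delta_mh:.3f}")
```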
Affiliation(s)
- Berend Terluin, Department of General Practice and Elderly Care Medicine & EMGO Institute for Health and Care Research, VU University Medical Centre, Amsterdam, the Netherlands
45
Retelsdorf J, Bauer J, Gebauer SK, Kauper T, Möller J. Erfassung berufsbezogener Selbstkonzepte von angehenden Lehrkräften (ERBSE-L) [Assessing the professional self-concepts of prospective teachers]. DIAGNOSTICA 2014. [DOI: 10.1026/0012-1924/a000108]
Abstract
This article presents an instrument for the multidimensional assessment of the professional self-concepts of prospective teachers (ERBSE-L). In a first study with N = 484 preservice teachers, exploratory factor analyses extracted the self-concept dimensions subject matter, innovation, media, diagnostics, education, and counseling. In a second study, this factorial structure was replicated by means of confirmatory factor analyses in a sample of N = 5,802 preservice teachers. In both studies, the six dimensions showed sufficient internal consistencies (α ≥ .71). Moreover, the assumption of measurement invariance of the self-concept dimensions was supported across gender, Gymnasium vs. non-Gymnasium teaching degree, stage of study (beginning vs. advanced students), and over time. Results on expected mean differences between the genders and between the teaching degrees, as well as on associations of the self-concept dimensions with career choice motivation and academic achievement, provide further evidence for the validity of the scales. Overall, the ERBSE-L proved to be a promising instrument for assessing multiple dimensions of prospective teachers' professional self-concept.
46
DuVernet AM, Wright NA, Meade AW, Coughlin C, Kantrowitz TM. General Mental Ability as a Source of Differential Functioning in Personality Scales. ORGANIZATIONAL RESEARCH METHODS 2014. [DOI: 10.1177/1094428114525996]
Abstract
Despite pervasive evidence that general mental ability and personality are unrelated, we investigated whether general mental ability may affect the response process associated with personality measurement. Study 1 examined a large sample of job applicant responses to four personality scales for differential functioning across groups of differing general mental ability. While results indicated that personality items differentially function across highly disparate general mental ability groups, there was little evidence of differential functioning across groups with similar levels of general mental ability. Study 2 replicated these findings in a different sample, using a different measure of general mental ability. We posit that observed differences in the psychometric properties of these personality scales are likely due to the information processing capabilities of the respondents. Additionally, we describe how differential functioning analyses can be used during scale development as a method of identifying items that are not appropriate for all intended respondents. In so doing, we demonstrate procedures for examining other construct-measurement interactions in which respondents’ standings on a specific construct could influence their interpretation of and response to items assessing other constructs.
Affiliation(s)
- Natalie A. Wright, Department of Psychology and Counseling, Valdosta State University, Valdosta, GA, USA
- Adam W. Meade, Department of Psychology, North Carolina State University, Raleigh, NC, USA
47
Janulis P. Improving measurement of injection drug risk behavior using item response theory. THE AMERICAN JOURNAL OF DRUG AND ALCOHOL ABUSE 2013; 40:143-50. [PMID: 24266632] [DOI: 10.3109/00952990.2013.848212]
Abstract
BACKGROUND Recent research highlights the multiple steps involved in preparing and injecting drugs and the resultant viral threats faced by drug users, suggesting that more sensitive measurement of injection drug HIV risk behavior is required. In addition, growing evidence suggests there are gender differences in injection risk behavior; however, the potential for differential item functioning between genders has not been explored. OBJECTIVES To explore item response theory as an improved measurement modeling technique that provides empirically justified scaling of injection risk behavior, and to examine potential gender-based differential item functioning. METHODS Data come from three studies in the National Institute on Drug Abuse's Criminal Justice Drug Abuse Treatment Studies. A two-parameter item response theory model was used to scale injection risk behavior, and logistic regression was used to test for differential item functioning. RESULTS Item fit statistics suggest that item response theory can be used to scale injection risk behavior, and these models can provide more sensitive estimates of risk behavior. Additionally, gender-based differential item functioning is present in the current data. CONCLUSION Improved measurement of injection risk behavior using item response theory should be encouraged, as these models provide increased congruence between construct measurement and the complexity of injection-related HIV risk. Suggestions are made to further improve injection risk behavior measurement. Furthermore, results suggest that direct comparisons of composite scores between males and females may be misleading, and future work should account for differential item functioning before comparing levels of injection risk behavior.
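Scaling risk behavior with a two-parameter model, as done here, weights each behavior by its own discrimination and severity instead of simply counting endorsements. The sketch below computes an EAP (expected a posteriori) scale score for one response pattern under invented 2PL parameters; it illustrates the general scoring idea, not the study's fitted model.

```python
# Sketch: EAP scale score under a 2PL model for a short risk-behavior
# checklist; item discriminations and severities are invented.
import numpy as np

a = np.array([1.8, 1.2, 0.9, 1.5, 1.1])    # discrimination per behavior
b = np.array([-0.5, 0.3, 1.0, 1.6, 2.2])   # severity (location) per behavior
resp = np.array([1, 1, 0, 1, 0])           # one person's endorsed behaviors

quad = np.linspace(-4.0, 4.0, 81)          # quadrature grid over theta
prior = np.exp(-0.5 * quad**2)             # standard normal prior (unnormalized)
p = 1.0 / (1.0 + np.exp(-a[:, None] * (quad[None, :] - b[:, None])))
like = np.prod(np.where(resp[:, None] == 1, p, 1.0 - p), axis=0)

post = like * prior
eap = np.sum(quad * post) / np.sum(post)   # posterior mean of theta
print(f"EAP risk score: {eap:.3f}")
```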
Affiliation(s)
- Patrick Janulis, Department of Psychology, Michigan State University, East Lansing, MI, USA
48
Berry CM, Barratt CL, Dovalina CL, Zhao P. Can racial/ethnic subgroup criterion-to-test standard deviation ratios account for conflicting differential validity and differential prediction evidence for cognitive ability tests? JOURNAL OF OCCUPATIONAL AND ORGANIZATIONAL PSYCHOLOGY 2013. [DOI: 10.1111/joop.12036]
Affiliation(s)
- Clare L. Barratt, Department of Psychology, Texas A&M University, College Station, Texas, USA
- Peng Zhao, Department of Psychology, Texas A&M University, College Station, Texas, USA
49
Scherbaum CA, Sabet J, Kern MJ, Agnello P. Examining faking on personality inventories using unfolding item response theory models. J Pers Assess 2012; 95:207-16. [PMID: 23030769] [DOI: 10.1080/00223891.2012.725439]
Abstract
A concern about personality inventories in diagnostic and decision-making contexts is that individuals will fake. Although there is extensive research on faking, little research has focused on how perceptions of personality items change when individuals are faking or responding honestly. This research demonstrates how the delta parameter from the generalized graded unfolding item response theory model can be used to examine how individuals' perceptions about personality items might change when responding honestly or when faking. The results indicate that perceptions changed from honest to faking conditions for several neuroticism items. The direction of the change varied, indicating that faking can operate to increase or decrease scores within a personality factor.
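The delta parameter examined in this study is the item location in the generalized graded unfolding model (GGUM), where agreement peaks when the respondent's theta is close to delta rather than rising monotonically. Below is a minimal sketch of the GGUM category probabilities; the parameters are invented, and the parameterization follows the commonly cited Roberts, Donoghue, and Laughlin (2000) form, so treat it as an assumption-laden illustration rather than a reference implementation.

```python
# Sketch: GGUM category probabilities for one item; parameters are invented.
import numpy as np

def ggum_probs(theta, alpha, delta, tau):
    """P(Z = z | theta) for z = 0..C, where tau holds thresholds tau_1..tau_C
    (tau_0 = 0 is prepended) and M = 2C + 1 subjective response categories."""
    C = len(tau)
    M = 2 * C + 1
    cum_tau = np.cumsum(np.concatenate([[0.0], tau]))   # sum_{k=0}^{z} tau_k
    z = np.arange(C + 1)
    num = (np.exp(alpha * (z * (theta - delta) - cum_tau))
           + np.exp(alpha * ((M - z) * (theta - delta) - cum_tau)))
    return num / num.sum()

alpha, delta = 1.2, 0.5                    # discrimination and item location
tau = np.array([-1.0, -0.5, -0.2])         # thresholds -> 4 response categories
for theta in (-1.0, 0.5, 2.0):             # agreement peaks near theta = delta
    print(theta, np.round(ggum_probs(theta, alpha, delta, tau), 3))
```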
Affiliation(s)
- Charles A Scherbaum, Department of Psychology, Baruch College, City University of New York, New York, NY 10010, USA
50
Teresi JA, Ramirez M, Jones RN, Choi S, Crane PK. Modifying measures based on differential item functioning (DIF) impact analyses. J Aging Health 2012; 24:1044-76. [PMID: 22422759] [PMCID: PMC4030595] [DOI: 10.1177/0898264312436877]
Abstract
OBJECTIVE Measure modification can affect the comparability of scores across groups and settings; changes in items can affect the percentage of respondents admitting to a symptom. METHODS Using item response theory (IRT) methods, well-calibrated items can be used interchangeably, and the exact same item does not have to be administered to each respondent, theoretically permitting wider latitude for modification. RESULTS Recommendations regarding modifications vary depending on the use of the measure. In the context of research, adjustments can be made at the analytic level by freeing and fixing parameters based on findings of differential item functioning (DIF). The consequences of DIF for clinical decision making depend on whether or not the patient's performance level approaches the scale decision cutpoint. High-stakes testing may require item removal or separate calibrations to ensure accurate assessment. DISCUSSION Guidelines for modification based on DIF analyses and illustrations of the impact of adjustments are presented.