1
|
Gilbert JB. Modeling item-level heterogeneous treatment effects: A tutorial with the glmer function from the lme4 package in R. Behav Res Methods 2024; 56:5055-5067. [PMID: 38030928 DOI: 10.3758/s13428-023-02245-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/16/2023] [Indexed: 12/01/2023]
Abstract
Recent advancements in education scholarship have introduced Item Response Theory (IRT) models to address treatment heterogeneity at the assessment item level. These models for item-level heterogeneous treatment effects (IL-HTE) enable detailed analyses of treatments that may have varying impacts on individual items within an assessment. This article offers a comprehensive tutorial for applied researchers interested in implementing IL-HTE analysis in R, utilizing the lme4 package. Using empirical data from a second-grade reading comprehension assessment as a running example, this tutorial emphasizes model-building strategies, interpretation techniques, visualization methods, and extensions. By following this tutorial, researchers will gain practical insights into utilizing IL-HTE analysis for enhanced understanding and interpretation of treatment effects at the item level.
Collapse
Affiliation(s)
- Joshua B Gilbert
- Harvard Graduate School of Education, 13 Appian Way, Cambridge, 02138, MA, USA.
| |
Collapse
|
2
|
Huang S, Luo J(J, Cai L. An Explanatory Multidimensional Random Item Effects Rating Scale Model. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 2023; 83:1229-1248. [PMID: 37974656 PMCID: PMC10638980 DOI: 10.1177/00131644221140906] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2023]
Abstract
Random item effects item response theory (IRT) models, which treat both person and item effects as random, have received much attention for more than a decade. The random item effects approach has several advantages in many practical settings. The present study introduced an explanatory multidimensional random item effects rating scale model. The proposed model was formulated under a novel parameterization of the nominal response model (NRM), and allows for flexible inclusion of person-related and item-related covariates (e.g., person characteristics and item features) to study their impacts on the person and item latent variables. A new variant of the Metropolis-Hastings Robbins-Monro (MH-RM) algorithm designed for latent variable models with crossed random effects was applied to obtain parameter estimates for the proposed model. A preliminary simulation study was conducted to evaluate the performance of the MH-RM algorithm for estimating the proposed model. Results indicated that the model parameters were well recovered. An empirical data set was analyzed to further illustrate the usage of the proposed model.
Collapse
Affiliation(s)
| | | | - Li Cai
- University of California, Los Angeles, USA
| |
Collapse
|
3
|
Aßmann C, Gaasch JC, Stingl D. A Bayesian Approach Towards Missing Covariate Data in Multilevel Latent Regression Models. PSYCHOMETRIKA 2023; 88:1495-1528. [PMID: 36418780 PMCID: PMC10656345 DOI: 10.1007/s11336-022-09888-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/29/2022] [Accepted: 09/20/2022] [Indexed: 06/16/2023]
Abstract
The measurement of latent traits and investigation of relations between these and a potentially large set of explaining variables is typical in psychology, economics, and the social sciences. Corresponding analysis often relies on surveyed data from large-scale studies involving hierarchical structures and missing values in the set of considered covariates. This paper proposes a Bayesian estimation approach based on the device of data augmentation that addresses the handling of missing values in multilevel latent regression models. Population heterogeneity is modeled via multiple groups enriched with random intercepts. Bayesian estimation is implemented in terms of a Markov chain Monte Carlo sampling approach. To handle missing values, the sampling scheme is augmented to incorporate sampling from the full conditional distributions of missing values. We suggest to model the full conditional distributions of missing values in terms of non-parametric classification and regression trees. This offers the possibility to consider information from latent quantities functioning as sufficient statistics. A simulation study reveals that this Bayesian approach provides valid inference and outperforms complete cases analysis and multiple imputation in terms of statistical efficiency and computation time involved. An empirical illustration using data on mathematical competencies demonstrates the usefulness of the suggested approach.
Collapse
Affiliation(s)
- Christian Aßmann
- Leibniz Institute for Educational Trajectories Bamberg, Bamberg, Germany
- Otto-Friedrich-Universität Bamberg, Bamberg, Germany
| | | | - Doris Stingl
- Otto-Friedrich-Universität Bamberg, Bamberg, Germany.
| |
Collapse
|
4
|
Mair P, Gruber K. Bayesian explanatory additive IRT models. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2022; 75:59-87. [PMID: 34089620 DOI: 10.1111/bmsp.12245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 03/01/2021] [Indexed: 06/12/2023]
Abstract
In this article we extend the framework of explanatory mixed IRT models to a more general class called explanatory additive IRT models. We do this by augmenting the linear predictors in terms of smooth functions. This development offers many new modeling options such as the inclusion of nonlinear covariate effects, the specification of various temporal and spatial dependency patterns, and parameter partitioning across covariates. We use integrated nested Laplace approximation (INLA) for accurate and computationally efficient estimation of the parameters. Uninformative, weakly informative, and informative prior settings for the hyperparameters are discussed. Running time experiments and Monte Carlo parameter recovery simulations are performed in order to study the accuracy and computational efficiency of INLA when applied to the proposed explanatory additive IRT model class. Using a real-life dataset, a variety of application scenarios is explored, and the results are compared with classical maximum likelihood estimation when possible. R code is included in the supplemental materials to allow readers to fully reproduce the examples computed in the paper.
Collapse
Affiliation(s)
- Patrick Mair
- Harvard University, Cambridge, Massachusetts, USA
| | | |
Collapse
|
5
|
Cremaschi A, De Iorio M, Seng Chong Y, Broekman B, Meaney MJ, Kee MZL. A Bayesian nonparametric approach to dynamic item-response modeling: An application to the GUSTO cohort study. Stat Med 2021; 40:6021-6037. [PMID: 34412151 PMCID: PMC9546363 DOI: 10.1002/sim.9167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 08/03/2021] [Accepted: 08/03/2021] [Indexed: 11/08/2022]
Abstract
Statistical analysis of questionnaire data is often performed employing techniques from item-response theory. In this framework, it is possible to differentiate respondent profiles and characterize the questions (items) included in the questionnaire via interpretable parameters. These models are often crosssectional and aim at evaluating the performance of the respondents. The motivating application of this work is the analysis of psychometric questionnaires taken by a group of mothers at different time points and by their children at one later time point. The data are available through the GUSTO cohort study. To this end, we propose a Bayesian semiparametric model and extend the current literature by: (i) introducing temporal dependence among questionnaires taken at different time points; (ii) jointly modeling the responses to questionnaires taken from different, but related, groups of subjects (in our case mothers and children), introducing a further dependency structure and therefore sharing of information; (iii) allowing clustering of subjects based on their latent response profile. The proposed model is able to identify three main groups of mother/child pairs characterized by their response profiles. Furthermore, we report an interesting maternal reporting bias effect strongly affecting the clustering structure of the mother/child dyads.
Collapse
Affiliation(s)
- Andrea Cremaschi
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Brenner Centre for Molecular Medicine, Singapore
| | - Maria De Iorio
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Brenner Centre for Molecular Medicine, Singapore.,Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.,Division of Science, Yale-NUS College, Singapore, Singapore.,Department of Statistical Science, University College London, London, UK
| | - Yap Seng Chong
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Brenner Centre for Molecular Medicine, Singapore.,Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Birit Broekman
- Department of Psychiatry, VU Medical Centre, Amsterdam, the Netherlands
| | - Michael J Meaney
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Brenner Centre for Molecular Medicine, Singapore.,Department of Psychiatry, Douglas Mental Health University Research Institute, McGill University, Montreal, Quebec, Canada
| | - Michelle Z L Kee
- Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology and Research (A*STAR), Brenner Centre for Molecular Medicine, Singapore
| |
Collapse
|
6
|
Resnik L, Borgia M, Cancio JM, Delikat J, Ni P. Psychometric evaluation of the Southampton hand assessment procedure (SHAP) in a sample of upper limb prosthesis users. J Hand Ther 2021; 36:110-120. [PMID: 34400030 DOI: 10.1016/j.jht.2021.07.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 06/22/2021] [Accepted: 07/04/2021] [Indexed: 02/09/2023]
Abstract
BACKGROUND The 26-item Southampton Hand Assessment Protocol (SHAP) is a test of prosthetic hand function that generates an Index of Functionality (IOF), and prehensile pattern (PP) scores. Prior researchers identified potential issues in SHAP scoring, proposing alternative scoring methods (LIF and W-LIF). STUDY DESIGN Cross-sectional study. PURPOSE Evaluate the psychometric properties of the SHAP IOF, LIF, and W-LIF and PP scores and develop the Prosthesis Index of Functionality (P-IOF). METHODS We examined item completion, floor andceiling effects, concurrent, discriminant, construct and structural validity. The P-IOF used increased boundary limits and information from item completion and completion time. Calibration used a nonlinear mixed model. Scores were estimated using maximum a posteriori Bayesian estimation. Mixed integer linear programing (MILP) informed development of a shorter measure. Validity analyses were repeated using the P-IOF. RESULTS 126 persons, mean age 57 (sd 15.8), 69% with transradial amputation were included. Floors effects were observed in 18.3%-19.1% for the IOF, LIF, and W-LIF. Ten items were not completed by >15% of participants. Boundary limits were problematic for all but 1 item. Correlations with dexterity measures were strong (r = 0.54-0.73). Scores differed by amputation level (p > .0001). Factor analysis did not support use of PP scores. The P-IOF used expanded boundary limits to decrease floor effects. MILP identified 10 items that could be dropped. The 26-item P-IOF and 16-item P-IOF had reduced floor effects (<7.5%), strong evidence of concurrent and discriminant validity, and construct validity. P-IOF reduced administrative burden by 9.5 (sd 5.6) minutes. DISCUSSION Floor effects limit a measure's ability to distinguish between persons with low function. CONCLUSION Analyses supported the validity of the SHAP IOF, LIF, and W-LIF, but identified large floor effects, as well as issues with structural validity of the PP scores. The 16-item P-IOF minimizes floor effects and reduces administrative burden.
Collapse
Affiliation(s)
- Linda Resnik
- Providence VA Medical Center, Providence, RI, USA; Health Services, Policy and Practice, Brown University, Providence, RI, USA.
| | | | - Jill M Cancio
- United States Army Institute of Surgical Research Burn Center, JBSA Ft. Sam Houston, TX, USA
| | | | | |
Collapse
|
7
|
Reinstein I, Hill J, Cook DA, Lineberry M, Pusic MV. Multi-level longitudinal learning curve regression models integrated with item difficulty metrics for deliberate practice of visual diagnosis: groundwork for adaptive learning. ADVANCES IN HEALTH SCIENCES EDUCATION : THEORY AND PRACTICE 2021; 26:881-912. [PMID: 33646468 DOI: 10.1007/s10459-021-10027-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Accepted: 01/07/2021] [Indexed: 06/12/2023]
Abstract
Visual diagnosis of radiographs, histology and electrocardiograms lends itself to deliberate practice, facilitated by large online banks of cases. Which cases to supply to which learners in which order is still to be worked out, with there being considerable potential for adapting the learning. Advances in statistical modeling, based on an accumulating learning curve, offer methods for more effectively pairing learners with cases of known calibrations. Using demonstration radiograph and electrocardiogram datasets, the advantages of moving from traditional regression to multilevel methods for modeling growth in ability or performance are demonstrated, with a final step of integrating case-level item-response information based on diagnostic grouping. This produces more precise individual-level estimates that can eventually support learner adaptive case selection. The progressive increase in model sophistication is not simply statistical but rather brings the models into alignment with core learning principles including the importance of taking into account individual differences in baseline skill and learning rate as well as the differential interaction with cases of varying diagnosis and difficulty. The developed approach can thus give researchers and educators a better basis on which to anticipate learners' pathways and individually adapt their future learning.
Collapse
Affiliation(s)
- Ilan Reinstein
- Institute for Innovations in Medical Education, NYU Grossman School of Medicine, 550 First Avenue, MSB G109, New York, NY, 10016, USA
| | - Jennifer Hill
- Department of Applied Statistics, Social Science, and the Humanities, New York University, New York, NY, USA
| | - David A Cook
- Department of Medicine, Office of Applied Scholarship and Education Science, School of Continuous Professional Development, Mayo Clinic College of Medicine and Science, Rochester, MN, USA
| | - Matthew Lineberry
- Zamierowksi Institute for Experiential Learning, University of Kansas Medical Center, Kansas City, KS, USA
| | - Martin V Pusic
- Institute for Innovations in Medical Education, NYU Grossman School of Medicine, 550 First Avenue, MSB G109, New York, NY, 10016, USA.
- Department of Emergency Medicine, NYU Grossman School of Medicine, New York, NY, USA.
| |
Collapse
|
8
|
Daadi BE, Latacz-Lohmann U. Assessing farmers' attitudes to, and the behavioural costs of, organic fertiliser practices in northern Ghana: An application of the behavioural cost approach. Heliyon 2021; 7:e07312. [PMID: 34222688 PMCID: PMC8243018 DOI: 10.1016/j.heliyon.2021.e07312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 03/22/2021] [Accepted: 06/10/2021] [Indexed: 11/25/2022] Open
Abstract
The use of organic fertiliser to improve soil health is crucial to halting the downward trend of crop yields in sub-Saharan Africa. If this goal is to be achieved, however, farmers require support to adopt organic fertiliser practices that match their attitudes and decision-making capacity. This study evaluated farmers' attitudes to a set of prevailing organic fertiliser practices and their associated behavioural costs (difficulty). The explanatory Rasch model was applied to a set of primary data from 250 farming households in north-east Ghana. The results showed that the average attitude of farmers was much less than the difficulty estimate of an average organic fertiliser practice, although the practices generally showed a moderate difficulty. On average, farmers' attitudes matched just three of sixteen practices on the scale, with most (70 %) of the farmers showing very weak attitudes towards the input. Latent regression results revealed that the weak attitude levels were strongly related to key factors in the farmers' background, including education, resource endowment and access to extension services. Participation in determining policies on organic fertiliser use enhances farmers' knowledge and skills concerning use of the input. Hence, access to such policies can replace education for the less-educated majority of farmers. Thus, training programmes are proposed that develop the average farmer's capacity to adopt these practices in this area, especially the less difficult ones. Supporting farmers with the acquisition of animal-drawn vehicles can also facilitate uptake of the more difficult organic fertiliser practices and increase use of the input.
Collapse
Affiliation(s)
- Bunbom Edward Daadi
- Department of Farm Management & Production Economics, Institute of Agricultural Economics (Institut für Agrarökonomie), Christian-Albrechts University, 24098 Kiel, Germany
| | - Uwe Latacz-Lohmann
- Department of Farm Management & Production Economics, Institute of Agricultural Economics (Institut für Agrarökonomie), Christian-Albrechts University, 24098 Kiel, Germany.,The School of Agriculture, University of Western Australia, Australia
| |
Collapse
|
9
|
Bacci S, Fabbricatore R, Iannario M. Latent trait models for perceived risk assessment using a Covid-19 data survey. J Appl Stat 2021; 50:2575-2598. [PMID: 37529576 PMCID: PMC10388822 DOI: 10.1080/02664763.2021.1937584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 05/28/2021] [Indexed: 10/21/2022]
Abstract
Aim of the contribution is analyzing potential events that may negatively impact individuals, assets, and/or the environment, and making judgments about the perceived personal and social riskiness of Covid-19 compared to other hazards belonging to health (AIDS, cancer, infarction), environmental (climate change), behavioral (serious car accidents), and technological (nuclear weapons) domains. The comparative risk analysis has been performed on a survey data collected during the first Italian Covid-19 lockdown. An item response theory model for polytomously scored items has been implemented for the analysis of the positioning of Covid-19 with respect to the other hazards in terms of perceived risk. Among the attributes determining the hazard's perceived risk, Covid-19 distinguishes for the knowledge of risks from the hazard, media attention, and fear caused by the hazard in the peers. Besides, through a latent regression analysis, the role of some individual characteristics on the perceived risk for Covid-19 has been examined. Our contribution allows us to disentangle among several aspects of hazards and describe the main factors affecting the perceived risk. It also contributes to determine if existing control measures are perceived as adequate and the interest for new media with related impact on a person's reaction.
Collapse
Affiliation(s)
- S. Bacci
- Department of Statistics, Computer Science, Applications “G. Parenti”, University of Florence, Firenze, Italy
| | - R. Fabbricatore
- Department of Social Sciences, University of Naples Federico II, Napoli, Italy
| | - Maria Iannario
- Department of Political Sciences, University of Naples Federico II, Napoli, Italy
| |
Collapse
|
10
|
Reis Costa D, Bolsinova M, Tijmstra J, Andersson B. Improving the Precision of Ability Estimates Using Time-On-Task Variables: Insights From the PISA 2012 Computer-Based Assessment of Mathematics. Front Psychol 2021; 12:579128. [PMID: 33815190 PMCID: PMC8017127 DOI: 10.3389/fpsyg.2021.579128] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 02/22/2021] [Indexed: 11/13/2022] Open
Abstract
Log-file data from computer-based assessments can provide useful collateral information for estimating student abilities. In turn, this can improve traditional approaches that only consider response accuracy. Based on the amounts of time students spent on 10 mathematics items from the PISA 2012, this study evaluated the overall changes in and measurement precision of ability estimates and explored country-level heterogeneity when combining item responses and time-on-task measurements using a joint framework. Our findings suggest a notable increase in precision with the incorporation of response times and indicate differences between countries in how respondents approached items as well as in their response processes. Results also showed that additional information could be captured through differences in the modeling structure when response times were included. However, such information may not reflect the testing objective.
Collapse
Affiliation(s)
- Denise Reis Costa
- Centre for Educational Measurement, Faculty of Educational Sciences, University of Oslo, Oslo, Norway
| | - Maria Bolsinova
- Methodology and Statistics, Tilburg University, Tilburg, Netherlands
| | - Jesper Tijmstra
- Methodology and Statistics, Tilburg University, Tilburg, Netherlands
| | - Björn Andersson
- Centre for Educational Measurement, Faculty of Educational Sciences, University of Oslo, Oslo, Norway
| |
Collapse
|
11
|
Campos-Castillo C, Anthony D. Racial and ethnic differences in self-reported telehealth use during the COVID-19 pandemic: a secondary analysis of a US survey of internet users from late March. J Am Med Inform Assoc 2021; 28:119-125. [PMID: 32894772 PMCID: PMC7499625 DOI: 10.1093/jamia/ocaa221] [Citation(s) in RCA: 101] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 08/28/2020] [Indexed: 12/15/2022] Open
Abstract
Objective Widespread technological changes, like the rapid uptake of telehealth in the US during the COVID-19 pandemic, risk creating or widening racial/ethnic disparities. We conducted a secondary analysis of a cross-sectional, nationally representative survey of internet users to evaluate whether there were racial/ethnic disparities in self-reported telehealth use early in the pandemic. Materials and Methods The Pew Research Center fielded the survey March 19–24, 2020. Telehealth use because of the pandemic was measured by asking whether respondents (N = 10 624) “used the internet or e-mail to connect with doctors or other medical professionals as a result of the coronavirus outbreak.” We conducted survey-weighted logistic regressions, adjusting for respondents’ socioeconomic characteristics and perceived threat of the pandemic to their own health (eg, no threat, minor, major). Results Approximately 17% of respondents reported using telehealth because of the pandemic, with significantly higher unadjusted odds among Blacks, Latinos, and those identified with other race compared to White respondents. The multivariable logistic regressions and sensitivity analyses show Black respondents were more likely than Whites to report using telehealth because of the pandemic, particularly when perceiving the pandemic as a minor threat to their own health. Discussion Black respondents are most likely to report using telehealth because of the COVID-19 pandemic, particularly when they perceive the pandemic as a minor health threat. Conclusion The systemic racism creating health and health care disparities has likely raised the need for telehealth among Black patients during the pandemic. Findings suggest opportunities to leverage a broadly defined set of telehealth tools to reduce health care disparities postpandemic.
Collapse
Affiliation(s)
| | - Denise Anthony
- Department of Health Management & Policy, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
12
|
Smits DJM, De Boeck P, Vansteelandt K. The inhibition of verbally aggressive behaviour. EUROPEAN JOURNAL OF PERSONALITY 2020. [DOI: 10.1002/per.529] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We studied the inhibition of verbal aggression, defined as not displaying verbal aggression when one would want to. The approach we used was based on a situation–response questionnaire containing 15 anger provoking situations and three verbally aggressive responses. Two questions were asked for each combination of a situation and a response: one about wanting to react in a verbally aggressive way and one about actually displaying the reaction. This questionnaire was administered to 316 participants. Based on different theories about inhibition, several logistic mixed models were constructed and tested against each other. In the best fitting model, inhibition was conceptualized as a trait. Trait inhibition was negatively correlated with external measures of Anger Out and positively with Control of Anger Out. Copyright © 2004 John Wiley & Sons, Ltd.
Collapse
|
13
|
Measurement of Inter-Individual Variability in Assessing the Quality of Life in Respondents with Celiac Disease. PSYCH 2020. [DOI: 10.3390/psych2040023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Quality of life of Celiac Disease (CD) patients is affected by constraints in their physical, social and emotional behaviour. Our objective is to assess differences in two relevant dimensions of the Celiac Quality of Life (CQoL) scale, Limitations due to the disease and Dysphoria (i.e., feelings of depression and discomfort), in relation to the perceived social support and some individual and disease-related characteristics. The paper exploits suitable unidimensional Item Response Theory (IRT) models to individually analyse the two mentioned dimensions of the CQoL and Multidimensional Latent Class IRT models for ordinal polytomous items in order to detect sub-populations of CD patients that are homogenous with respect to the perceived CQoL. The latter methods allow to address patients with similar characteristics to the same treatment, performing at the same time a more tailored overture to health promotion programmes. The analysis extracts the relevant patterns and relations among CD patients, disentangling respondents receiving CD diagnosis in adolescence or adult age rather than in childhood (the first perceive high levels of Limitations and Dysphoria), patients with high perceived social support, a factor influencing in a positive way motivation to engage in management of CD-related distress and psychological well-being, and participants who are married or cohabiting. The latter report higher latent trait levels.
Collapse
|
14
|
Kim J, Wilson M. Polytomous Item Explanatory Item Response Theory Models. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 2020; 80:726-755. [PMID: 32616956 PMCID: PMC7307487 DOI: 10.1177/0013164419892667] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This study investigates polytomous item explanatory item response theory models under the multivariate generalized linear mixed modeling framework, using the linear logistic test model approach. Building on the original ideas of the many-facet Rasch model and the linear partial credit model, a polytomous Rasch model is extended to the item location explanatory many-facet Rasch model and the step difficulty explanatory linear partial credit model. To demonstrate the practical differences between the two polytomous item explanatory approaches, two empirical studies examine how item properties explain and predict the overall item difficulties or the step difficulties each in the Carbon Cycle assessment data and in the Verbal Aggression data. The results suggest that the two polytomous item explanatory models are methodologically and practically different in terms of (a) the target difficulty parameters of polytomous items, which are explained by item properties; (b) the types of predictors for the item properties incorporated into the design matrix; and (c) the types of item property effects. The potentials and methodological advantages of item explanatory modeling are discussed as well.
Collapse
Affiliation(s)
- Jinho Kim
- University of California at Berkeley,
Berkeley, CA, USA
- KU Leuven and ITEC, imec research group
at KU Leuven, Kortrijk, Belgium
| | - Mark Wilson
- University of California at Berkeley,
Berkeley, CA, USA
| |
Collapse
|
15
|
Mundt D, Abel R, Hänze M. Exploring the effect of testing on forgetting in vocabulary learning: an examination of the bifurcation model. JOURNAL OF COGNITIVE PSYCHOLOGY 2020. [DOI: 10.1080/20445911.2020.1733584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Daria Mundt
- Human Sciences, Department of Psychology, University of Kassel, Kassel, Germany
| | - Roman Abel
- Human Sciences, Department of Psychology, University of Kassel, Kassel, Germany
| | - Martin Hänze
- Human Sciences, Department of Psychology, University of Kassel, Kassel, Germany
| |
Collapse
|
16
|
Belzak WCM, Bauer DJ. Improving the assessment of measurement invariance: Using regularization to select anchor items and identify differential item functioning. Psychol Methods 2020; 25:673-690. [PMID: 31916799 DOI: 10.1037/met0000253] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
A common challenge in the behavioral sciences is evaluating measurement invariance, or whether the measurement properties of a scale are consistent for individuals from different groups. Measurement invariance fails when differential item functioning (DIF) exists, that is, when item responses relate to the latent variable differently across groups. To identify DIF in a scale, many data-driven procedures iteratively test for DIF one item at a time while assuming other items have no DIF. The DIF-free items are used to anchor the scale of the latent variable across groups, identifying the model. A major drawback to these iterative testing procedures is that they can fail to select the correct anchor items and identify true DIF, particularly when DIF is present in many items. We propose an alternative method for selecting anchors and identifying DIF. Namely, we use regularization, a machine learning technique that imposes a penalty function during estimation to remove parameters that have little impact on the fit of the model. We focus specifically here on a lasso penalty for group differences in the item parameters within the two-parameter logistic item response theory model. We compare lasso regularization with the more commonly used likelihood ratio test method in a 2-group DIF analysis. Simulation and empirical results show that when large amounts of DIF are present and sample sizes are large, lasso regularization has far better control of Type I error than the likelihood ratio test method with little decrement in power. This provides strong evidence that lasso regularization is a promising alternative for testing DIF and selecting anchors. (PsycInfo Database Record (c) 2020 APA, all rights reserved).
Collapse
Affiliation(s)
- William C M Belzak
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| | - Daniel J Bauer
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| |
Collapse
|
17
|
Chen F, Yang H, Bulut O, Cui Y, Xin T. Examining the relation of personality factors to substance use disorder by explanatory item response modeling of DSM-5 symptoms. PLoS One 2019; 14:e0217630. [PMID: 31194760 PMCID: PMC6563981 DOI: 10.1371/journal.pone.0217630] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 05/15/2019] [Indexed: 11/24/2022] Open
Abstract
This paper explores how personality factors affect substance use disorders (SUDs) using explanatory item response modeling (EIRM). A total of 606 Chinese illicit drug users participated in our study. After removing the cases with missing values on the covariate measures, a final sample of 573 participants was used for data analysis. The Diagnostic and Statistical Manual of Mental Disorders (DSM-5) was used to measure the illicit drug users’ SUD level. Four personality factors–anxiety sensitivity, impulsivity, sensation seeking and hopelessness–along with gender and alcohol use were included in EIRM as person covariates. The results indicated that gender, alcohol use, and their interaction significantly predicted the SUD level. The only personality factor that strongly predicted the SUD level was sensation seeking. In addition, the interaction between gender and hopelessness was also found to be a significant predictor of the SUD level, indicating that the negative effect of hopelessness on SUD is stronger for women than for men. The findings suggest that sensation seeking plays an important role in influencing SUDs, and thus, it should be considered when designing intervention or screening procedures for potential illicit drug users. In addition, several DSM-5 SUD symptoms were found to exhibit differential effects by gender, alcohol use, and personality factors. The possible explanations were discussed.
Collapse
Affiliation(s)
- Fu Chen
- Faculty of Psychology, Beijing Normal University, Beijing, China
- Department of Educational Psychology, University of Alberta, Edmonton, AB, Canada
| | - Hongmei Yang
- Faculty of Psychology, Beijing Normal University, Beijing, China
| | - Okan Bulut
- Department of Educational Psychology, University of Alberta, Edmonton, AB, Canada
| | - Ying Cui
- Department of Educational Psychology, University of Alberta, Edmonton, AB, Canada
| | - Tao Xin
- Collaborative Innovation Center of Assessment toward Basic Education Quality, Beijing Normal University, Beijing, China
- * E-mail:
| |
Collapse
|
18
|
Rose N, Nagy G, Nagengast B, Frey A, Becker M. Modeling Multiple Item Context Effects With Generalized Linear Mixed Models. Front Psychol 2019; 10:248. [PMID: 30858809 PMCID: PMC6397884 DOI: 10.3389/fpsyg.2019.00248] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 01/25/2019] [Indexed: 11/30/2022] Open
Abstract
Item context effects refer to the impact of features of a test on an examinee's item responses. These effects cannot be explained by the abilities measured by the test. Investigations typically focus on only a single type of item context effects, such as item position effects, or mode effects, thereby ignoring the fact that different item context effects might operate simultaneously. In this study, two different types of context effects were modeled simultaneously drawing on data from an item calibration study of a multidimensional computerized test (N = 1,632) assessing student competencies in mathematics, science, and reading. We present a generalized linear mixed model (GLMM) parameterization of the multidimensional Rasch model including item position effects (distinguishing between within-block position effects and block position effects), domain order effects, and the interactions between them. Results show that both types of context effects played a role, and that the moderating effect of domain orders was very strong. The findings have direct consequences for planning and applying mixed domain assessment designs.
Collapse
Affiliation(s)
- Norman Rose
- Hector Research Institute of Education Sciences and Psychology, University of Tübingen, Tübingen, Germany
| | - Gabriel Nagy
- Leibniz Institute for Science and Mathematics Education, Kiel, Germany
| | - Benjamin Nagengast
- Hector Research Institute of Education Sciences and Psychology, University of Tübingen, Tübingen, Germany
| | - Andreas Frey
- Department of Educational Psychology, Measurement, Evaluation and Counseling, Institute of Psychology, Goethe-University Frankfurt, Frankfurt, Germany.,Faculty of Education, Centre for Educational Measurement, University of Oslo, Oslo, Norway
| | - Michael Becker
- Leibniz Institute for Science and Mathematics Education, Kiel, Germany.,German Institute for International Educational Research, Frankfurt, Germany
| |
Collapse
|
19
|
Ip EH, Magee MF, Youssef GA, Chen SH. Gleaning Information for Cognitive Operations from Don't Know Responses in Cognitive and Noncognitive Assessments. MULTIVARIATE BEHAVIORAL RESEARCH 2019; 54:159-172. [PMID: 30380920 PMCID: PMC6494712 DOI: 10.1080/00273171.2018.1503075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2017] [Revised: 04/19/2018] [Accepted: 04/22/2018] [Indexed: 06/08/2023]
Abstract
The Don't Know (DK) response - taking the form of an omitted response or not-reached at the end of a cognitive test, or explicitly presented as a response option in a social survey - contains important information that is often overlooked. Direct psychometric modeling efforts for DK responses are few and far between. In this article, the linear logistic test model (LLTM) is proposed for delineating the impacts of cognitive operations for a test that contains DK responses. We assume that the DK response is a valid response. The assumption is reasonable for many situations, including low-stakes cognitive tests and attitudinal assessments. By extracting information embedded in the DK response, the method shows how DK can inform the latent construct of interest and the cognitive operations underlying the response to stimuli. Using a proven recoding scheme, the LLTM could be implemented through commonly used programs such as PROC GLIMMIX. Two simulation experiments to evaluate how well the parameters can be recovered were conducted. In addition, two real data examples, from a noncognitive test of health belief assessment and a cognitive test of knowledge in diabetes, are also presented as case studies to illustrate the LLTM for DK response.
Collapse
Affiliation(s)
- Edward H Ip
- a Department of Biostatistical Sciences, Wake Forest School of Medicine
- b Department of Social Sciences & Health Policy , Wake Forest School of Medicine
| | - Michelle F Magee
- c MedStar Diabetes Institute
- d Georgetown University School of Medicine
| | | | - Shyh-Huei Chen
- a Department of Biostatistical Sciences, Wake Forest School of Medicine
| |
Collapse
|
20
|
Trippas D, Kellen D, Singmann H, Pennycook G, Koehler DJ, Fugelsang JA, Dubé C. Characterizing belief bias in syllogistic reasoning: A hierarchical Bayesian meta-analysis of ROC data. Psychon Bull Rev 2018; 25:2141-2174. [PMID: 29943172 PMCID: PMC6267550 DOI: 10.3758/s13423-018-1460-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The belief-bias effect is one of the most-studied biases in reasoning. A recent study of the phenomenon using the signal detection theory (SDT) model called into question all theoretical accounts of belief bias by demonstrating that belief-based differences in the ability to discriminate between valid and invalid syllogisms may be an artifact stemming from the use of inappropriate linear measurement models such as analysis of variance (Dube et al., Psychological Review, 117(3), 831-863, 2010). The discrepancy between Dube et al.'s, Psychological Review, 117(3), 831-863 (2010) results and the previous three decades of work, together with former's methodological criticisms suggests the need to revisit earlier results, this time collecting confidence-rating responses. Using a hierarchical Bayesian meta-analysis, we reanalyzed a corpus of 22 confidence-rating studies (N = 993). The results indicated that extensive replications using confidence-rating data are unnecessary as the observed receiver operating characteristic functions are not systematically asymmetric. These results were subsequently corroborated by a novel experimental design based on SDT's generalized area theorem. Although the meta-analysis confirms that believability does not influence discriminability unconditionally, it also confirmed previous results that factors such as individual differences mediate the effect. The main point is that data from previous and future studies can be safely analyzed using appropriate hierarchical methods that do not require confidence ratings. More generally, our results set a new standard for analyzing data and evaluating theories in reasoning. Important methodological and theoretical considerations for future work on belief bias and related domains are discussed.
Collapse
Affiliation(s)
- Dries Trippas
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany.
| | | | | | | | | | | | - Chad Dubé
- University of South Florida, Tampa, FL, USA
| |
Collapse
|
21
|
Chae D, Park K. An item response theory based integrated model of headache, nausea, photophobia, and phonophobia in migraine patients. J Pharmacokinet Pharmacodyn 2018; 45:721-731. [PMID: 30043250 DOI: 10.1007/s10928-018-9602-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Accepted: 07/17/2018] [Indexed: 11/28/2022]
Abstract
This study developed an integrated model of severity scores of migraine headache and the incidence of nausea, photophobia, and phonophobia to predict the natural time course of migraine symptoms, which are likely to occur by a common disease progression mechanism. Data were acquired from two phase 3 clinical trials conducted during the development of eletriptan. Only the placebo arm was used for analysis. A conventional proportional odds model was compared with an item response theory (IRT) based approach. Results suggested that the IRT based approach led to a better model fit, successfully revealing the difference in relief rates among different symptoms, which was the fastest in phonophobia and the slowest in headache. Simulation with the developed model suggested that using headache scores at 4 h post-dose attained greatest statistical power, yielding sample size of 100 per arm given drug effect of 40%, as compared to that of 200 per arm when 2 h post-dose scores were used as in the original eletriptan protocol. This work demonstrated the usefulness of an IRT based model as applied to analyzing multidimensional migraine symptoms and designing clinical trials. Our model can be similarly applied to analyzing other multiple endpoints sharing a common underlying mechanism.
Collapse
Affiliation(s)
- Dongwoo Chae
- Department of Pharmacology, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul, 03722, South Korea.,Brain Korea 21 PLUS Project for Medical Science, Yonsei University, Seoul, South Korea
| | - Kyungsoo Park
- Department of Pharmacology, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul, 03722, South Korea.
| |
Collapse
|
22
|
DiTrapani J, Rockwood N, Jeon M. IRT in SPSS Using the SPIRIT Macro. APPLIED PSYCHOLOGICAL MEASUREMENT 2018; 42:173-174. [PMID: 29881119 PMCID: PMC5978649 DOI: 10.1177/0146621617733956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
|
23
|
Drabinová A, Martinková P. Detection of Differential Item Functioning with Nonlinear Regression: A Non-IRT Approach Accounting for Guessing. JOURNAL OF EDUCATIONAL MEASUREMENT 2017. [DOI: 10.1111/jedm.12158] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Adéla Drabinová
- Institute of Computer Science of the Czech Academy of Sciences and Faculty of Mathematics and Physics; Charles University
| | - Patrícia Martinková
- Institute of Computer Science of the Czech Academy of Sciences and Faculty of Education; Charles University
| |
Collapse
|
24
|
Thielemann D, Sengewald MA, Kappler G, Steyer R. A Probit Latent State IRT Model With Latent Item-Effect Variables. EUROPEAN JOURNAL OF PSYCHOLOGICAL ASSESSMENT 2017. [DOI: 10.1027/1015-5759/a000417] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Abstract. We present a probit latent state model for dichotomous items with one latent state variable for each occasion of measurement and one latent item-effect variable for each item, except for a reference item. All latent variables are well defined in terms of conditional probabilities. The model offers the possibility to include explanatory variables for the latent states as well as for the latent item-effect variables. We illustrate the model by a data example with the life satisfaction scale of the Freiburg Personality Inventory (FPI-R; Fahrenberg, Hampel, & Selg, 1984 ) assessed at three time points. Allowing for item-effect variables improves model fit considerably and enhances our knowledge about the items. In some applications, this model opens new ways to investigate differential item functioning, and in others it allows to study response styles.
Collapse
Affiliation(s)
- Désirée Thielemann
- Department of Methodology and Evaluation Research, Friedrich Schiller University, Jena, Germany
| | - Marie-Ann Sengewald
- Department of Methodology and Evaluation Research, Friedrich Schiller University, Jena, Germany
| | - Gregor Kappler
- Department of Methodology and Evaluation Research, Friedrich Schiller University, Jena, Germany
| | - Rolf Steyer
- Department of Methodology and Evaluation Research, Friedrich Schiller University, Jena, Germany
| |
Collapse
|
25
|
Jeon M, Rijmen F, Rabe-Hesketh S. A Variational Maximization-Maximization Algorithm for Generalized Linear Mixed Models with Crossed Random Effects. PSYCHOMETRIKA 2017; 82:10.1007/s11336-017-9555-z. [PMID: 28247165 DOI: 10.1007/s11336-017-9555-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Revised: 10/18/2016] [Indexed: 06/06/2023]
Abstract
We present a variational maximization-maximization algorithm for approximate maximum likelihood estimation of generalized linear mixed models with crossed random effects (e.g., item response models with random items, random raters, or random occasion-specific effects). The method is based on a factorized variational approximation of the latent variable distribution given observed variables, which creates a lower bound of the log marginal likelihood. The lower bound is maximized with respect to the factorized distributions as well as model parameters. With the proposed algorithm, a high-dimensional intractable integration is translated into a two-dimensional integration problem. We incorporate an adaptive Gauss-Hermite quadrature method in conjunction with the variational method in order to increase computational efficiency. Numerical studies show that under the small sample size conditions that are considered the proposed algorithm outperforms the Laplace approximation.
Collapse
Affiliation(s)
- Minjeong Jeon
- Department of Education, University of California, Los Angeles, 405 Hilgard Avenue, Los Angeles, CA, 90095 , USA.
| | - Frank Rijmen
- American Institutes for Research, Washington D.C., USA
| | | |
Collapse
|
26
|
Välitalo PA, Krekels EH, van Dijk M, Simons S, Tibboel D, Knibbe CA. Morphine Pharmacodynamics in Mechanically Ventilated Preterm Neonates Undergoing Endotracheal Suctioning. CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY 2017; 6:239-248. [PMID: 28109060 PMCID: PMC5397563 DOI: 10.1002/psp4.12156] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Accepted: 11/03/2016] [Indexed: 11/08/2022]
Abstract
To date, morphine pharmacokinetics (PKs) are well quantified in neonates, but results about its efficacy are ambiguous. This work presents an analysis of a previously published study on pain measurements in mechanically ventilated preterm neonates who received either morphine or placebo to improve comfort during invasive ventilation. The research question was whether morphine reduces the pain associated with endotracheal or nasal suctioning before, during, and after suctioning. Because these neonates cannot verbalize their pain levels, pain was assessed on the basis of several validated pain measurement instruments (i.e., COMFORT‐B, preterm infant pain profile [PIPP], Neonatal Infant Pain Scale (NIPS), and visual analogue scale (VAS)). The item response theory (IRT) was used to analyze the data in order for us to handle the data from multiple‐item pain scores. The analysis showed an intra‐individual relationship between morphine concentrations and pain reduction, as measured by COMFORT‐B and VAS. However, the small magnitude of the morphine effect was not considered clinically relevant for this intervention in preterm neonates.
Collapse
Affiliation(s)
- P A Välitalo
- Division of Pharmacology, Leiden University, Leiden, The Netherlands
| | - E H Krekels
- Division of Pharmacology, Leiden University, Leiden, The Netherlands
| | - M van Dijk
- Intensive Care and Department of Pediatric Surgery Erasmus MC Sophia Children's Hospital, Rotterdam, The Netherlands.,Department of Pediatrics, Division of Neonatology, Erasmus MC Sophia Children's Hospital, Rotterdam, The Netherlands
| | - Shp Simons
- Department of Pediatrics, Division of Neonatology, Erasmus MC Sophia Children's Hospital, Rotterdam, The Netherlands
| | - D Tibboel
- Intensive Care and Department of Pediatric Surgery Erasmus MC Sophia Children's Hospital, Rotterdam, The Netherlands
| | - C A Knibbe
- Division of Pharmacology, Leiden University, Leiden, The Netherlands.,Intensive Care and Department of Pediatric Surgery Erasmus MC Sophia Children's Hospital, Rotterdam, The Netherlands.,Department of Clinical Pharmacy, St Antonius Hospital, Nieuwegein, The Netherlands
| |
Collapse
|
27
|
Abstract
Many have criticized the Diagnostic and Statistical Manual of Mental Disorders ( DSM-IV), and few regard it as a vehicle of truth, yet its most serious limitation is that its frank operationism in defining manifest categories has distracted attention from theories about what is going on at the latent level. We sketch a Generalized Interpersonal Theory of Personality and Psychopathology and apply it to interpersonal aspects of depression to illustrate how structural individual differences combine with functional dynamic processes to cause interpersonal behavior and affect. Such a causal account relies on a realist ontology in which manifest diagnoses are only a means to learning about the latent distribution, whether categorical or dimensional. Comorbidity of DSM diagnoses suggests that dimensionality will be the rule, not the exception, with internalization and externalization describing common diagnoses.
Collapse
|
28
|
Visser RM, Haver P, Zwitser RJ, Scholte HS, Kindt M. First Steps in Using Multi-Voxel Pattern Analysis to Disentangle Neural Processes Underlying Generalization of Spider Fear. Front Hum Neurosci 2016; 10:222. [PMID: 27303278 PMCID: PMC4882315 DOI: 10.3389/fnhum.2016.00222] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 04/29/2016] [Indexed: 01/02/2023] Open
Abstract
A core symptom of anxiety disorders is the tendency to interpret ambiguous information as threatening. Using electroencephalography and blood oxygenation level dependent magnetic resonance imaging (BOLD-MRI), several studies have begun to elucidate brain processes involved in fear-related perceptual biases, but thus far mainly found evidence for general hypervigilance in high fearful individuals. Recently, multi-voxel pattern analysis (MVPA) has become popular for decoding cognitive states from distributed patterns of neural activation. Here, we used this technique to assess whether biased fear generalization, characteristic of clinical fear, is already present during the initial perception and categorization of a stimulus, or emerges during the subsequent interpretation of a stimulus. Individuals with low spider fear (n = 20) and high spider fear (n = 18) underwent functional MRI scanning while viewing series of schematic flowers morphing to spiders. In line with previous studies, individuals with high fear of spiders were behaviorally more likely to classify ambiguous morphs as spiders than individuals with low fear of spiders. Univariate analyses of BOLD-MRI data revealed stronger activation toward spider pictures in high fearful individuals compared to low fearful individuals in numerous areas. Yet, neither average activation, nor support vector machine classification (i.e., a form of MVPA) matched the behavioral results – i.e., a biased response toward ambiguous stimuli – in any of the regions of interest. This may point to limitations of the current design, and to challenges associated with classifying emotional and neutral stimuli in groups that differ in their judgment of emotionality. Improvements for future research are suggested.
Collapse
Affiliation(s)
- Renée M Visser
- Department of Clinical Psychology, University of AmsterdamAmsterdam, Netherlands; Department of Clinical Psychology, University of AmsterdamAmsterdam, Netherlands; Department of Clinical Psychology, University of AmsterdamAmsterdam, Netherlands
| | - Pia Haver
- Department of Clinical Psychology, University of Amsterdam Amsterdam, Netherlands
| | - Robert J Zwitser
- Department of Psychological Methods, University of Amsterdam Amsterdam, Netherlands
| | - H Steven Scholte
- Amsterdam Brain and Cognition, University of AmsterdamAmsterdam, Netherlands; Department of Brain and Cognition, University of AmsterdamAmsterdam, Netherlands
| | - Merel Kindt
- Department of Clinical Psychology, University of AmsterdamAmsterdam, Netherlands; Amsterdam Brain and Cognition, University of AmsterdamAmsterdam, Netherlands
| |
Collapse
|
29
|
Ip EH, Chen SH, Quandt SA. Analysis of Multiple Partially Ordered Responses to Belief Items with Don't Know Option. PSYCHOMETRIKA 2016; 81:483-505. [PMID: 25479822 PMCID: PMC4458241 DOI: 10.1007/s11336-014-9432-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Indexed: 06/04/2023]
Abstract
Understanding beliefs, values, and preferences of patients is a tenet of contemporary health sciences. This application was motivated by the analysis of multiple partially ordered set (poset) responses from an inventory on layman beliefs about diabetes. The partially ordered set arises because of two features in the data-first, the response options contain a Don't Know (DK) option, and second, there were two consecutive occasions of measurement. As predicted by the common sense model of illness, beliefs about diabetes were not necessarily stable across the two measurement occasions. Instead of analyzing the two occasions separately, we studied the joint responses across the occasions as a poset response. Few analytic methods exist for data structures other than ordered or nominal categories. Poset responses are routinely collapsed and then analyzed as either rank ordered or nominal data, leading to the loss of nuanced information that might be present within poset categories. In this paper we developed a general class of item response models for analyzing the poset data collected from the Common Sense Model of Diabetes Inventory. The inferential object of interest is the latent trait that indicates congruence of belief with the biomedical model. To apply an item response model to the poset diabetes inventory, we proved that a simple coding algorithm circumvents the requirement of writing new codes such that standard IRT software could be directly used for the purpose of item estimation and individual scoring. Simulation experiments were used to examine parameter recovery for the proposed poset model.
Collapse
Affiliation(s)
- Edward H Ip
- Department of Biostatistical Sciences, Wake Forest School of Medicine, Medical Center Blvd., Winston-Salem, NC, 27157, USA.
| | - Shyh-Huei Chen
- Department of Biostatistical Sciences, Wake Forest School of Medicine, Medical Center Blvd., Winston-Salem, NC, 27157, USA
| | - Sara A Quandt
- Department of Epidemiology and Prevention, Wake Forest School of Medicine, Medical Center Blvd., Winston-Salem, NC, 27157, USA
| |
Collapse
|
30
|
A Mixed-effects Location-Scale Model for Ordinal Questionnaire Data. HEALTH SERVICES AND OUTCOMES RESEARCH METHODOLOGY 2016; 16:117-131. [PMID: 27570476 DOI: 10.1007/s10742-016-0145-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
In health studies, questionnaire items are often scored on an ordinal scale, for example on a Likert scale. For such questionnaires, item response theory (IRT) models provide a useful approach for obtaining summary scores for subjects (i.e., the model's random subject effect) and characteristics of the items (e.g., item difficulty and discrimination). In this article, we describe a model that allows the items to additionally exhibit different within-subject variance, and also includes a subject-level random effect to the within-subject variance specification. This permits subjects to be characterized in terms of their mean level, or location, and their variability, or scale, and the model allows item difficulty and discrimination in terms of both random subject effects (location and scale). We illustrate application of this location-scale mixed model using data from the Social Subscale of the Drinking Motives Questionnaire (SS-DMQ) assessed in an adolescent study. We show that the proposed model fits the data significantly better than simpler IRT models, and is able to identify items and subjects that are not well-fit by the simpler models. The proposed model has useful applications in many areas where questionnaires are often rated on an ordinal scale, and there is interest in characterizing subjects in terms of both their mean and variability.
Collapse
|
31
|
Cho SJ, Goodwin AP. Modeling Learning in Doubly Multilevel Binary Longitudinal Data Using Generalized Linear Mixed Models: An Application to Measuring and Explaining Word Learning. PSYCHOMETRIKA 2016; 82:10.1007/s11336-016-9496-y. [PMID: 27038452 DOI: 10.1007/s11336-016-9496-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2015] [Indexed: 06/05/2023]
Abstract
When word learning is supported by instruction in experimental studies for adolescents, word knowledge outcomes tend to be collected from complex data structure, such as multiple aspects of word knowledge, multilevel reader data, multilevel item data, longitudinal design, and multiple groups. This study illustrates how generalized linear mixed models can be used to measure and explain word learning for data having such complexity. Results from this application provide deeper understanding of word knowledge than could be attained from simpler models and show that word knowledge is multidimensional and depends on word characteristics and instructional contexts.
Collapse
Affiliation(s)
- Sun-Joo Cho
- Vanderbilt University's Peabody College, Nashville, TN, USA.
| | | |
Collapse
|
32
|
Koch T, Schultze M, Jeon M, Nussbeck FW, Praetorius AK, Eid M. A Cross-Classified CFA-MTMM Model for Structurally Different and Nonindependent Interchangeable Methods. MULTIVARIATE BEHAVIORAL RESEARCH 2016; 51:67-85. [PMID: 26881958 DOI: 10.1080/00273171.2015.1101367] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Multirater (multimethod, multisource) studies are increasingly applied in psychology. Eid and colleagues (2008) proposed a multilevel confirmatory factor model for multitrait-multimethod (MTMM) data combining structurally different and multiple independent interchangeable methods (raters). In many studies, however, different interchangeable raters (e.g., peers, subordinates) are asked to rate different targets (students, supervisors), leading to violations of the independence assumption and to cross-classified data structures. In the present work, we extend the ML-CFA-MTMM model by Eid and colleagues (2008) to cross-classified multirater designs. The new C4 model (Cross-Classified CTC[M-1] Combination of Methods) accounts for nonindependent interchangeable raters and enables researchers to explicitly model the interaction between targets and raters as a latent variable. Using a real data application, it is shown how credibility intervals of model parameters and different variance components can be obtained using Bayesian estimation techniques.
Collapse
|
33
|
Park R, Pituch KA, Kim J, Dodd BG, Chung H. Marginalized Maximum Likelihood Estimation for the 1PL-AG IRT Model. APPLIED PSYCHOLOGICAL MEASUREMENT 2015; 39:448-464. [PMID: 29881018 PMCID: PMC5978613 DOI: 10.1177/0146621615574694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Marginal maximum likelihood estimation based on the expectation-maximization algorithm (MML/EM) is developed for the one-parameter logistic model with ability-based guessing (1PL-AG) item response theory (IRT) model. The use of the MML/EM estimator is cross-validated with estimates from NLMIXED procedure (PROC NLMIXED) in Statistical Analysis System. Numerical data are provided for comparisons of results from MML/EM and PROC NLMIXED.
Collapse
Affiliation(s)
| | | | | | | | - Hyewon Chung
- Chungnam National University, Daejeon, South Korea
| |
Collapse
|
34
|
Abstract
The new R package flirt is introduced for flexible item response theory (IRT) modeling of psychological, educational, and behavior assessment data. flirt integrates a generalized linear and nonlinear mixed modeling framework with graphical model theory. The graphical model framework allows for efficient maximum likelihood estimation. The key feature of flirt is its modular approach to facilitate convenient and flexible model specifications. Researchers can construct customized IRT models by simply selecting various modeling modules, such as parametric forms, number of dimensions, item and person covariates, person groups, link functions, etc. In this paper, we describe major features of flirt and provide examples to illustrate how flirt works in practice.
Collapse
|
35
|
Strobl C, Kopf J, Zeileis A. Rasch Trees: A New Method for Detecting Differential Item Functioning in the Rasch Model. PSYCHOMETRIKA 2015; 80:289-316. [PMID: 24352514 DOI: 10.1007/s11336-013-9388-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2010] [Indexed: 06/03/2023]
Abstract
A variety of statistical methods have been suggested for detecting differential item functioning (DIF) in the Rasch model. Most of these methods are designed for the comparison of pre-specified focal and reference groups, such as males and females. Latent class approaches, on the other hand, allow the detection of previously unknown groups exhibiting DIF. However, this approach provides no straightforward interpretation of the groups with respect to person characteristics. Here, we propose a new method for DIF detection based on model-based recursive partitioning that can be considered as a compromise between those two extremes. With this approach it is possible to detect groups of subjects exhibiting DIF, which are not pre-specified, but result from combinations of observed covariates. These groups are directly interpretable and can thus help generate hypotheses about the psychological sources of DIF. The statistical background and construction of the new method are introduced by means of an instructive example, and extensive simulation studies are presented to support and illustrate the statistical properties of the method, which is then applied to empirical data from a general knowledge quiz. A software implementation of the method is freely available in the R system for statistical computing.
Collapse
Affiliation(s)
- Carolin Strobl
- Department of Psychology, Universität Zürich, Binzmühlestr. 14, 8050, Zürich, Switzerland,
| | | | | |
Collapse
|
36
|
Milanzi E, Molenberghs G, Alonso A, Verbeke G, De Boeck P. Reliability measures in item response theory: manifest versus latent correlation functions. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2015; 68:43-64. [PMID: 24484622 DOI: 10.1111/bmsp.12033] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2014] [Revised: 02/13/2014] [Indexed: 06/03/2023]
Abstract
For item response theory (IRT) models, which belong to the class of generalized linear or non-linear mixed models, reliability at the scale of observed scores (i.e., manifest correlation) is more difficult to calculate than latent correlation based reliability, but usually of greater scientific interest. This is not least because it cannot be calculated explicitly when the logit link is used in conjunction with normal random effects. As such, approximations such as Fisher's information coefficient, Cronbach's α, or the latent correlation are calculated, allegedly because it is easy to do so. Cronbach's α has well-known and serious drawbacks, Fisher's information is not meaningful under certain circumstances, and there is an important but often overlooked difference between latent and manifest correlations. Here, manifest correlation refers to correlation between observed scores, while latent correlation refers to correlation between scores at the latent (e.g., logit or probit) scale. Thus, using one in place of the other can lead to erroneous conclusions. Taylor series based reliability measures, which are based on manifest correlation functions, are derived and a careful comparison of reliability measures based on latent correlations, Fisher's information, and exact reliability is carried out. The latent correlations are virtually always considerably higher than their manifest counterparts, Fisher's information measure shows no coherent behaviour (it is even negative in some cases), while the newly introduced Taylor series based approximations reflect the exact reliability very closely. Comparisons among the various types of correlations, for various IRT models, are made using algebraic expressions, Monte Carlo simulations, and data analysis. Given the light computational burden and the performance of Taylor series based reliability measures, their use is recommended.
Collapse
Affiliation(s)
- Elasma Milanzi
- Interuniversity Institute for Biostatistics and statistical Bioinformatics, Universiteit Hasselt, Diepenbeek, Belgium
| | | | | | | | | |
Collapse
|
37
|
Jiao H, Zhang Y. Polytomous multilevel testlet models for testlet-based assessments with complex sampling designs. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2015; 68:65-83. [PMID: 24571376 DOI: 10.1111/bmsp.12035] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2012] [Revised: 12/16/2013] [Indexed: 06/03/2023]
Abstract
Applications of standard item response theory models assume local independence of items and persons. This paper presents polytomous multilevel testlet models for dual dependence due to item and person clustering in testlet-based assessments with clustered samples. Simulation and survey data were analysed with a multilevel partial credit testlet model. This model was compared with three alternative models - a testlet partial credit model (PCM), multilevel PCM, and PCM - in terms of model parameter estimation. The results indicated that the deviance information criterion was the fit index that always correctly identified the true multilevel testlet model based on the quantified evidence in model selection, while the Akaike and Bayesian information criteria could not identify the true model. In general, the estimation model and the magnitude of item and person clustering impacted the estimation accuracy of ability parameters, while only the estimation model and the magnitude of item clustering affected the item parameter estimation accuracy. Furthermore, ignoring item clustering effects produced higher total errors in item parameter estimates but did not have much impact on the accuracy of ability parameter estimates, while ignoring person clustering effects yielded higher total errors in ability parameter estimates but did not have much effect on the accuracy of item parameter estimates. When both clustering effects were ignored in the PCM, item and ability parameter estimation accuracy was reduced.
Collapse
Affiliation(s)
- Hong Jiao
- Measurement, Statistics and Evaluation, Department of Human Development and Quantitative Methodology, University of Maryland, College Park, USA
| | | |
Collapse
|
38
|
van Rijn P, Rijmen F. On the explaining-away phenomenon in multivariate latent variable models. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2015; 68:1-22. [PMID: 25469472 DOI: 10.1111/bmsp.12046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Revised: 10/24/2014] [Indexed: 06/04/2023]
Abstract
Many probabilistic models for psychological and educational measurements contain latent variables. Well-known examples are factor analysis, item response theory, and latent class model families. We discuss what is referred to as the 'explaining-away' phenomenon in the context of such latent variable models. This phenomenon can occur when multiple latent variables are related to the same observed variable, and can elicit seemingly counterintuitive conditional dependencies between latent variables given observed variables. We illustrate the implications of explaining away for a number of well-known latent variable models by using both theoretical and real data examples.
Collapse
|
39
|
Abstract
In this paper, the state of research on the assessment of competencies in higher education is reviewed. Fundamental conceptual and methodological issues are clarified by showing that current controversies are built on misleading dichotomies. By systematically sketching conceptual controversies, competing competence definitions are unpacked (analytic/trait vs. holistic/real-world performance) and commonplaces are identified. Disagreements are also highlighted. Similarly, competing statistical approaches to assessing competencies, namely item-response theory (latent trait) versus generalizability theory (sampling error variance), are unpacked. The resulting framework moves beyond dichotomies and shows how the different approaches complement each other. Competence is viewed along a continuum from traits that underlie perception, interpretation, and decision-making skills, which in turn give rise to observed behavior in real-world situations. Statistical approaches are also viewed along a continuum from linear to nonlinear models that serve different purposes. Item response theory (IRT) models may be used for scaling item responses and modeling structural relations, and generalizability theory (GT) models pinpoint sources of measurement error variance, thereby enabling the design of reliable measurements. The proposed framework suggests multiple new research studies and may serve as a “grand” structural model.
Collapse
Affiliation(s)
- Sigrid Blömeke
- Centre for Educational Measurement (CEMO), University of Oslo, Norway
| | - Jan-Eric Gustafsson
- Centre for Educational Measurement (CEMO), University of Oslo, Norway
- Department of Education and Special Education, University of Gothenburg, Sweden
| | - Richard J. Shavelson
- SK Partners LLC, Menlo Park, CA, USA
- Graduate School of Education, Stanford University, CA, USA
| |
Collapse
|
40
|
Arima S. Item selection via Bayesian IRT models. Stat Med 2014; 34:487-503. [PMID: 25327293 DOI: 10.1002/sim.6341] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2014] [Revised: 09/23/2014] [Accepted: 10/06/2014] [Indexed: 11/11/2022]
Abstract
With reference to a questionnaire that aimed to assess the quality of life for dysarthric speakers, we investigate the usefulness of a model-based procedure for reducing the number of items. We propose a mixed cumulative logit model, which is known in the psychometrics literature as the graded response model: responses to different items are modelled as a function of individual latent traits and as a function of item characteristics, such as their difficulty and their discrimination power. We jointly model the discrimination and the difficulty parameters by using a k-component mixture of normal distributions. Mixture components correspond to disjoint groups of items. Items that belong to the same groups can be considered equivalent in terms of both difficulty and discrimination power. According to decision criteria, we select a subset of items such that the reduced questionnaire is able to provide the same information that the complete questionnaire provides. The model is estimated by using a Bayesian approach, and the choice of the number of mixture components is justified according to information criteria. We illustrate the proposed approach on the basis of data that are collected for 104 dysarthric patients by local health authorities in Lecce and in Milan.
Collapse
Affiliation(s)
- Serena Arima
- Dipartimento di Metodi e Modelli per l'Economia, il Territorio e la Finanza, Sapienza Università di Roma, Rome, Italy
| |
Collapse
|
41
|
Rijmen F. EFFICIENT FULL INFORMATION MAXIMUM LIKELIHOOD ESTIMATION FOR MULTIDIMENSIONAL IRT MODELS. ACTA ACUST UNITED AC 2014. [DOI: 10.1002/j.2333-8504.2009.tb02160.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
42
|
Li Y, Li S, Wang L. APPLICATION OF A GENERAL POLYTOMOUS TESTLET MODEL TO THE READING SECTION OF A LARGE-SCALE ENGLISH LANGUAGE ASSESSMENT. ACTA ACUST UNITED AC 2014. [DOI: 10.1002/j.2333-8504.2010.tb02228.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
43
|
Chapman B, Fiscella K, Duberstein P, Kawachi I, Muennig P. Measurement confounding affects the extent to which verbal IQ explains social gradients in mortality. J Epidemiol Community Health 2014; 68:728-33. [PMID: 24729404 PMCID: PMC4846277 DOI: 10.1136/jech-2013-203741] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
BACKGROUND IQ is thought to explain social gradients in mortality. IQ scores are based roughly equally on Verbal IQ (VIQ) and Performance IQ tests. VIQ tests, however, are suspected to confound true verbal ability with socioeconomic status (SES), raising the possibility that associations between SES and IQ scores might be overestimated. We examined, first, whether two of the most common types of VIQ tests exhibited differential item functioning (DIF) favouring persons of higher SES and/or majority race/ethnicity. Second, we assessed what impact, if any, this had on estimates of the extent to which VIQ explains social gradients in mortality. METHODS Data from the General Social Survey-National Death Index cohort, a US population representative dataset, was used. Item response theory models queried social-factor DIF on the Thorndike Verbal Intelligence Scale and Wechsler Adult Intelligence Scales, Revised Similarities test. Cox models examined mortality associations among SES and VIQ scores corrected and uncorrected for DIF. RESULTS When uncorrected for DIF, VIQ was correlated with income, education, occupational prestige and race, with correlation coefficients ranging between |0.12| and |0.43|. After correcting for DIF, correlations ranged from |0.06| to |0.16|. Uncorrected VIQ scores explained 11-40% of the Relative Index of Inequalities in mortality for social factors, while DIF-corrected scores explained 2-29%. CONCLUSIONS Two of the common forms of VIQ tests appear confound verbal intelligence with SES. Since these tests appear in most IQ batteries, circumspection may be warranted in estimating the amount of social inequalities in mortality attributable to IQ.
Collapse
Affiliation(s)
- Benjamin Chapman
- Department of Psychiatry, University of Rochester Medical Center, Rochester, New York, USA
| | - Kevin Fiscella
- Department of Family Medicine, University of Rochester Medical Center, Center for Communication and Disparities Research, Rochester, New York, USA
- Department of Public Health Sciences, University of Rochester Medical Center, Rochester, New York, USA
| | - Paul Duberstein
- Department of Psychiatry, University of Rochester Medical Center, Rochester, New York, USA
- Department of Family Medicine, University of Rochester Medical Center, Center for Communication and Disparities Research, Rochester, New York, USA
| | - Ichiro Kawachi
- Department of Society, Human Development, and Health, Harvard University School of Public Health, Boston, Massachusetts, USA
| | - Peter Muennig
- Department of Health Management and Policy, Columbia University, Mailman School of Public Health, New York, New York, USA
| |
Collapse
|
44
|
Debeer D, Janssen R. Modeling Item-Position Effects Within an IRT Framework. JOURNAL OF EDUCATIONAL MEASUREMENT 2013. [DOI: 10.1111/jedm.12009] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
45
|
Thiel H, Thomsen SL. Noncognitive skills in economics: Models, measurement, and empirical evidence. ACTA ACUST UNITED AC 2013. [DOI: 10.1016/j.rie.2013.03.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
46
|
Cho SJ, Athay M, Preacher KJ. Measuring change for a multidimensional test using a generalized explanatory longitudinal item response model. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2013; 66:353-381. [PMID: 23082893 DOI: 10.1111/j.2044-8317.2012.02058.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Even though many educational and psychological tests are known to be multidimensional, little research has been done to address how to measure individual differences in change within an item response theory framework. In this paper, we suggest a generalized explanatory longitudinal item response model to measure individual differences in change. New longitudinal models for multidimensional tests and existing models for unidimensional tests are presented within this framework and implemented with software developed for generalized linear models. In addition to the measurement of change, the longitudinal models we present can also be used to explain individual differences in change scores for person groups (e.g., learning disabled students versus non-learning disabled students) and to model differences in item difficulties across item groups (e.g., number operation, measurement, and representation item groups in a mathematics test). An empirical example illustrates the use of the various models for measuring individual differences in change when there are person groups and multiple skill domains which lead to multidimensionality at a time point.
Collapse
Affiliation(s)
- Sun-Joo Cho
- Peabody College of Vanderbilt University, Nashville, TN 37203-5721, USA.
| | | | | |
Collapse
|
47
|
Luo S, Ma J, Kieburtz KD. Robust Bayesian inference for multivariate longitudinal data by using normal/independent distributions. Stat Med 2013; 32:3812-28. [PMID: 23494809 DOI: 10.1002/sim.5778] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2012] [Revised: 02/04/2013] [Accepted: 02/07/2013] [Indexed: 11/07/2022]
Abstract
Many randomized clinical trials collect multivariate longitudinal measurements in different scales, for example, binary, ordinal, and continuous. Multilevel item response models are used to evaluate the global treatment effects across multiple outcomes while accounting for all sources of correlation. Continuous measurements are often assumed to be normally distributed. But the model inference is not robust when the normality assumption is violated because of heavy tails and outliers. In this article, we develop a Bayesian method for multilevel item response models replacing the normal distributions with symmetric heavy-tailed normal/independent distributions. The inference is conducted using a Bayesian framework via Markov Chain Monte Carlo simulation implemented in BUGS language. Our proposed method is evaluated by simulation studies and is applied to Earlier versus Later Levodopa Therapy in Parkinson's Disease study, a motivating clinical trial assessing the effect of Levodopa therapy on the Parkinson's disease progression rate.
Collapse
Affiliation(s)
- Sheng Luo
- Division of Biostatistics, University of Texas School of Public Health, 1200 Pressler St, Houston, Texas 77030, U.S.A
| | | | | |
Collapse
|
48
|
Kern ML, Hampson SE, Goldberg LR, Friedman HS. Integrating prospective longitudinal data: modeling personality and health in the Terman Life Cycle and Hawaii Longitudinal Studies. Dev Psychol 2012; 50:1390-406. [PMID: 23231689 DOI: 10.1037/a0030874] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The present study used a collaborative framework to integrate 2 long-term prospective studies: the Terman Life Cycle Study and the Hawaii Personality and Health Longitudinal Study. Within a 5-factor personality-trait framework, teacher assessments of child personality were rationally and empirically aligned to establish similar factor structures across samples. Comparable items related to adult self-rated health, education, and alcohol use were harmonized, and data were pooled on harmonized items. A structural model was estimated as a multigroup analysis. Harmonized child personality factors were then used to examine markers of physiological dysfunction in the Hawaii sample and mortality risk in the Terman sample. Harmonized conscientiousness predicted less physiological dysfunction in the Hawaii sample and lower mortality risk in the Terman sample. These results illustrate how collaborative, integrative work with multiple samples offers the exciting possibility that samples from different cohorts and ages can be linked together to directly test life span theories of personality and health.
Collapse
|
49
|
Bacci S. Longitudinal data: different approaches in the context of item-response theory models. J Appl Stat 2012. [DOI: 10.1080/02664763.2012.700451] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
50
|
Twice random, once mixed: applying mixed models to simultaneously analyze random effects of language and participants. Behav Res Methods 2012; 44:232-47. [PMID: 21858733 DOI: 10.3758/s13428-011-0145-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Psychologists, psycholinguists, and other researchers using language stimuli have been struggling for more than 30 years with the problem of how to analyze experimental data that contain two crossed random effects (items and participants). The classical analysis of variance does not apply; alternatives have been proposed but have failed to catch on, and a statistically unsatisfactory procedure of using two approximations (known as F(1) and F(2)) has become the standard. A simple and elegant solution using mixed model analysis has been available for 15 years, and recent improvements in statistical software have made mixed models analysis widely available. The aim of this article is to increase the use of mixed models by giving a concise practical introduction and by giving clear directions for undertaking the analysis in the most popular statistical packages. The article also introduces the DJMIXED: add-on package for SPSS, which makes entering the models and reporting their results as straightforward as possible.
Collapse
|