51
|
de Rooij M, Schouteden M. The Mixed Effects Trend Vector Model. MULTIVARIATE BEHAVIORAL RESEARCH 2012; 47:635-664. [PMID: 26777672 DOI: 10.1080/00273171.2012.692640] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Maximum likelihood estimation of mixed effect baseline category logit models for multinomial longitudinal data can be prohibitive due to the integral dimension of the random effects distribution. We propose to use multidimensional unfolding methodology to reduce the dimensionality of the problem. As a by-product, readily interpretable graphical displays representing change are obtained. The methodology can be applied to both nominal and ordinal response variables. Relationships to standard statistical models for multinomial data are presented. Several empirical examples are given to show the merits of the proposed modeling framework.
Collapse
|
52
|
Modeling Language Components in Mathematical Items Using Multiple Random Effects IRT Models. ACTA PSYCHOLOGICA SINICA 2012. [DOI: 10.3724/sp.j.1041.2011.00462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
53
|
|
54
|
|
55
|
Proust-Lima C, Dartigues JF, Jacqmin-Gadda H. Misuse of the linear mixed model when evaluating risk factors of cognitive decline. Am J Epidemiol 2011; 174:1077-88. [PMID: 21965187 DOI: 10.1093/aje/kwr243] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The linear mixed model (LMM), which is routinely used to describe change in outcomes over time and its association with risk factors, assumes that a unit change in any predictor is associated with a constant change in the outcome. When it is used on psychometric tests, this assumption may not hold. Indeed, psychometric tests usually suffer from ceiling and/or floor effects and curvilinearity (i.e., varying sensitivity to change). The authors aimed to determine the consequences of such misspecification when evaluating predictors of cognitive decline. As an alternative to the LMM, they considered 2 mixed models based on latent processes that handle discrete and bounded outcomes. Model differences are illustrated here using data on 4 psychometric tests from the Personnes Agées QUID (PAQUID) Study (1989-2004). The type I error of the Wald test for risk-factor regression parameters was formally assessed in a simulation study. It demonstrated that type I errors in the LMM could be dramatically inflated for some tests, such that spurious associations with risk factors were found. In particular, confusion between effects on mean level and effects on change over time was highlighted. The authors recommend use of the alternative mixed models when studying psychometric tests and more generally quantitative scales (quality of life, activities of daily living).
Collapse
Affiliation(s)
- Cécile Proust-Lima
- Institut de Sante´ Publique, d’E´ pide´miologie et de De´veloppement, Universite´ Bordeaux Segalen, 146 rue Le´o Saignat, 33076 Bordeaux Cedex, France.
| | | | | |
Collapse
|
56
|
Swartz RJ, Schwartz C, Basch E, Cai L, Fairclough DL, McLeod L, Mendoza TR, Rapkin B. The king's foot of patient-reported outcomes: current practices and new developments for the measurement of change. Qual Life Res 2011; 20:1159-67. [PMID: 21336659 PMCID: PMC3178017 DOI: 10.1007/s11136-011-9863-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/21/2011] [Indexed: 11/01/2022]
Abstract
PURPOSE Assessing change remains a challenge in patient-reported outcomes. In June 2009, a group of psychometricians, biostatisticians, and behavioral researchers from other disciplines convened as a Longitudinal Analysis of Patient-Reported Outcomes Working group as part of the Statistical and Applied Mathematical Sciences Institute Summer Psychometric program to discuss the complex issues that arise when conceptualizing and operationalizing "change" in patient-reported outcome (PRO) measures and related constructs. This white paper summarizes these issues and provides recommendations and possible paths for dealing with the complexities of measuring change. METHODS/RESULTS This article presents and discusses issues associated with: (1) conceptualizing and operationalizing change in PRO measures; (2) modeling change using state-of-the-art statistical methods; (3) impediments to detecting true change; (4) new developments to deal with these challenges; and (5) important gaps that are fertile ground for future research. CONCLUSIONS There was a consensus that important research still needs to be performed in order develop and refine high-quality PRO measures and statistical methods to analyze and model change in PRO constructs.
Collapse
Affiliation(s)
- Richard J Swartz
- Jones Graduate School of Business, Rice University, Houston, TX, USA.
| | | | | | | | | | | | | | | |
Collapse
|
57
|
Isaac K, Hochweber J. Modellierung von Kompetenzen im Bereich „Sprache und Sprachgebrauch untersuchen” mit schwierigkeitsbestimmenden Aufgabenmerkmalen. ZEITSCHRIFT FUR ENTWICKLUNGSPSYCHOLOGIE UND PADAGOGISCHE PSYCHOLOGIE 2011. [DOI: 10.1026/0049-8637/a000054] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Zusammenfassung. Für den Bereich „Sprache und Sprachgebrauch untersuchen” der KMK-Bildungsstandards wurden basierend auf Kategorien sprachbewusster Tätigkeiten neun schwierigkeitsbestimmende Aufgabenmerkmale abgeleitet. Auf Grundlage einer Stichprobe von 3.276 Schülerinnen und Schülern der dritten und vierten Klassenstufe wurde im ersten Analyseschritt der Zusammenhang der Aufgabenmerkmale mit den Schwierigkeiten von 107 Testaufgaben untersucht. Im zweiten Schritt wurde untersucht, inwieweit die schwierigkeitsgenerierenden Effekte der Aufgabenmerkmale von der sprachlichen Herkunft der Schülerinnen und Schüler unter Berücksichtigung des kulturellen Kapitals sowie der Klassenstufe moderiert werden, d.h. Differential Facet Functioning (DFF) vorliegt. Für Schritt 1 wurde das linear-logistische Testmodell (LLTM) in der Formulierung als generalisiertes lineares gemischtes Modell (GLMM) verwendet, welches für Schritt 2 um Schülermerkmale als Prädiktoren ergänzt wurde. Die Aufgabenmerkmale weisen einen sehr hohen Erklärungswert für die Aufgabenschwierigkeiten auf (R2 = .695), wobei die Beziehung zur Aufgabenschwierigkeit bei acht der neun Aufgabenmerkmale von mindestens einem der Schülermerkmale oder der Klassenstufe abhängt.
Collapse
Affiliation(s)
- Kevin Isaac
- Ministerium für Schule und Weiterbildung des Landes Nordrhein-Westfalen, Soest
| | - Jan Hochweber
- Deutsches Institut für Internationale Pädagogische Forschung (DIPF), Frankfurt
| |
Collapse
|
58
|
Blanchin M, Hardouin JB, Le Neel T, Kubis G, Blanchard C, Mirallié E, Sébille V. Comparison of CTT and Rasch-based approaches for the analysis of longitudinal Patient Reported Outcomes. Stat Med 2010; 30:825-38. [PMID: 21432877 DOI: 10.1002/sim.4153] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2009] [Accepted: 11/01/2010] [Indexed: 11/11/2022]
Abstract
Health sciences frequently deal with Patient Reported Outcomes (PRO) data for the evaluation of concepts, in particular health-related quality of life, which cannot be directly measured and are often called latent variables. Two approaches are commonly used for the analysis of such data: Classical Test Theory (CTT) and Item Response Theory (IRT). Longitudinal data are often collected to analyze the evolution of an outcome over time. The most adequate strategy to analyze longitudinal latent variables, which can be either based on CTT or IRT models, remains to be identified. This strategy must take into account the latent characteristic of what PROs are intended to measure as well as the specificity of longitudinal designs. A simple and widely used IRT model is the Rasch model. The purpose of our study was to compare CTT and Rasch-based approaches to analyze longitudinal PRO data regarding type I error, power, and time effect estimation bias. Four methods were compared: the Score and Mixed models (SM) method based on the CTT approach, the Rasch and Mixed models (RM), the Plausible Values (PV), and the Longitudinal Rasch model (LRM) methods all based on the Rasch model. All methods have shown comparable results in terms of type I error, all close to 5 per cent. LRM and SM methods presented comparable power and unbiased time effect estimations, whereas RM and PV methods showed low power and biased time effect estimations. This suggests that RM and PV methods should be avoided to analyze longitudinal latent variables.
Collapse
Affiliation(s)
- Myriam Blanchin
- EA 4275 'Biostatistics, Clinical Research and Subjective Measures in Health Sciences', Faculty of Pharmaceutical Sciences, University of Nantes, Nantes, France.
| | | | | | | | | | | | | |
Collapse
|
59
|
Ip EH. Empirically indistinguishable multidimensional IRT and locally dependent unidimensional item response models. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2010; 63:395-416. [PMID: 19840494 PMCID: PMC6137815 DOI: 10.1348/000711009x466835] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Multidimensionality is a core concept in the measurement and analysis of psychological data. In personality assessment, for example, constructs are mostly theoretically defined as unidimensional, yet responses collected from the real world are almost always determined by multiple factors. Significant research efforts have concentrated on the use of simulated studies to evaluate the robustness of unidimensional item response models when applied to multidimensional data with a dominant dimension. In contrast, in the present paper, I report the result from a theoretical investigation that a multidimensional item response model is empirically indistinguishable from a locally dependent unidimensional model, of which the single dimension represents the actual construct of interest. A practical implication of this result is that multidimensional response data do not automatically require the use of multidimensional models. Circumstances under which the alternative approach of locally dependent unidimensional models may be useful are discussed.
Collapse
|
60
|
Strobl C, Malley J, Tutz G. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol Methods 2010; 14:323-48. [PMID: 19968396 DOI: 10.1037/a0016973] [Citation(s) in RCA: 1026] [Impact Index Per Article: 73.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Recursive partitioning methods have become popular and widely used tools for nonparametric regression and classification in many scientific fields. Especially random forests, which can deal with large numbers of predictor variables even in the presence of complex interactions, have been applied successfully in genetics, clinical medicine, and bioinformatics within the past few years. High-dimensional problems are common not only in genetics, but also in some areas of psychological research, where only a few subjects can be measured because of time or cost constraints, yet a large amount of data is generated for each subject. Random forests have been shown to achieve a high prediction accuracy in such applications and to provide descriptive variable importance measures reflecting the impact of each variable in both main effects and interactions. The aim of this work is to introduce the principles of the standard recursive partitioning methods as well as recent methodological improvements, to illustrate their usage for low and high-dimensional data exploration, but also to point out limitations of the methods and potential pitfalls in their practical application. Application of the methods is illustrated with freely available implementations in the R system for statistical computing.
Collapse
Affiliation(s)
- Carolin Strobl
- Department of Statistics, Ludwig-Maximilians-Universität Munich, Munich, Germany.
| | | | | |
Collapse
|
61
|
Investigating latent constructs with item response models: a MATLAB IRTm toolbox. Behav Res Methods 2010; 41:1127-37. [PMID: 19897820 DOI: 10.3758/brm.41.4.1127] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Item response theory (IRT) models are the central tools in modern measurement and advanced psychometrics. We offer a MATLAB IRT modeling (IRTm) toolbox that is freely available and that follows an explicit design matrix approach, giving the end user control and flexibility in building a model that goes beyond standard models, such as the Rasch model (Rasch, 1960) and the two-parameter logistic model. As such, IRTm allows for a large variety of unidimensional IRT models for binary responses, the incorporation of additional person and item information, and deviations from common model assumptions. An exclusive key feature of the toolbox is the inclusion of copula IRT models to handle local item dependencies. Two appendixes for this report, containing example code and information on the general copula IRT in IRTm, may be downloaded from brm.psychonomic-journals.org/content/supplemental.
Collapse
|
62
|
van Nispen RMA, Knol DL, Neve HJ, van Rens GHMB. A multilevel item response theory model was investigated for longitudinal vision-related quality-of-life data. J Clin Epidemiol 2009; 63:321-30. [PMID: 19766455 DOI: 10.1016/j.jclinepi.2009.06.012] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2008] [Revised: 05/19/2009] [Accepted: 06/02/2009] [Indexed: 11/26/2022]
Abstract
OBJECTIVE To investigate how a multilevel item response theory (IRT) model for longitudinal dependent data could provide average and individual quality-of-life outcomes of low-vision rehabilitation. STUDY DESIGN AND SETTING In a nonrandomized longitudinal design, visually impaired older patients (n=296) were referred to multidisciplinary rehabilitation or to an optometric service. The five-dimensional Low Vision Quality of Life Questionnaire was administered at four time points. The IRT model was characterized by the graded response model for rating scales. Covariates were added to the model, mainly to correct for missing data. The invariance assumption across time points was investigated. RESULTS Average and individual rehabilitation effects were estimated. For multidisciplinary rehabilitation, significant average deterioration was seen on three dimensions after 4.4 years. Many individuals in the optometric service group significantly improved on the "reading small print" dimension (18.5%); in both groups, many individuals significantly deteriorated on "visual (motor) skills" (22.2-30.0%). Invariance across time points could be assumed for all dimensions, except for "adjustment." Gender, education, visual acuity, and health status were significantly associated with the outcome. CONCLUSION We present how a multilevel IRT model can be applied to describe longitudinal dependent vision-related quality-of-life data, while focusing on average and individual effects.
Collapse
Affiliation(s)
- Ruth M A van Nispen
- Department of Ophthalmology, VU University Medical Center Amsterdam, Amsterdam, The Netherlands.
| | | | | | | |
Collapse
|
63
|
Bauer DJ, Hussong AM. Psychometric approaches for developing commensurate measures across independent studies: traditional and new models. Psychol Methods 2009; 14:101-25. [PMID: 19485624 DOI: 10.1037/a0015583] [Citation(s) in RCA: 149] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
When conducting an integrative analysis of data obtained from multiple independent studies, a fundamental problem is to establish commensurate measures for the constructs of interest. Fortunately, procedures for evaluating and establishing measurement equivalence across samples are well developed for the linear factor model and commonly used item response theory models. A newly proposed moderated nonlinear factor analysis model generalizes these models and procedures, allowing for items of different scale types (continuous or discrete) and differential item functioning across levels of categorical and/or continuous variables. The potential of this new model to resolve the problem of measurement in integrative data analysis is shown via an empirical example examining changes in alcohol involvement from ages 10 to 22 years across 2 longitudinal studies.
Collapse
Affiliation(s)
- Daniel J Bauer
- Department of Psychology, University of North Carolina, Chapel Hill, NC 27599-3270, USA.
| | | |
Collapse
|
64
|
Culpepper SA, Davenport EC. Assessing Differential Prediction of College Grades by Race/Ethnicity with a Multilevel Model. JOURNAL OF EDUCATIONAL MEASUREMENT 2009. [DOI: 10.1111/j.1745-3984.2009.00079.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
65
|
McArdle JJ, Grimm KJ, Hamagami F, Bowles RP, Meredith W. Modeling life-span growth curves of cognition using longitudinal data with multiple samples and changing scales of measurement. Psychol Methods 2009; 14:126-49. [PMID: 19485625 PMCID: PMC2831479 DOI: 10.1037/a0015857] [Citation(s) in RCA: 101] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The authors use multiple-sample longitudinal data from different test batteries to examine propositions about changes in constructs over the life span. The data come from 3 classic studies on intellectual abilities in which, in combination, 441 persons were repeatedly measured as many as 16 times over 70 years. They measured cognitive constructs of vocabulary and memory using 8 age-appropriate intelligence test batteries and explore possible linkage of these scales using item response theory (IRT). They simultaneously estimated the parameters of both IRT and latent curve models based on a joint model likelihood approach (i.e., NLMIXED and WINBUGS). They included group differences in the model to examine potential interindividual differences in levels and change. The resulting longitudinal invariant Rasch test analyses lead to a few new methodological suggestions for dealing with repeated constructs based on changing measurements in developmental studies.
Collapse
Affiliation(s)
- John J McArdle
- Department of Psychology, University of Southern California, 3620 South McClintock Avenue, SGM 501, Los Angeles, CA 90089-1061, USA.
| | | | | | | | | |
Collapse
|
66
|
Hickendorff M, Heiser WJ, van Putten CM, Verhelst ND. Solution Strategies and Achievement in Dutch Complex Arithmetic: Latent Variable Modeling of Change. PSYCHOMETRIKA 2009; 74:331-350. [PMID: 20037636 PMCID: PMC2792350 DOI: 10.1007/s11336-008-9074-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2007] [Revised: 05/09/2008] [Accepted: 07/08/2008] [Indexed: 05/28/2023]
Abstract
In the Netherlands, national assessments at the end of primary school (Grade 6) show a decline of achievement on problems of complex or written arithmetic over the last two decades. The present study aims at contributing to an explanation of the large achievement decrease on complex division, by investigating the strategies students used in solving the division problems in the two most recent assessments carried out in 1997 and in 2004. The students' strategies were classified into four categories. A data set resulted with two types of repeated observations within students: the nominal strategies and the dichotomous achievement scores (correct/incorrect) on the items administered.It is argued that latent variable modeling methodology is appropriate to analyze these data. First, latent class analyses with year of assessment as a covariate were carried out on the multivariate nominal strategy variables. Results showed a shift from application of the traditional long division algorithm in 1997, to the less accurate strategy of stating an answer without writing down any notes or calculations in 2004, especially for boys. Second, explanatory IRT analyses showed that the three main strategies were significantly less accurate in 2004 than they were in 1997.
Collapse
Affiliation(s)
- Marian Hickendorff
- Division of Methodology and Psychometrics, Institute for Psychological Research, Leiden University, P.O. Box 9555, 2300 RB Leiden, The Netherlands
| | - Willem J. Heiser
- Division of Methodology and Psychometrics, Institute for Psychological Research, Leiden University, P.O. Box 9555, 2300 RB Leiden, The Netherlands
| | - Cornelis M. van Putten
- Division of Methodology and Psychometrics, Institute for Psychological Research, Leiden University, P.O. Box 9555, 2300 RB Leiden, The Netherlands
| | - Norman D. Verhelst
- CITO, National Institute for Educational Measurement, Arnhem, The Netherlands
| |
Collapse
|
67
|
Haley SM, Ni P, Jette AM, Tao W, Moed R, Meyers D, Ludlow LH. Replenishing a computerized adaptive test of patient-reported daily activity functioning. Qual Life Res 2009; 18:461-71. [PMID: 19288222 DOI: 10.1007/s11136-009-9463-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2007] [Accepted: 03/01/2009] [Indexed: 11/30/2022]
Abstract
PURPOSE Computerized adaptive testing (CAT) item banks may need to be updated, but before new items can be added, they must be linked to the previous CAT. The purpose of this study was to evaluate 41 pretest items prior to including them into an operational CAT. METHODS We recruited 6,882 patients with spine, lower extremity, upper extremity, and nonorthopedic impairments who received outpatient rehabilitation in one of 147 clinics across 13 states of the USA. Forty-one new Daily Activity (DA) items were administered along with the Activity Measure for Post-Acute Care Daily Activity CAT (DA-CAT-1) in five separate waves. We compared the scoring consistency with the full item bank, test information function (TIF), person standard errors (SEs), and content range of the DA-CAT-1 to the new CAT (DA-CAT-2) with the pretest items by real data simulations. RESULTS We retained 29 of the 41 pretest items. Scores from the DA-CAT-2 were more consistent (ICC = 0.90 versus 0.96) than DA-CAT-1 when compared with the full item bank. TIF and person SEs were improved for persons with higher levels of DA functioning, and ceiling effects were reduced from 16.1% to 6.1%. CONCLUSIONS Item response theory and online calibration methods were valuable in improving the DA-CAT.
Collapse
|
68
|
van Nispen RMA, Knol DL, Langelaan M, de Boer MR, Terwee CB, van Rens GHMB. Applying multilevel item response theory to vision-related quality of life in Dutch visually impaired elderly. Optom Vis Sci 2007; 84:710-20. [PMID: 17700335 DOI: 10.1097/opx.0b013e31813375b8] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
PURPOSE Instead of applying the usual longitudinal methods to assess the outcome of low-vision rehabilitation services in terms of vision-related quality of life, a three-level Item Response Theory (IRT) method was proposed. METHODS The translated Vision-Related Quality of Life Core Measure (VCM1) and Low Vision Quality Of Life (LVQOL) questionnaires were used in a nonrandomized follow-up study among elderly patients (n = 296) referred to two different low-vision rehabilitation services in the Netherlands. Factor analysis was performed on the matrix of polychoric correlations to investigate (uni-)dimensionality and to prepare both questionnaires for the multilevel IRT analyses. A statistical model, which was characterized by a graded response model for rating scales, was developed. Threshold and item difficulty parameters and group by time-specific mean fixed effects were estimated. Random individual effects were predicted. Measurement invariance across occasions was tested. RESULTS The VCM1 and the LVQOL "reading and fine work" dimension showed item parameter drift. In the multidisciplinary rehabilitation center patients, deterioration was found on the "mobility" dimension after 1 year and improvement was found on "adjustment" and "visual (motor) skills" after 5 months (p < 0.05). Patients in both low-vision services showed improvement on the "reading small print" subscale at both follow-up time points (p < 0.05). CONCLUSIONS Except for improvement in "reading small print," low-vision rehabilitation services did not seem to contribute substantially to any other dimensions of vision-related quality of life. The results showed a change in only a limited number of individual patients. However, with regard to the field of low-vision rehabilitation, the proposed IRT method seemed to be successful in the follow-up of individuals. IRT specific software was unnecessary. The data did not have to be complete and the use of cumulative logits made the proposed IRT method an economical and efficient approach. Because of item parameter drift, the VCM1 was difficult to interpret. The use of multilevel IRT models with longitudinal data and dependent observations is recommended.
Collapse
Affiliation(s)
- Ruth M A van Nispen
- Department of Ophthalmology, Institute for Research in Extramural Medicine, VU University Medical Center, Amsterdam, The Netherlands.
| | | | | | | | | | | |
Collapse
|
69
|
|
70
|
Böckenholt U. Thurstonian-Based Analyses: Past, Present, and Future Utilities. PSYCHOMETRIKA 2006; 71:615-629. [PMID: 20046841 PMCID: PMC2798976 DOI: 10.1007/s11336-006-1598-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2006] [Accepted: 12/22/2006] [Indexed: 05/28/2023]
Abstract
Current psychometric models of choice behavior are strongly influenced by Thurstone's (1927, 1931) experimental and statistical work on measuring and scaling preferences. Aided by advances in computational techniques, choice models can now accommodate a wide range of different data types and sources of preference variability among respondents induced by such diverse factors as person-specific choice sets or different functional forms for the underlying utility representations. At the same time, these models are increasingly challenged by behavioral work demonstrating the prevalence of choice behavior that is not consistent with the underlying assumptions of these models. I discuss new modeling avenues that can account for such seemingly inconsistent choice behavior and conclude by emphasizing the interdisciplinary frontiers in the study of choice behavior and the resulting challenges for psychometricians.
Collapse
|
71
|
|
72
|
Wang WC, Wilson M, Shih CL. Modeling Randomness in Judging Rating Scales with a Random-Effects Rating Scale Model. JOURNAL OF EDUCATIONAL MEASUREMENT 2006. [DOI: 10.1111/j.1745-3984.2006.00020.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
73
|
Tuerlinckx F, Rijmen F, Verbeke G, De Boeck P. Statistical inference in generalized linear mixed models: a review. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2006; 59:225-55. [PMID: 17067411 DOI: 10.1348/000711005x79857] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
We present a review of statistical inference in generalized linear mixed models (GLMMs). GLMMs are an extension of generalized linear models and are suitable for the analysis of non-normal data with a clustered structure. A GLMM contains parameters common to all clusters (fixed regression effects and variance components) and cluster-specific parameters. The latter parameters are assumed to be randomly drawn from a population distribution. The parameters of this population distribution (the variance components) have to be estimated together with the fixed effects. We focus on the case in which the cluster-specific parameters are normally distributed. The cluster-specific effects are integrated out of the likelihood so that the fixed effects and variance components can be estimated. Unfortunately, the integral over the cluster-specific effects is intractable for most GLMMs with a normal mixing distribution. Within a classical statistical framework, we distinguish between two broad classes of methods to handle this intractable integral: methods that rely on a numerical approximation to the integral and methods that use an analytical approximation to the integrand. Finally, we present an overview of available methods for testing hypotheses about the parameters of GLMMs.
Collapse
|
74
|
Fieuws S, Verbeke G, Boen F, Delecluse C. High dimensional multivariate mixed models for binary questionnaire data. J R Stat Soc Ser C Appl Stat 2006. [DOI: 10.1111/j.1467-9876.2006.00546.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
75
|
Wang WC, Yao G, Tsai YJ, Wang JD, Hsieh CL. Validating, Improving Reliability, and Estimating Correlation of the Four Subscales in the WHOQOL-BREF using Multidimensional Rasch Analysis. Qual Life Res 2006; 15:607-20. [PMID: 16688494 DOI: 10.1007/s11136-005-4365-7] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/20/2005] [Indexed: 11/29/2022]
Abstract
OBJECTIVE This study examined the construct validity, and improved the test reliability and the estimation accuracy for the correlation between domains of the WHOQOL-BREF using multidimensional Rasch analysis. METHOD A total of 13,083 adults were administered the 28-item WHOQOL-BREF Taiwan version, which consists of 4 subscales (domains). The multidimensional form of the partial credit model was used to examine the fit of the 4 subscales. For comparison, each subscale individually was also fitted to the unidimensional partial credit model. Standard item fit statistics and analysis of differential item functioning (DIF) were used to check model-data fit. RESULTS After excluding 2 overall items and deleting 7 DIF items, the remaining items of each subscale in the WHOQOL-BREF constituted a single construct. The test reliabilities and correlations between domains obtained from the multidimensional approach, (0.82-0.86) and (0.79-0.89), respectively, were much higher than those obtained from the unidimensional approach, (0.67-0.75) and (0.53-0.65), respectively. CONCLUSION The 19-item WHOQOL-BREF measures more succinct latent traits than the original design. The multidimensional approach yields not only more accurate estimates for the correlation between domains but also substantially higher reliabilities, than the standard unidimensional approach.
Collapse
Affiliation(s)
- Wen-Chung Wang
- Department of Psychology, National Chung Cheng University, Chia-Yi, 621, Taiwan.
| | | | | | | | | |
Collapse
|
76
|
Sheu CF, Chen CT, Su YH, Wang WC. Using SAS PROC NLMIXED to fit item response theory models. Behav Res Methods 2005; 37:202-18. [PMID: 16171193 DOI: 10.3758/bf03192688] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Researchers routinely construct tests or questionnaires containing a set of items that measure personality traits, cognitive abilities, political attitudes, and so forth. Typically, responses to these items are scored in discrete categories, such as points on a Likert scale or a choice out of several mutually exclusive alternatives. Item response theory (IRT) explains observed responses to items on a test (questionnaire) by a person's unobserved trait, ability, or attitude. Although applications of IRT modeling have increased considerably because of its utility in developing and assessing measuring instruments, IRT modeling has not been fully integrated into the curriculum of colleges and universities, mainly because existing general purpose statistical packages do not provide built-in routines with which to perform IRT modeling. Recent advances in statistical theory and the incorporation of those advances into general purpose statistical software such as the Statistical Analysis System (SAS) allow researchers to analyze measurement data by using a class of models known as generalized linear mixed effects models (McCulloch & Searle, 2001), which include IRT models as special cases. The purpose of this article is to demonstrate the generality and flexibility of using SAS to estimate IRT model parameters. With real data examples, we illustrate the implementations of a variety of IRT models for dichotomous, polytomous, and nominal responses. Since SAS is widely available in educational institutions, it is hoped that this article will contribute to the spread of IRT modeling in quantitative courses.
Collapse
Affiliation(s)
- Ching-Fan Sheu
- Department of Psychology, DePaul University, 2219 North Kenmore Ave., Chicago, IL 60614-3522, USA.
| | | | | | | |
Collapse
|
77
|
Ritsner M, Kurs R, Gibel A, Ratner Y, Endicott J. Validity of an abbreviated quality of life enjoyment and satisfaction questionnaire (Q-LES-Q-18) for schizophrenia, schizoaffective, and mood disorder patients. Qual Life Res 2005; 14:1693-703. [PMID: 16119181 DOI: 10.1007/s11136-005-2816-9] [Citation(s) in RCA: 125] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
We sought to identify a core subset of Quality of Life Enjoyment and Satisfaction Questionnaire (Q-LES-Q) items that maintains the validity and psychometric properties of the basic version. A parsimonious subset of items from the Q-LES-Q that can accurately predict the basic Q-LES-Q domain mean scores was sought and evaluated in 339 inpatients meeting DSM-IV criteria for schizophrenia, schizoaffective, and mood disorders. Three additional data sets were used for validation. Assessments included Q-LES-Q, Quality of Life Scale, Lancashire Quality of Life Profile, rating scales for psychopathology, medication side effects, and self-reported emotional distress, self-esteem, self-efficacy, and social support. We found that 18-items predicted basic Q-LES-Q domains (physical health, subjective feelings, leisure time activities, social relationships) and general index scores with high accuracy. Q-LES-Q-18 showed high reliability, validity, and stability of test-retest ratings. Thus, Q-LES-Q-18, a brief, self-administered questionnaire may aid in monitoring quality of life outcomes of schizophrenia, schizoaffective, and mood disorder patients.
Collapse
|
78
|
Abstract
The article uses confirmatory factor analysis (CFA) as a template to explain didactically multilevel structural equation models (ML-SEM) and to demonstrate the equivalence of general mixed-effects models and ML-SEM. An intuitively appealing graphical representation of complex ML-SEMs is introduced that succinctly describes the underlying model and its assumptions. The use of definition variables (i.e., observed variables used to fix model parameters to individual specific data values) is extended to the case of ML-SEMs for clustered data with random slopes. Empirical examples of multilevel CFA and ML-SEM with random slopes are provided along with scripts for fitting such models in SAS Proc Mixed, Mplus, and Mx. Methodological issues regarding estimation of complex ML-SEMs and the evaluation of model fit are discussed. Further potential applications of ML-SEMs are explored.
Collapse
Affiliation(s)
- Paras D Mehta
- Texas Institute for Measurement Evaluation and Statistics, University of Houston
| | - Michael C Neale
- Virginia Institute of Psychiatric and Behavioral Genetics, Virginia Commonwealth University
| |
Collapse
|
79
|
Borkenau P, Egloff B, Eid M, Hennig J, Kersting M, Neubauer AC, Spinath FM. Persönlichkeitspsychologie: Stand und Perspektiven. PSYCHOLOGISCHE RUNDSCHAU 2005. [DOI: 10.1026/0033-3042.56.4.271] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Zusammenfassung. Es wird gezeigt, dass die Persönlichkeitspsychologie eine zur Zeit sehr dynamische sowie interdisziplinär stark vernetzte Teildisziplin der Psychologie ist. Dies geschieht exemplarisch anhand von sechs Forschungsfeldern: (a) Entwicklung statistischer und mathematischer Methoden (z.B. Strukturgleichungsmodelle, Mischverteilungsmodelle), welche an individuellen Unterschieden ansetzen und deren Analyse optimieren; (b) Implizite Assoziationstests als Ergänzung zu Selbstberichten im Rahmen der Persönlichkeitsmessung; (c) Forschungen zu den kognitiven und neuronalen Quellen individueller Unterschiede in der Intelligenz; (d) Forschungen zu biologischen Einflussfaktoren auf Temperamentsmerkmale; (e) quantitative und molekulare Verhaltensgenetik unter besonderer Berücksichtigung der Beziehung zwischen normaler Variation und psychischen Störungen; und (f) Vorhersage von Ausbildungs- und Berufserfolg.
Collapse
|
80
|
Abstract
A mixed-effects item response theory model that allows for three-level multivariate ordinal outcomes and accommodates multiple random subject effects is proposed for analysis of multivariate ordinal outcomes in longitudinal studies. This model allows for the estimation of different item factor loadings (item discrimination parameters) for the multiple outcomes. The covariates in the model do not have to follow the proportional odds assumption and can be at any level. Assuming either a probit or logistic response function, maximum marginal likelihood estimation is proposed utilizing multidimensional Gauss-Hermite quadrature for integration of the random effects. An iterative Fisher scoring solution, which provides standard errors for all model parameters, is used. An analysis of a longitudinal substance use data set, where four items of substance use behavior (cigarette use, alcohol use, marijuana use, and getting drunk or high) are repeatedly measured over time, is used to illustrate application of the proposed model.
Collapse
Affiliation(s)
- Li C Liu
- Institute for Health Research and Policy, University of Illinois at Chicago, 1747 W. Roosevelt Road, Room 558, M/C 275, Chicago, Illinois 60608, USA.
| | | |
Collapse
|
81
|
Ritsner M, Kurs R, Ratner Y, Gibel A. Condensed version of the Quality of Life Scale for schizophrenia for use in outcome studies. Psychiatry Res 2005; 135:65-75. [PMID: 15890413 DOI: 10.1016/j.psychres.2005.01.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/13/2004] [Revised: 12/26/2004] [Accepted: 01/15/2005] [Indexed: 10/25/2022]
Abstract
The Quality of Life Scale (QLS(21)) is widely used in clinical trials involving schizophrenia patients. This study aimed to identify a core subset of QLS(21) items that maintains the validity and psychometric properties of the complete version. A parsimonious subset of items from the QLS(21) that can accurately predict the total scale score was sought and evaluated in 133 schizophrenia patients, using the heuristic algorithm for a regression model. Two additional data sets were used for model validation: a subset of 124 patients who participated in the model construction and who completed the QLS(21) 1 year later as well as a new sample of 40 inpatients. Patients were examined with the Positive and Negative Syndrome Scale (PANSS), the Quality of Life Enjoyment and Satisfaction Questionnaire (Q-LES-Q), the Talbieh Brief Distress Inventory (TBDI), the Extrapyramidal Symptom Rating Scale (ESRS) and the Global Assessment of Functioning Scale (GAF). Using only five QLS items (social initiatives, adequacy, acquaintances, motivation, and time utilization; QLS(5)) as predictors, the correlation was 0.9805 between the predicted and true QLS totals. Two validation samples confirmed this finding. Additional analyses indicate that the QLS(5) exhibited similar performance to the QLS(21) regarding construct validity, test-retest reliability and responsiveness to changes over time. Thus, the five-item condensed Quality of Life Scale for schizophrenia maintains the validity of the full QLS, and has the advantage of shorter administration time. Utilization of the revised QLS(5) in routine care and clinical trials may potentially facilitate evaluation of treatment outcomes in schizophrenia.
Collapse
Affiliation(s)
- Michael Ritsner
- Pathological Behavior and Psychobiology Research Laboratory, Sha'ar Menashe Mental Health Center, Hadera, Israel.
| | | | | | | |
Collapse
|
82
|
De Boeck P, Wilson M, Acton GS. A Conceptual and Psychometric Framework for Distinguishing Categories and Dimensions. Psychol Rev 2005; 112:129-58. [PMID: 15631591 DOI: 10.1037/0033-295x.112.1.129] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
An important, sometimes controversial feature of all psychological phenomena is whether they are categorical or dimensional. A conceptual and psychometric framework is described for distinguishing whether the latent structure behind manifest categories (e.g., psychiatric diagnoses, attitude groups, or stages of development) is category-like or dimension-like. Being dimension-like requires (a) within-category heterogeneity and (b) between-category quantitative differences. Being category-like requires (a) within-category homogeneity and (b) between-category qualitative differences. The relation between this classification and abrupt versus smooth differences is discussed. Hybrid structures are possible. Being category-like is itself a matter of degree; the authors offer a formalized framework to determine this degree. Empirical applications to personality disorders, attitudes toward capital punishment, and stages of cognitive development illustrate the approach.
Collapse
Affiliation(s)
- Paul De Boeck
- Department of Psychology, K. U. Leuven, B-3000 Leuven, Belgium.
| | | | | |
Collapse
|
83
|
The personality triad in balance: Multidimensional individual differences in situation–behavior profiles. JOURNAL OF RESEARCH IN PERSONALITY 2004. [DOI: 10.1016/j.jrp.2003.08.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
84
|
Smits DJM, De Boeck P, Verhelst ND. Estimation of the MIRID: A program and a SAS-based approach. ACTA ACUST UNITED AC 2003; 35:537-49. [PMID: 14748498 DOI: 10.3758/bf03195533] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The MIRID CML program is a program for the estimation of the parameter values of two different componential IRT models: the Rasch-MIRID and the OPLM-MIRID (Butter, 1994; Butter, De Boeck, & Verhelst, 1998). To estimate the parameters of both models, the program uses a CML approach. The model parameters can also be estimated with a MML approach that can be implemented in PROC NLMIXED of SAS Version 8. Both the MIRID CML program and the MML SAS approach are explained and compared in a simulation study. The results showed that they did about equally well in estimating the values of the item parameters but that there were some differences in the estimation of the person parameters, as could be expected from the differential assumptions regarding the distribution of the persons. The SAS MML approach is much slower than the MIRID CML program, but it is more flexible.
Collapse
Affiliation(s)
- Dirk J M Smits
- Department of Psychology, Katholieke Universiteit Leuven, Leuven, Belgium.
| | | | | |
Collapse
|