1
|
Liu Y, Culpepper SA. Restricted Latent Class Models for Nominal Response Data: Identifiability and Estimation. PSYCHOMETRIKA 2024; 89:592-625. [PMID: 38114767 DOI: 10.1007/s11336-023-09940-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 11/15/2023] [Indexed: 12/21/2023]
Abstract
Restricted latent class models (RLCMs) provide an important framework for diagnosing and classifying respondents on a collection of multivariate binary responses. Recent research made significant advances in theory for establishing identifiability conditions for RLCMs with binary and polytomous response data. Multiclass data, which are unordered nominal response data, are also widely collected in the social sciences and psychometrics via forced-choice inventories and multiple choice tests. We establish new identifiability conditions for parameters of RLCMs for multiclass data and discuss the implications for substantive applications. The new identifiability conditions are applicable to a wealth of RLCMs for polytomous and nominal response data. We propose a Bayesian framework for inferring model parameters, assess parameter recovery in a Monte Carlo simulation study, and present an application of the model to a real dataset.
Collapse
Affiliation(s)
- Ying Liu
- Department of Statistics, University of Illinois at Urbana-Champaign, Computing Applications Building, Room 152, 605 E. Springfield Ave., Champaign, IL, 61820, USA
| | - Steven Andrew Culpepper
- Department of Statistics, University of Illinois at Urbana-Champaign, Computing Applications Building, Room 152, 605 E. Springfield Ave., Champaign, IL, 61820, USA.
| |
Collapse
|
2
|
Lin M, Xu G. Sufficient and Necessary Conditions for the Identifiability of DINA Models with Polytomous Responses. PSYCHOMETRIKA 2024; 89:717-740. [PMID: 38517594 DOI: 10.1007/s11336-024-09961-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Indexed: 03/24/2024]
Abstract
Cognitive diagnosis models (CDMs) provide a powerful statistical and psychometric tool for researchers and practitioners to learn fine-grained diagnostic information about respondents' latent attributes. There has been a growing interest in the use of CDMs for polytomous response data, as more and more items with multiple response options become widely used. Similar to many latent variable models, the identifiability of CDMs is critical for accurate parameter estimation and valid statistical inference. However, the existing identifiability results are primarily focused on binary response models and have not adequately addressed the identifiability of CDMs with polytomous responses. This paper addresses this gap by presenting sufficient and necessary conditions for the identifiability of the widely used DINA model with polytomous responses, with the aim to provide a comprehensive understanding of the identifiability of CDMs with polytomous responses and to inform future research in this field.
Collapse
Affiliation(s)
- Mengqi Lin
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, 48109, MI, USA
| | - Gongjun Xu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, 48109, MI, USA.
| |
Collapse
|
3
|
Xu X, Fang G, Guo J, Ying Z, Zhang S. Diagnostic Classification Models for Testlets: Methods and Theory. PSYCHOMETRIKA 2024:10.1007/s11336-024-09962-9. [PMID: 38528268 DOI: 10.1007/s11336-024-09962-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Indexed: 03/27/2024]
Abstract
Diagnostic classification models (DCMs) have seen wide applications in educational and psychological measurement, especially in formative assessment. DCMs in the presence of testlets have been studied in recent literature. A key ingredient in the statistical modeling and analysis of testlet-based DCMs is the superposition of two latent structures, the attribute profile and the testlet effect. This paper extends the standard testlet DINA (T-DINA) model to accommodate the potential correlation between the two latent structures. Model identifiability is studied and a set of sufficient conditions are proposed. As a byproduct, the identifiability of the standard T-DINA is also established. The proposed model is applied to a dataset from the 2015 Programme for International Student Assessment. Comparisons are made with DINA and T-DINA, showing that there is substantial improvement in terms of the goodness of fit. Simulations are conducted to assess the performance of the new method under various settings.
Collapse
Affiliation(s)
- Xin Xu
- Beijing Normal University, Beijing, China
| | | | - Jinxin Guo
- Minzu University of China, Beijing, China
| | | | - Susu Zhang
- University of Illinois Urbana-Champaign, Champaign, USA.
| |
Collapse
|
4
|
Gu Y. Going Deep in Diagnostic Modeling: Deep Cognitive Diagnostic Models (DeepCDMs). PSYCHOMETRIKA 2024; 89:118-150. [PMID: 38079062 DOI: 10.1007/s11336-023-09941-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 11/15/2023] [Indexed: 05/02/2024]
Abstract
Cognitive diagnostic models (CDMs) are discrete latent variable models popular in educational and psychological measurement. In this work, motivated by the advantages of deep generative modeling and by identifiability considerations, we propose a new family of DeepCDMs, to hunt for deep discrete diagnostic information. The new class of models enjoys nice properties of identifiability, parsimony, and interpretability. Mathematically, DeepCDMs are entirely identifiable, including even fully exploratory settings and allowing to uniquely identify the parameters and discrete loading structures (the " Q -matrices") at all different depths in the generative model. Statistically, DeepCDMs are parsimonious, because they can use a relatively small number of parameters to expressively model data thanks to the depth. Practically, DeepCDMs are interpretable, because the shrinking-ladder-shaped deep architecture can capture cognitive concepts and provide multi-granularity skill diagnoses from coarse to fine grained and from high level to detailed. For identifiability, we establish transparent identifiability conditions for various DeepCDMs. Our conditions impose intuitive constraints on the structures of the multiple Q -matrices and inspire a generative graph with increasingly smaller latent layers when going deeper. For estimation and computation, we focus on the confirmatory setting with known Q -matrices and develop Bayesian formulations and efficient Gibbs sampling algorithms. Simulation studies and an application to the TIMSS 2019 math assessment data demonstrate the usefulness of the proposed methodology.
Collapse
Affiliation(s)
- Yuqi Gu
- Department of Statistics, Columbia University, Room 928 SSW, 1255 Amsterdam Avenue, New York, NY, 10027, USA.
| |
Collapse
|
5
|
Jimenez A, Balamuta JJ, Culpepper SA. A sequential exploratory diagnostic model using a Pólya-gamma data augmentation strategy. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2023; 76:513-538. [PMID: 37786373 DOI: 10.1111/bmsp.12307] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/17/2023] [Accepted: 04/11/2023] [Indexed: 10/04/2023]
Abstract
Cognitive diagnostic models provide a framework for classifying individuals into latent proficiency classes, also known as attribute profiles. Recent research has examined the implementation of a Pólya-gamma data augmentation strategy binary response model using logistic item response functions within a Bayesian Gibbs sampling procedure. In this paper, we propose a sequential exploratory diagnostic model for ordinal response data using a logit-link parameterization at the category level and extend the Pólya-gamma data augmentation strategy to ordinal response processes. A Gibbs sampling procedure is presented for efficient Markov chain Monte Carlo (MCMC) estimation methods. We provide results from a Monte Carlo study for model performance and present an application of the model.
Collapse
Affiliation(s)
- Auburn Jimenez
- Department of Psychology, University of Illinois Urbana-Champaign, Champaign, Illinois, USA
| | - James Joseph Balamuta
- Departments of Informatics and Statistics, University of Illinois Urbana-Champaign, Champaign, Illinois, USA
| | - Steven Andrew Culpepper
- Department of Statistics, University of Illinois at Urbana-Champaign, Champaign, Illinois, USA
| |
Collapse
|
6
|
Yigit HD, Culpepper SA. Extending exploratory diagnostic classification models: Inferring the effect of covariates. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2023; 76:372-401. [PMID: 36601975 DOI: 10.1111/bmsp.12298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 11/24/2022] [Indexed: 06/17/2023]
Abstract
Diagnostic models provide a statistical framework for designing formative assessments by classifying student knowledge profiles according to a collection of fine-grained attributes. The context and ecosystem in which students learn may play an important role in skill mastery, and it is therefore important to develop methods for incorporating student covariates into diagnostic models. Including covariates may provide researchers and practitioners with the ability to evaluate novel interventions or understand the role of background knowledge in attribute mastery. Existing research is designed to include covariates in confirmatory diagnostic models, which are also known as restricted latent class models. We propose new methods for including covariates in exploratory RLCMs that jointly infer the latent structure and evaluate the role of covariates on performance and skill mastery. We present a novel Bayesian formulation and report a Markov chain Monte Carlo algorithm using a Metropolis-within-Gibbs algorithm for approximating the model parameter posterior distribution. We report Monte Carlo simulation evidence regarding the accuracy of our new methods and present results from an application that examines the role of student background knowledge on the mastery of a probability data set.
Collapse
|
7
|
Gu Y. Generic Identifiability of the DINA Model and Blessing of Latent Dependence. PSYCHOMETRIKA 2023; 88:117-131. [PMID: 36167947 DOI: 10.1007/s11336-022-09886-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Revised: 04/20/2022] [Indexed: 06/16/2023]
Abstract
Cognitive diagnostic models are a powerful family of fine-grained discrete latent variable models in psychometrics. Within this family, the DINA model is a fundamental and parsimonious one that has received significant attention. Similar to other complex latent variable models, identifiability is an important issue for CDMs, including the DINA model. Gu and Xu (Psychometrika 84(2):468-483, 2019) established the necessary and sufficient conditions for strict identifiability of the DINA model. Despite being the strongest possible notion of identifiability, strict identifiability may impose overly stringent requirements on designing the cognitive diagnostic tests. This work studies a slightly weaker yet very useful notion, generic identifiability, which means parameters are identifiable almost everywhere in the parameter space, excluding only a negligible subset of measure zero. We propose transparent generic identifiability conditions for the DINA model, relaxing existing conditions in nontrivial ways. Under generic identifiability, we also explicitly characterize the forms of the measure-zero sets where identifiability breaks down. In addition, we reveal an interesting blessing-of-latent-dependence phenomenon under DINA-that is, dependence between the latent attributes can restore identifiability under some otherwise unidentifiable [Formula: see text]-matrix designs. The blessing of latent dependence provides useful practical implications and reassurance for real-world designs of cognitive diagnostic assessments.
Collapse
Affiliation(s)
- Yuqi Gu
- Department of Statistics, Columbia University, Room 928 SSW, 1255 Amsterdam Avenue, New York, NY, 10027, USA.
| |
Collapse
|
8
|
Culpepper SA, Balamuta JJ. Inferring Latent Structure in Polytomous Data with a Higher-Order Diagnostic Model. MULTIVARIATE BEHAVIORAL RESEARCH 2023; 58:368-386. [PMID: 34699299 DOI: 10.1080/00273171.2021.1985949] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Researchers continue to develop and advance models for diagnostic research in the social and behavioral sciences. These diagnostic models (DMs) provide researchers with a framework for providing a fine-grained classification of respondents into substantively meaningful latent classes as defined by a multivariate collection of binary attributes. A central concern for DMs is advancing exploratory methods for uncovering the latent structure, which corresponds with the relationship between unobserved binary attributes and observed polytomous items with two or more response options. Multivariate behavioral polytomous data are often collected within a higher-order design where general factors underlying first-order latent variables. This study advances existing exploratory DMs for polytomous data by proposing a new method for inferring the latent structure underlying polytomous response data using a higher-order model to describe dependence among the discrete latent attributes. We report a novel Bayesian formulation that uses variable selection techniques for inferring the latent structure along with a higher-order factor model for attributes. We report evidence of accurate parameter recovery in a Monte Carlo simulation study and present results from an application to the 2012 Programme for International Student Assessment (PISA) problem-solving vignettes to demonstrate the method.
Collapse
Affiliation(s)
| | - James J Balamuta
- Departments of Informatics and Statistics, University of Illinois at Urbana-Champaign
| |
Collapse
|
9
|
Culpepper SA. A Note on Weaker Conditions for Identifying Restricted Latent Class Models for Binary Responses. PSYCHOMETRIKA 2023; 88:158-174. [PMID: 35896935 DOI: 10.1007/s11336-022-09875-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Restricted latent class models (RLCMs) are an important class of methods that provide researchers and practitioners in the educational, psychological, and behavioral sciences with fine-grained diagnostic information to guide interventions. Recent research established sufficient conditions for identifying RLCM parameters. A current challenge that limits widespread application of RLCMs is that existing identifiability conditions may be too restrictive for some practical settings. In this paper we establish a weaker condition for identifying RLCM parameters for multivariate binary data. Although the new results weaken identifiability conditions for general RLCMs, the new results do not relax existing necessary and sufficient conditions for the simpler DINA/DINO models. Theoretically, we introduce a new form of latent structure completeness, referred to as dyad-completeness, and prove identification by applying Kruskal's Theorem for the uniqueness of three-way arrays. The new condition is more likely satisfied in applied research, and the results provide researchers and test-developers with guidance for designing diagnostic instruments.
Collapse
Affiliation(s)
- Steven Andrew Culpepper
- Department of Statistics, University of Illinois at Urbana-Champaign, 605 E Springfield Ave, Champaign, IL61820, USA.
| |
Collapse
|
10
|
Ma C, Ouyang J, Xu G. Learning Latent and Hierarchical Structures in Cognitive Diagnosis Models. PSYCHOMETRIKA 2023; 87:1343-1360. [PMID: 35596101 DOI: 10.1007/s11336-022-09852-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 10/16/2021] [Accepted: 04/07/2022] [Indexed: 05/28/2023]
Abstract
Cognitive Diagnosis Models (CDMs) are a special family of discrete latent variable models that are widely used in educational and psychological measurement. A key component of CDMs is the Q-matrix characterizing the dependence structure between the items and the latent attributes. Additionally, researchers also assume in many applications certain hierarchical structures among the latent attributes to characterize their dependence. In most CDM applications, the attribute-attribute hierarchical structures, the item-attribute Q-matrix, the item-level diagnostic models, as well as the number of latent attributes, need to be fully or partially pre-specified, which however may be subjective and misspecified as noted by many recent studies. This paper considers the problem of jointly learning these latent and hierarchical structures in CDMs from observed data with minimal model assumptions. Specifically, a penalized likelihood approach is proposed to select the number of attributes and estimate the latent and hierarchical structures simultaneously. An expectation-maximization (EM) algorithm is developed for efficient computation, and statistical consistency theory is also established under mild conditions. The good performance of the proposed method is illustrated by simulation studies and real data applications in educational assessment.
Collapse
Affiliation(s)
- Chenchen Ma
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA
| | - Jing Ouyang
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA
| | - Gongjun Xu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
11
|
Oka M, Okada K. Scalable Bayesian Approach for the Dina Q-Matrix Estimation Combining Stochastic Optimization and Variational Inference. PSYCHOMETRIKA 2023; 88:302-331. [PMID: 36097246 DOI: 10.1007/s11336-022-09884-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 05/20/2022] [Accepted: 08/03/2022] [Indexed: 06/15/2023]
Abstract
Diagnostic classification models offer statistical tools to inspect the fined-grained attribute of respondents' strengths and weaknesses. However, the diagnosis accuracy deteriorates when misspecification occurs in the predefined item-attribute relationship, which is encoded into a Q-matrix. To prevent such misspecification, methodologists have recently developed several Bayesian Q-matrix estimation methods for greater estimation flexibility. However, these methods become infeasible in the case of large-scale assessments with a large number of attributes and items. In this study, we focused on the deterministic inputs, noisy "and" gate (DINA) model and proposed a new framework for the Q-matrix estimation to find the Q-matrix with the maximum marginal likelihood. Based on this framework, we developed a scalable estimation algorithm for the DINA Q-matrix by constructing an iteration algorithm that utilizes stochastic optimization and variational inference. The simulation and empirical studies reveal that the proposed method achieves high-speed computation, good accuracy, and robustness to potential misspecifications, such as initial value choices and hyperparameter settings. Thus, the proposed method can be a useful tool for estimating a Q-matrix in large-scale settings.
Collapse
Affiliation(s)
- Motonori Oka
- Graduate School of Education, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan.
| | - Kensuke Okada
- Graduate School of Education, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
12
|
Yamaguchi K, Templin J. Direct Estimation of Diagnostic Classification Model Attribute Mastery Profiles via a Collapsed Gibbs Sampling Algorithm. PSYCHOMETRIKA 2022; 87:1390-1421. [PMID: 35426059 DOI: 10.1007/s11336-022-09857-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Revised: 11/05/2021] [Indexed: 06/14/2023]
Abstract
This paper proposes a novel collapsed Gibbs sampling algorithm that marginalizes model parameters and directly samples latent attribute mastery patterns in diagnostic classification models. This estimation method makes it possible to avoid boundary problems in the estimation of model item parameters by eliminating the need to estimate such parameters. A simulation study showed the collapsed Gibbs sampling algorithm can accurately recover the true attribute mastery status in various conditions. A second simulation showed the collapsed Gibbs sampling algorithm was computationally more efficient than another MCMC sampling algorithm, implemented by JAGS. In an analysis of real data, the collapsed Gibbs sampling algorithm indicated good classification agreement with results from a previous study.
Collapse
Affiliation(s)
- Kazuhiro Yamaguchi
- , Iowa City, USA.
- Division of Psychology, Faculty of Human Sciences, University of Tsukuba, Institutes of Human Sciences A314, 1-1-1 Tennodai, Tsukuba, Ibaraki, 3050006, Japan.
| | | |
Collapse
|
13
|
Wang J, Shi N, Zhang X, Xu G. Sequential Gibbs Sampling Algorithm for Cognitive Diagnosis Models with Many Attributes. MULTIVARIATE BEHAVIORAL RESEARCH 2022; 57:840-858. [PMID: 33755507 DOI: 10.1080/00273171.2021.1896352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Cognitive diagnosis models (CDMs) are useful statistical tools to provide rich information relevant for intervention and learning. As a popular approach to estimate and make inference of CDMs, the Markov chain Monte Carlo (MCMC) algorithm is widely used in practice. However, when the number of attributes, K, is large, the existing MCMC algorithm may become time-consuming, due to the fact that O(2K) calculations are usually needed in the process of MCMC sampling to get the conditional distribution for each attribute profile. To overcome this computational issue, motivated by Culpepper and Hudson's earlier work in 2018, we propose a computationally efficient sequential Gibbs sampling method, which needs O(K) calculations to sample each attribute profile. We use simulation and real data examples to show the good finite-sample performance of the proposed sequential Gibbs sampling, and its advantage over existing methods.
Collapse
Affiliation(s)
- Juntao Wang
- Department of Statistics, KLAS, School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| | - Ningzhong Shi
- Department of Statistics, KLAS, School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| | - Xue Zhang
- China Institute of Rural Education Development, Northeast Normal University, Changchun, China
| | - Gongjun Xu
- Department of Statistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
14
|
Balamuta JJ, Culpepper SA. Exploratory Restricted Latent Class Models with Monotonicity Requirements under PÒLYA-GAMMA Data Augmentation. PSYCHOMETRIKA 2022; 87:903-945. [PMID: 35023017 DOI: 10.1007/s11336-021-09815-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 09/17/2021] [Indexed: 06/14/2023]
Abstract
Restricted latent class models (RLCMs) provide an important framework for supporting diagnostic research in education and psychology. Recent research proposed fully exploratory methods for inferring the latent structure. However, prior research is limited by the use of restrictive monotonicity condition or prior formulations that are unable to incorporate prior information about the latent structure to validate expert knowledge. We develop new methods that relax existing monotonicity restrictions and provide greater insight about the latent structure. Furthermore, existing Bayesian methods only use a probit link function and we provide a new formulation for using the exploratory RLCM with a logit link function that has an additional advantage of being computationally more efficient for larger sample sizes. We present four new Bayesian formulations that employ different link functions (i.e., the logit using the Pòlya-gamma data augmentation versus the probit) and priors for inducing sparsity in the latent structure. We report Monte Carlo simulation studies to demonstrate accurate parameter recovery. Furthermore, we report results from an application to the Last Series of the Standard Progressive Matrices to illustrate our new methods.
Collapse
Affiliation(s)
- James Joseph Balamuta
- Departments of Informatics and Statistics, University of Illinois Urbana-Champaign, 725 South Wright Street, Champaign, IL, 61820, USA.
| | | |
Collapse
|
15
|
Zhou S, Traynor A. Measuring students’ learning progressions in energy using cognitive diagnostic models. Front Psychol 2022; 13:892884. [PMID: 36017436 PMCID: PMC9396370 DOI: 10.3389/fpsyg.2022.892884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 06/29/2022] [Indexed: 11/13/2022] Open
Abstract
This study applied cognitive diagnostic models to assess students’ learning progressions in energy. A Q-matrix (i.e., an item attribute alignment table) was proposed based on existing literature about learning progressions of energy in the physical science domain and the Trends in International Mathematics and Science Study (TIMSS) assessment framework. The Q-matrix was validated by expert review and real data analysis. Then, the deterministic inputs, noisy ‘and’ gate (DINA) model with hierarchical relations was applied to data from three jurisdictions that had stable, defined science curricula (i.e., Australia, Hong Kong, and Ontario). The results suggested that the hypothesized learning progression was consistent with the observed progression in understanding the energy concept. We also found similarities in students’ attribute mastery across the three jurisdictions. In addition, we examined the instructional sensitivity of the selected item. We discuss several curriculum-related issues and student misconceptions that may affect students’ learning progressions and mastery patterns in different regions of the world.
Collapse
Affiliation(s)
- Shuqi Zhou
- College of Foreign Languages, Donghua University, Shanghai, China
- *Correspondence: Shuqi Zhou, ;
| | - Anne Traynor
- Department of Educational Studies, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
16
|
A multiple logistic regression-based (MLR-B) Q-matrix validation method for cognitive diagnosis models:A confirmatory approach. Behav Res Methods 2022:10.3758/s13428-022-01880-x. [PMID: 35819718 DOI: 10.3758/s13428-022-01880-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/17/2022] [Indexed: 11/08/2022]
Abstract
Q-matrix is an essential component specifying the relationship between attributes and items, which plays a key role in cognitive diagnosis assessment. The Q-matrix is usually developed by domain experts and its specifications tend to be subjective and might have misspecifications. Many existing pieces of research concentrate on the validation of Q-matrix; however, few of them can be applied to saturated cognitive diagnosis models. This paper proposes a general and effective Q-matrix validation method by employing multiple logistic regression model. Simulation studies are carried out to investigate the performance of the proposed method and compare it with four existing methods. Simulation results indicate the proposed method outperforms the existing methods in terms of validation accuracy. In addition, a set of real data is used as an example to illustrate its application. Finally, we discuss the limitations of the current study and the directions of future studies.
Collapse
|
17
|
Ma W. A Higher-Order Cognitive Diagnosis Model with Ordinal Attributes for Dichotomous Response Data. MULTIVARIATE BEHAVIORAL RESEARCH 2022; 57:408-421. [PMID: 33434081 DOI: 10.1080/00273171.2020.1860731] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Most existing cognitive diagnosis models (CDMs) assume attributes are binary latent variables, which may be oversimplified in practice. This article introduces a higher-order CDM with ordinal attributes for dichotomous response data. The proposed model can either incorporate domain experts' knowledge or learn from the data empirically by regularizing model parameters. A sequential item response model was employed for joint attribute distribution to accommodate the sequential mastery mechanism. The expectation-maximization algorithm was employed for model estimation, and a simulation study was conducted to assess the recovery of model parameters. A set of real data was also analyzed to assess the viability of the proposed model in practice.
Collapse
|
18
|
Chen Y, Liu Y, Culpepper SA, Chen Y. Inferring the Number of Attributes for the Exploratory DINA Model. PSYCHOMETRIKA 2021; 86:30-64. [PMID: 33751367 DOI: 10.1007/s11336-021-09750-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 02/15/2021] [Accepted: 02/19/2021] [Indexed: 05/28/2023]
Abstract
Diagnostic classification models (DCMs) are widely used for providing fine-grained classification of a multidimensional collection of discrete attributes. The application of DCMs requires the specification of the latent structure in what is known as the [Formula: see text] matrix. Expert-specified [Formula: see text] matrices might be biased and result in incorrect diagnostic classifications, so a critical issue is developing methods to estimate [Formula: see text] in order to infer the relationship between latent attributes and items. Existing exploratory methods for estimating [Formula: see text] must pre-specify the number of attributes, K. We present a Bayesian framework to jointly infer the number of attributes K and the elements of [Formula: see text]. We propose the crimp sampling algorithm to transit between different dimensions of K and estimate the underlying [Formula: see text] and model parameters while enforcing model identifiability constraints. We also adapt the Indian buffet process and reversible-jump Markov chain Monte Carlo methods to estimate [Formula: see text]. We report evidence that the crimp sampler performs the best among the three methods. We apply the developed methodology to two data sets and discuss the implications of the findings for future research.
Collapse
Affiliation(s)
- Yinghan Chen
- Department of Mathematics and Statistics, University of Nevada, Reno, 1664 N. Virginia Street, Reno, NV, 89557, USA
| | - Ying Liu
- Department of Statistics, University of Illinois at Urbana-Champaign, 725 South Wright Street, Champaign, IL , 61820, USA
| | - Steven Andrew Culpepper
- Department of Statistics, University of Illinois at Urbana-Champaign, 725 South Wright Street, Champaign, IL , 61820, USA.
| | - Yuguo Chen
- Department of Statistics, University of Illinois at Urbana-Champaign, 725 South Wright Street, Champaign, IL , 61820, USA
| |
Collapse
|
19
|
Zhang X, Wang J. On the Sequential Hierarchical Cognitive Diagnostic Model. Front Psychol 2020; 11:579018. [PMID: 33117245 PMCID: PMC7577049 DOI: 10.3389/fpsyg.2020.579018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/04/2020] [Indexed: 01/30/2023] Open
Abstract
Model data fit plays an important role in any statistical analysis, and the primary goal is to detect the preferred model based on certain criteria. Under the cognitive diagnostic assessment (CDA) framework, a family of sequential cognitive diagnostic models (CDMs) is introduced to handle polytomously scored data, which are attained by answering constructed-response items sequentially. The presence of attribute hierarchies, which can provide useful information about the nature of attributes, will help understand the relation between attributes and response categories. This article introduces the sequential hierarchical CDM (SH-CDM), which adapts the sequential CDM to deal with attribute hierarchy. Furthermore, model fit analysis for SH-CDMs is assessed using eight model fit indices (i.e., three absolute fit indices and five relative fit indices). Two misfit sources were focused; that is, misspecifying attribute structures and misfitting processing functions. The performances of those indices were evaluated via Monte Carlo simulation studies and a real data illustration.
Collapse
Affiliation(s)
- Xue Zhang
- China Institute of Rural Education Development, Northeast Normal University, Changchun, China
| | - Juntao Wang
- School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| |
Collapse
|
20
|
Lin Q, Xing K, Park YS. Measuring Skill Growth and Evaluating Change: Unconditional and Conditional Approaches to Latent Growth Cognitive Diagnostic Models. Front Psychol 2020; 11:2205. [PMID: 33041889 PMCID: PMC7517785 DOI: 10.3389/fpsyg.2020.02205] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 08/05/2020] [Indexed: 01/23/2023] Open
Abstract
During the past decade, cognitive diagnostic models (CDMs) have become prevalent in providing diagnostic information for learning. Cognitive diagnostic models have generally focused on single cross-sectional time points. However, longitudinal assessments have been commonly used in education to assess students’ learning progress as well as evaluating intervention effects. Thus, it becomes natural to identify longitudinal growth in skills profiles mastery, which can yield meaningful inferences on learning. This study proposes longitudinal CDMs that incorporate latent growth curve modeling and covariate extensions, with the aim to measure the growth of skills mastery and to evaluate attribute-level intervention effects over time. Using real-world data, this study demonstrates applications of unconditional and conditional latent growth CDMs. Simulation studies show stable parameter recovery and classification of latent classes for different sample sizes. These findings suggest that building on the well-established growth modeling frameworks, applications of covariate-based longitudinal CDM can help understand the effect of explanatory factors and intervention on the change of attribute mastery.
Collapse
Affiliation(s)
- Qiao Lin
- Department of Educational Psychology, University of Illinois at Chicago, Chicago, IL, United States
| | - Kuan Xing
- University of Tennessee Health Science Center, Memphis, TN, United States
| | - Yoon Soo Park
- Department of Medical Education, University of Illinois at Chicago, Chicago, IL, United States
| |
Collapse
|
21
|
Liu CW, Andersson B, Skrondal A. A Constrained Metropolis-Hastings Robbins-Monro Algorithm for Q Matrix Estimation in DINA Models. PSYCHOMETRIKA 2020; 85:322-357. [PMID: 32632838 DOI: 10.1007/s11336-020-09707-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Revised: 05/07/2020] [Indexed: 06/11/2023]
Abstract
In diagnostic classification models (DCMs), the Q matrix encodes in which attributes are required for each item. The Q matrix is usually predetermined by the researcher but may in practice be misspecified which yields incorrect statistical inference. Instead of using a predetermined Q matrix, it is possible to estimate it simultaneously with the item and structural parameters of the DCM. Unfortunately, current methods are computationally intensive when there are many attributes and items. In addition, the identification constraints necessary for DCMs are not always enforced in the estimation algorithms which can lead to non-identified models being considered. We address these problems by simultaneously estimating the item, structural and Q matrix parameters of the Deterministic Input Noisy "And" gate model using a constrained Metropolis-Hastings Robbins-Monro algorithm. Simulations show that the new method is computationally efficient and can outperform previously proposed Bayesian Markov chain Monte-Carlo algorithms in terms of Q matrix recovery, and item and structural parameter estimation. We also illustrate our approach using Tatsuoka's fraction-subtraction data and Certificate of Proficiency in English data.
Collapse
Affiliation(s)
- Chen-Wei Liu
- Department of Educational Psychology and Counseling, National Taiwan Normal University, 162, Section 1, Heping E. Road, 10610, Taipei, Taiwan.
| | - Björn Andersson
- Centre for Educational Measurement, University of Oslo, Oslo, Norway
| | - Anders Skrondal
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Centre for Educational Measurement, University of Oslo, Oslo, Norway
- Graduate School of Education, University of California, Berkeley, Berkeley, CA, USA
| |
Collapse
|