New Paradigm of Identifiable General-response Cognitive Diagnostic Models: Beyond Categorical Data

被引:1
作者
Lee, Seunghyun [1 ]
Gu, Yuqi [1 ]
机构
[1] Columbia Univ, New York, NY USA
关键词
cognitive diagnostic model; diagnostic classification model; EM algorithm; exponential family; general responses; generalized linear model; identifiability; Q-matrix; TIMES; FRAMEWORK;
D O I
10.1007/s11336-024-09983-4
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cognitive diagnostic models (CDMs) are a popular family of discrete latent variable models that model students' mastery or deficiency of multiple fine-grained skills. CDMs have been most widely used to model categorical item response data such as binary or polytomous responses. With advances in technology and the emergence of varying test formats in modern educational assessments, new response types, including continuous responses such as response times, and count-valued responses from tests with repetitive tasks or eye-tracking sensors, have also become available. Variants of CDMs have been proposed recently for modeling such responses. However, whether these extended CDMs are identifiable and estimable is entirely unknown. We propose a very general cognitive diagnostic modeling framework for arbitrary types of multivariate responses with minimal assumptions, and establish identifiability in this general setting. Surprisingly, we prove that our general-response CDMs are identifiable under Q\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\textbf{Q}}$$\end{document}-matrix-based conditions similar to those for traditional categorical-response CDMs. Our conclusions set up a new paradigm of identifiable general-response CDMs. We propose an EM algorithm to efficiently estimate a broad class of exponential family-based general-response CDMs. We conduct simulation studies under various response types. The simulation results not only corroborate our identifiability theory, but also demonstrate the superior empirical performance of our estimation algorithms. We illustrate our methodology by applying it to a TIMSS 2019 response time dataset.
引用
收藏
页码:1304 / 1336
页数:33
相关论文
共 62 条
  • [1] IDENTIFIABILITY OF PARAMETERS IN LATENT STRUCTURE MODELS WITH MANY OBSERVED VARIABLES
    Allman, Elizabeth S.
    Matias, Catherine
    Rhode, John A.
    [J]. ANNALS OF STATISTICS, 2009, 37 (6A) : 3099 - 3132
  • [2] Casella G., 2021, Statistical Inference
  • [3] Bayesian Estimation of the DINA Q matrix
    Chen, Yinghan
    Culpepper, Steven Andrew
    Chen, Yuguo
    Douglas, Jeffrey
    [J]. PSYCHOMETRIKA, 2018, 83 (01) : 89 - 108
  • [4] A Sparse Latent Class Model for Cognitive Diagnosis
    Chen, Yinyin
    Culpepper, Steven
    Liang, Feng
    [J]. PSYCHOMETRIKA, 2020, 85 (01) : 121 - 153
  • [5] Statistical Analysis of Q-Matrix Based Diagnostic Classification Models
    Chen, Yunxiao
    Liu, Jingchen
    Xu, Gongjun
    Ying, Zhiliang
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) : 850 - 866
  • [6] A Note on Weaker Conditions for Identifying Restricted Latent Class Models for Binary Responses
    Culpepper, Steven Andrew
    [J]. PSYCHOMETRIKA, 2023, 88 (01) : 158 - 174
  • [7] An Exploratory Diagnostic Model for Ordinal Responses with Binary Attributes: Identifiability and Estimation
    Culpepper, Steven Andrew
    [J]. PSYCHOMETRIKA, 2019, 84 (04) : 921 - 940
  • [8] An Overview of Models for Response Times and Processes in Cognitive Tests
    De Boeck, Paul
    Jeon, Minjeong
    [J]. FRONTIERS IN PSYCHOLOGY, 2019, 10
  • [9] Higher-order latent trait models for cognitive diagnosis
    De la Torre, J
    Douglas, JA
    [J]. PSYCHOMETRIKA, 2004, 69 (03) : 333 - 353
  • [10] The Generalized DINA Model Framework
    de la Torre, Jimmy
    [J]. PSYCHOMETRIKA, 2011, 76 (02) : 179 - 199