Incremental Model Fit Assessment in the Case of Categorical Data: Tucker-Lewis Index for Item Response Theory Modeling

被引:37
作者
Cai, Li [1 ]
Chung, Seung Won [2 ]
Lee, Taehun [3 ]
机构
[1] Univ Calif Los Angeles, UCLA CRESST, 315 GSEIS Bldg, Los Angeles, CA 90095 USA
[2] Univ Minnesota Twin Cities, Minneapolis, MN USA
[3] Chung Ang Univ, Seoul, South Korea
关键词
Categorical data analysis; Model evaluation; Item response theory; Goodness of fit; Limited-information testing; TLI; GOODNESS-OF-FIT; LIMITED-INFORMATION;
D O I
10.1007/s11121-021-01253-4
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The Tucker-Lewis index (TLI; Tucker & Lewis, 1973), also known as the non-normed fit index (NNFI; Bentler & Bonett, 1980), is one of the numerous incremental fit indices widely used in linear mean and covariance structure modeling, particularly in exploratory factor analysis, tools popular in prevention research. It augments information provided by other indices such as the root-mean-square error of approximation (RMSEA). In this paper, we develop and examine an analogous index for categorical item level data modeled with item response theory (IRT). The proposed Tucker-Lewis index for IRT (TLIRT) is based on Maydeu-Olivares and Joe's (2005) M-2 family of limited-information overall model fit statistics. The limited-information fit statistics have significantly better Chi-square approximation and power than traditional full-information Pearson or likelihood ratio statistics under realistic situations. Building on the incremental fit assessment principle, the TLIRT compares the fit of model under consideration along a spectrum of worst to best possible model fit scenarios. We examine the performance of the new index using simulated and empirical data. Results from a simulation study suggest that the new index behaves as theoretically expected, and it can offer additional insights about model fit not available from other sources. In addition, a more stringent cutoff value is perhaps needed than Hu and Bentler's (1999) traditional cutoff criterion with continuous variables. In the empirical data analysis, we use a data set from a measurement development project in support of cigarette smoking cessation research to illustrate the usefulness of the TLIRT. We noticed that had we only utilized the RMSEA index, we could have arrived at qualitatively different conclusions about model fit, depending on the choice of test statistics, an issue to which the TLIRT is relatively more immune.
引用
收藏
页码:455 / 466
页数:12
相关论文
共 40 条
[1]  
Asparouhov T., 2010, Simple second order chi-square correction
[2]   A goodness of fit test for sparse 2p contingency tables [J].
Bartholomew, DJ ;
Leung, SO .
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2002, 55 :1-15
[3]   The goodness of fit of latent trait models in attitude measurement [J].
Bartholomew, DJ ;
Tzamourani, P .
SOCIOLOGICAL METHODS & RESEARCH, 1999, 27 (04) :525-546
[4]  
BENTLER PM, 1990, PSYCHOL BULL, V107, P238, DOI 10.1037/0033-2909.107.2.238
[5]   MARGINAL MAXIMUM-LIKELIHOOD ESTIMATION OF ITEM PARAMETERS - APPLICATION OF AN EM ALGORITHM [J].
BOCK, RD ;
AITKIN, M .
PSYCHOMETRIKA, 1981, 46 (04) :443-459
[7]  
Browne M.W., 1993, SOCIOL METHOD RES, P445
[9]  
Cai L, 2015, FLEXMIRT VERSION 3 0
[10]   Limited-information goodness-of-fit testing of item response theory models for sparse 2P tables [J].
Cai, Li ;
Maydeu-Olivares, Albert ;
Coffman, Donna L. ;
Thissen, David .
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2006, 59 :173-194