Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT

被引:17
作者
Zimmer, Felix [1 ]
Draxler, Clemens [2 ]
Debelak, Rudolf [1 ]
机构
[1] Univ Zurich, Zurich, Switzerland
[2] Hlth & Life Sci Univ, Hall In Tirol, Austria
关键词
marginal maximum likelihood; item response theory; power analysis; Wald test; score test; likelihood ratio; gradient test; ITEM RESPONSE THEORY; SAMPLE-SIZE DETERMINATION; CLINICAL-TRIALS; FIT STATISTICS; MODELS; INFORMATION; PARAMETERS; REGRESSION; MISFIT; RATIO;
D O I
10.1007/s11336-022-09883-5
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The Wald, likelihood ratio, score, and the recently proposed gradient statistics can be used to assess a broad range of hypotheses in item response theory models, for instance, to check the overall model fit or to detect differential item functioning. We introduce new methods for power analysis and sample size planning that can be applied when marginal maximum likelihood estimation is used. This allows the application to a variety of IRT models, which are commonly used in practice, e.g., in large-scale educational assessments. An analytical method utilizes the asymptotic distributions of the statistics under alternative hypotheses. We also provide a sampling-based approach for applications where the analytical approach is computationally infeasible. This can be the case with 20 or more items, since the computational load increases exponentially with the number of items. We performed extensive simulation studies in three practically relevant settings, i.e., testing a Rasch model against a 2PL model, testing for differential item functioning, and testing a partial credit model against a generalized partial credit model. The observed distributions of the test statistics and the power of the tests agreed well with the predictions by the proposed methods in sufficiently large samples. We provide an openly accessible R package that implements the methods for user-supplied hypotheses.
引用
收藏
页码:1249 / 1298
页数:50
相关论文
共 88 条
[31]  
Glas CAW, 2016, CH CRC STAT SOC BEHA, P343
[32]   Analysis of longitudinal randomized clinical trials using item response models [J].
Glas, Cees A. W. ;
Geerlings, Hanneke ;
van de Laar, Mart A. F. J. ;
Taal, Erik .
CONTEMPORARY CLINICAL TRIALS, 2009, 30 (02) :158-170
[33]   Use of the Lagrange Multiplier Test for Assessing Measurement Invariance Under Model Misspecification [J].
Guastadisegni, Lucia ;
Cagnone, Silvia ;
Moustaki, Irini ;
Vasdekis, Vassilis .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2022, 82 (02) :254-280
[34]   Statistical power of likelihood ratio and Wald tests in latent class models with covariates [J].
Gudicha, Dereje W. ;
Schmittmann, Verena D. ;
Vermunt, Jeroen K. .
BEHAVIOR RESEARCH METHODS, 2017, 49 (05) :1824-1837
[35]   Generalized Residuals for General Models for Contingency Tables With Application to Item Response Theory [J].
Haberman, Shelby J. ;
Sinharay, Sandip .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (504) :1435-1444
[36]  
Haberman SJ., 2006, RR0614 ED TEST SERV, V2006, P1, DOI [10.1002/j.2333-8504.2006.tb02020.x, DOI 10.1002/J.2333-8504.2006.TB02020.X]
[37]   Towards power and sample size calculations for the comparison of two groups of patients with item response theory models [J].
Hardouin, Jean-Benoit ;
Amri, Sarah ;
Feddag, Mohand-Larbi ;
Sebille, Veronique .
STATISTICS IN MEDICINE, 2012, 31 (11-12) :1277-1290
[38]  
Holland PW, 2012, DIFFERENTIAL ITEM FU, DOI DOI 10.4324/9780203357811
[39]   Power analysis in randomized clinical trials based on item response theory [J].
Holman, R ;
Glas, CAW ;
de Haan, RJ .
CONTROLLED CLINICAL TRIALS, 2003, 24 (04) :390-410
[40]   Estimating power for clinical trials with Patient Reported Outcomes-using Item Response Theory [J].
Hu, Jinxiang ;
Thompson, Jeffrey ;
Mudaranthakam, Dinesh Pal ;
Hinton, Lynn Chollet ;
Streeter, David ;
Park, Michele ;
Terluin, Berend ;
Gajewski, Byron .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2022, 141 :141-148