The estimation of the IRT reliability coefficient and its lower and upper bounds, with comparisons to CTT reliability statistics

被引:38
作者
Kim, Seonghoon [1 ]
Feldt, Leonard S. [2 ]
机构
[1] Keimyung Univ, Dept Educ, Taegu 704701, South Korea
[2] Univ Iowa, Iowa City, IA USA
关键词
Test reliability; Item response theory (IRT); Lower and upper bounds of reliability coefficient; Test score metric versus ability score metric; MARGINAL MAXIMUM-LIKELIHOOD; ITEM PARAMETER-ESTIMATION; RESPONSE THEORY; SCORES; MODEL;
D O I
10.1007/s12564-009-9062-8
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The primary purpose of this study is to investigate the mathematical characteristics of the test reliability coefficient rho(XX)' as a function of item response theory (IRT) parameters and present the lower and upper bounds of the coefficient. Another purpose is to examine relative performances of the IRT reliability statistics and two classical test theory (CTT) reliability statistics (Cronbach's alpha and Feldt-Gilmer congeneric coefficients) under various testing conditions that result from manipulating large-scale real data. For the first purpose, two alternative ways of exactly quantifying rho(XX)' are compared in terms of computational efficiency and statistical usefulness. In addition, the lower and upper bounds for rho(XX)' are presented in line with the assumptions of essential tau-equivalence and congeneric similarity, respectively. Empirical studies conducted for the second purpose showed across all testing conditions that (1) the IRT reliability coefficient was higher than the CTT reliability statistics; (2) the IRT reliability coefficient was closer to the Feldt-Gilmer coefficient than to the Cronbach's alpha coefficient; and (3) the alpha coefficient was close to the lower bound of IRT reliability. Some advantages of the IRT approach to estimating test-score reliability over the CTT approaches are discussed in the end.
引用
收藏
页码:179 / 188
页数:10
相关论文
共 29 条
[1]  
[Anonymous], 2004, TEST EQUATING SCALIN, DOI [DOI 10.1007/978-1-4757-4310-4, DOI 10.1007/978-1-4939-0317-7]
[2]  
[Anonymous], NUMERICAL RECIPES C
[3]  
[Anonymous], 2012, Applications of item response theory to practical testing problems
[4]  
Birnbaum A., 1968, Statistical theories of mental test scores, P397
[5]   MARGINAL MAXIMUM-LIKELIHOOD ESTIMATION OF ITEM PARAMETERS - APPLICATION OF AN EM ALGORITHM [J].
BOCK, RD ;
AITKIN, M .
PSYCHOMETRIKA, 1981, 46 (04) :443-459
[6]  
Cronbach LJ, 1951, PSYCHOMETRIKA, V16, P297
[7]   Marginal true-score measures and reliability for binary items as a function of their IRT parameters [J].
Dimitrov, DM .
APPLIED PSYCHOLOGICAL MEASUREMENT, 2003, 27 (06) :440-458
[8]  
FELDT L.S., 1989, ED MEASUREMENT
[9]   Estimating the internal consistency reliability of tests composed of testlets varying in length [J].
Feldt, LS .
APPLIED MEASUREMENT IN EDUCATION, 2002, 15 (01) :33-48
[10]   RELIABILITY ESTIMATION FOR A TEST WITH PARTS OF UNKNOWN LENGTHS [J].
GILMER, JS ;
FELDT, LS .
PSYCHOMETRIKA, 1983, 48 (01) :99-111