Examining testlet effects in the TestDaF listening section: A testlet response theory modeling approach

Cited by: 21
Authors
Eckes, Thomas [1]
Affiliations
[1] Univ Bochum, D-44787 Bochum, Germany
Keywords
Listening comprehension; testlets; local item dependency; item response theory; LOCAL ITEM DEPENDENCE; LANGUAGE PROFICIENCY; RASCH MODELS; BI-FACTOR; ASSESSMENTS; PERFORMANCE;
DOI
10.1177/0265532213492969
Chinese Library Classification number
H0 [Linguistics];
Discipline classification codes
030303 ; 0501 ; 050102 ;
Abstract
Testlets are subsets of test items that are based on the same stimulus and are administered together. Tests that contain testlets are in widespread use in language testing, but they also share a fundamental problem: Items within a testlet are locally dependent, with possibly adverse consequences for test score interpretation and use. Building on testlet response theory (Wainer, Bradlow, & Wang, 2007), the listening section of the Test of German as a Foreign Language (TestDaF) was analyzed to determine whether, and to what extent, testlet effects were present. Three listening passages (i.e., three testlets) with 8, 10, and 7 items, respectively, were analyzed using a two-parameter logistic testlet response model. The data came from two live exams administered in April 2010 (N = 2859) and November 2010 (N = 2214). Results indicated moderate effects for one testlet, and small effects for the other two testlets. Compared with the testlet model, a standard IRT analysis that neglected these testlet effects overestimated test reliability and underestimated the standard error of ability estimates. Item difficulty and item discrimination estimates remained largely unaffected. Implications for the analysis and evaluation of testlet-based tests are discussed.
Pages: 39-61
Number of pages: 23
Related papers
79 in total
[1] Adams R. J., 2012, CONQUEST VERSION 3 0
[2] Alderson J.C., 1995, LANGUAGE TEST CONSTR
[3] Andrich D., 1985, Test design: Developments in psychology and psychometrics, P245
[4] [Anonymous], 2010, ALIGNING TESTS CEFR
[5] [Anonymous], 2009, Bayesian analysis for the social sciences
[6] [Anonymous], 2011, Doing Bayesian data analysis: A tutorial with R and BUGS
[7] [Anonymous], 2007, Educ. Meas., DOI 10.1111/J.1745-3992.2007.00107.X
[8] [Anonymous], 2011, VALIDIERUNG SPRACHPR
[9] Bachman L., 2010, LANGUAGE ASSESSMENT
[10] Bachman L. F., 1990, Fundamental Considerations in Language Testing