A systematic review of item response theory in language assessment: Implications for the dimensionality of language ability

被引:12
作者
Min, Shangchao [1 ]
Aryadoust, Vahid [2 ]
机构
[1] Zhejiang Univ, Inst Appl Linguist, Hangzhou, Peoples R China
[2] Nanyang Technol Univ, Natl Inst Educ, Singapore, Singapore
关键词
Dimensionality; Item response theory (IRT); Language ability; Systematic review; Language assessment; SAMPLE-SIZE; TESTLET; MODEL; FIT; DEPENDENCE; PERFORMANCE; UNIDIMENSIONALITY; COMPREHENSION; INFORMATION; VALIDATION;
D O I
10.1016/j.stueduc.2020.100963
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The present study conducted a systematic review of the item response theory (IRT) literature in language assessment to investigate the conceptualization and operationalization of the dimensionality of language ability. Sixty-two IRT-based studies published between 1985 and 2020 in language assessment and educational measurement journals were first classified into two categories based on a unidimensional and multidimensional research framework, and then reviewed to examine language dimensionality from technical and substantive perspectives. It was found that 12 quantitative techniques were adopted to assess language dimensionality. Exploratory factor analysis was the primary method of dimensionality analysis in papers that had applied unidimensional IRT models, whereas the comparison modeling approach was dominant in the multidimensional framework. In addition, there was converging evidence within the two streams of research supporting the role of a number of factors such as testlets, language skills, subskills, and linguistic elements as sources of multi-dimensionality, while mixed findings were reported for the role of item formats across research streams. The assessment of reading, listening, speaking, and writing skills was grounded within both unidimensional and multidimensional framework. By contrast, vocabulary and grammar knowledge was mainly conceptualized as unidimensional. Directions for continued inquiry and application of IRT in language assessment are provided.
引用
收藏
页数:10
相关论文
共 133 条
[1]   Estimating the reproducibility of psychological science [J].
Aarts, Alexander A. ;
Anderson, Joanna E. ;
Anderson, Christopher J. ;
Attridge, Peter R. ;
Attwood, Angela ;
Axt, Jordan ;
Babel, Molly ;
Bahnik, Stepan ;
Baranski, Erica ;
Barnett-Cowan, Michael ;
Bartmess, Elizabeth ;
Beer, Jennifer ;
Bell, Raoul ;
Bentley, Heather ;
Beyan, Leah ;
Binion, Grace ;
Borsboom, Denny ;
Bosch, Annick ;
Bosco, Frank A. ;
Bowman, Sara D. ;
Brandt, Mark J. ;
Braswell, Erin ;
Brohmer, Hilmar ;
Brown, Benjamin T. ;
Brown, Kristina ;
Bruening, Jovita ;
Calhoun-Sauls, Ann ;
Callahan, Shannon P. ;
Chagnon, Elizabeth ;
Chandler, Jesse ;
Chartier, Christopher R. ;
Cheung, Felix ;
Christopherson, Cody D. ;
Cillessen, Linda ;
Clay, Russ ;
Cleary, Hayley ;
Cloud, Mark D. ;
Cohn, Michael ;
Cohoon, Johanna ;
Columbus, Simon ;
Cordes, Andreas ;
Costantini, Giulio ;
Alvarez, Leslie D. Cramblet ;
Cremata, Ed ;
Crusius, Jan ;
DeCoster, Jamie ;
DeGaetano, Michelle A. ;
Della Penna, Nicolas ;
den Bezemer, Bobby ;
Deserno, Marie K. .
SCIENCE, 2015, 349 (6251)
[2]  
Abbott M.L., 2007, Language Testing, V24, P7, DOI DOI 10.1177/0265532207071510
[3]  
Alderson J. C., 2000, Assessing reading
[4]  
[Anonymous], 2004, Explanatory Item Response Models, A Generalized Linear and Nonlinear Approach, DOI DOI 10.1007/978-1-4757-3990-9
[5]  
[Anonymous], 1985, BASICS ITEM RESPONSE
[6]  
[Anonymous], 2010, Rasch Measurement Transactions, V24, P1289
[7]  
[Anonymous], 2012, Applications of item response theory to practical testing problems
[8]  
[Anonymous], 1991, Fundamentals of a Item Responde Theory
[9]  
Aryadoust V., 2019, International Journal of Listening, V33, P71, DOI DOI 10.1080/10904018.2017.1397519
[10]   A review of comprehension subskills: A Scientometrics perspective [J].
Aryadoust, Vahid .
SYSTEM, 2020, 88