MULTI-FACETED RASCH MEASUREMENT AND BIAS PATTERNS IN EFL WRITING PERFORMANCE ASSESSMENT

Times cited: 11
Authors
He, Tung-Hsien [1]
Gou, Wen Johnny [1]
Chien, Ya-Chen [1]
Chen, I-Shan Jenny [1]
Chang, Shan-Mao [2]
Affiliations
[1] Natl Taipei Univ Educ, Dept Childrens English Educ, Taipei, Taiwan
[2] Natl Changhua Univ Educ, Dept English, Changhua, Peoples R China
Keywords
RATER TYPES; DISCOURSE; MODE
DOI
10.2466/03.11.PR0.112.2.469-485
Chinese Library Classification
B84 [Psychology]
Subject classification code
04; 0402
Abstract
This study applied multi-faceted Rasch measurement to examine rater bias in the assessment of essays written by college students learning English as a foreign language. Four raters, each with academic training in a different discipline, used a six-category rating scale to rate essays analytically on an argumentative topic and on a descriptive topic. FACETS, a Rasch computer program, was used to identify bias patterns by analyzing the rater-topic, rater-category, and topic-category interactions. Results showed that argumentative essays were rated more severely than descriptive essays; that the linguistics-major rater was the most lenient and the literature-major rater the most severe; and that the category of language use received the most severe ratings, whereas content received the most lenient ratings. The severity hierarchies for raters, essay topics, and rating categories suggested that raters' academic training and their perceptions of the importance of categories were associated with their bias patterns. Implications for rater training are discussed.
Pages: 469-485
Number of pages: 17