Validity evidence of Criterion® for assessing L2 writing proficiency in a Japanese university context

被引：0

作者：

Koizumi R. ^{[1
]}

In’nami Y. ^{[2
]}

Asano K. ^{[1
]}

Agawa T. ^{[1
]}

机构：

[1] Juntendo University, Chiba

[2] Chuo University, Tokyo

来源：

Language Testing in Asia | / 6卷 / 1期

基金：

日本学术振兴会;

关键词：

Automated essay scoring; Essay length; Holistic scoring; Multilevel modeling; Rasch analysis; Syntactic complexity; Validity argument;

D O I：

10.1186/s40468-016-0027-7

中图分类号：

学科分类号：

摘要：

Background: While numerous articles on Criterion® have been published and its validity evidence has accumulated, test users need to obtain relevant validity evidence for their local context and develop their own validity argument. This paper aims to provide validity evidence for the interpretation and use of Criterion® for assessing second language (L2) writing proficiency at a university in Japan. Method: We focused on three perspectives: (a) differences in the difficulty of prompts in terms of Criterion® holistic scores, (b) relationships between Criterion® holistic scores and indicators of L2 proficiency, and (c) changes in Criterion® holistic and writing quality scores at three time points over 28 weeks. We used Rasch analysis (to examine (a)), Pearson product–moment correlations (to examine (b)), and multilevel modeling (to examine (c)). Results: First, we found statistically significant but minor differences in prompt difficulty. Second, Criterion® holistic scores were found to be relatively weakly but positively correlated with indicators of L2 proficiency. Third, Criterion® holistic and writing quality scores—particularly, essay length and syntactic complexity—significantly improved, and thus are sensitive measures of the longitudinal development of L2 writing. Conclusion: All the results can be used as backing (i.e., positive evidence) for validity when we interpret Criterion® holistic scores as reflecting L2 writing proficiency and use the scores to detect gains in L2 writing proficiency. All of these results help to accumulate validity evidence for an overall validity argument in our context. © 2016, Koizumi et al.

引用

共 50 条

[1] Dependency distance measures in assessing L2 writing proficiency
Ouyang, Jinghui
Jiang, Jingyang
Liu, Haitao
ASSESSING WRITING, 2022, 51
[2] Writing in L2 Greek: Exploring the effect of L2 proficiency and learning context on complexity, accuracy, and fluency
Panagopoulos, Panagiotis
Andria, Maria
Mikros, George
Varlokosta, Spyridoula
JOURNAL OF SECOND LANGUAGE WRITING, 2024, 64
[3] Verbal diversity within constructions as a predictor of L2 writing proficiency
Zhang, Xiaopeng
Yang, Xiaofeng
JOURNAL OF SECOND LANGUAGE WRITING, 2025, 68
[4] Linguistic complexity in L2 writing revisited: Issues of topic, proficiency, and construct multidimensionality
Yoon, Hyung-Jo
SYSTEM, 2017, 66 : 130 - 141
[5] The Effects of Task Complexity and L2 Proficiency on L2 Written Performance
Lee, Jiyong
JOURNAL OF ASIA TEFL, 2018, 15 (04): : 945 - 958
[6] Kolmogorov complexity metrics in assessing L2 proficiency: An information-theoretic approach
Wang, Gui
Wang, Hui
Wang, Li
FRONTIERS IN PSYCHOLOGY, 2022, 13
[7] Complexity, accuracy, and fluency in L2 writing across proficiency levels: A matter of L1 background?
Vo Dinh Phuoc
Barrot, Jessie S.
ASSESSING WRITING, 2022, 54
[8] Syntactic complexity across proficiency and languages: L2 and L1 writing in Dutch, Italian and Spanish
Kuiken, Folkert
Vedder, Ineke
INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2019, 29 (02) : 192 - 210
[9] Assessing syntactic sophistication in L2 writing: A usage-based approach
Kyle, Kristopher
Crossley, Scott
LANGUAGE TESTING, 2017, 34 (04) : 513 - 535
[10] Understanding the SSARC model of task sequencing: Assessing L2 writing development
Tabari, Mahmoud Abdi
Wang, Yizhou
Miller, Michol
ASSESSING WRITING, 2024, 62

← 1 2 3 4 5 →