Validity evidence of Criterion® for assessing L2 writing proficiency in a Japanese university context

被引:0
作者
Koizumi R. [1 ]
In’nami Y. [2 ]
Asano K. [1 ]
Agawa T. [1 ]
机构
[1] Juntendo University, Chiba
[2] Chuo University, Tokyo
基金
日本学术振兴会;
关键词
Automated essay scoring; Essay length; Holistic scoring; Multilevel modeling; Rasch analysis; Syntactic complexity; Validity argument;
D O I
10.1186/s40468-016-0027-7
中图分类号
学科分类号
摘要
Background: While numerous articles on Criterion® have been published and its validity evidence has accumulated, test users need to obtain relevant validity evidence for their local context and develop their own validity argument. This paper aims to provide validity evidence for the interpretation and use of Criterion® for assessing second language (L2) writing proficiency at a university in Japan. Method: We focused on three perspectives: (a) differences in the difficulty of prompts in terms of Criterion® holistic scores, (b) relationships between Criterion® holistic scores and indicators of L2 proficiency, and (c) changes in Criterion® holistic and writing quality scores at three time points over 28 weeks. We used Rasch analysis (to examine (a)), Pearson product–moment correlations (to examine (b)), and multilevel modeling (to examine (c)). Results: First, we found statistically significant but minor differences in prompt difficulty. Second, Criterion® holistic scores were found to be relatively weakly but positively correlated with indicators of L2 proficiency. Third, Criterion® holistic and writing quality scores—particularly, essay length and syntactic complexity—significantly improved, and thus are sensitive measures of the longitudinal development of L2 writing. Conclusion: All the results can be used as backing (i.e., positive evidence) for validity when we interpret Criterion® holistic scores as reflecting L2 writing proficiency and use the scores to detect gains in L2 writing proficiency. All of these results help to accumulate validity evidence for an overall validity argument in our context. © 2016, Koizumi et al.
引用
收藏
相关论文
共 50 条
  • [1] Dependency distance measures in assessing L2 writing proficiency
    Ouyang, Jinghui
    Jiang, Jingyang
    Liu, Haitao
    ASSESSING WRITING, 2022, 51
  • [2] Writing in L2 Greek: Exploring the effect of L2 proficiency and learning context on complexity, accuracy, and fluency
    Panagopoulos, Panagiotis
    Andria, Maria
    Mikros, George
    Varlokosta, Spyridoula
    JOURNAL OF SECOND LANGUAGE WRITING, 2024, 64
  • [3] Verbal diversity within constructions as a predictor of L2 writing proficiency
    Zhang, Xiaopeng
    Yang, Xiaofeng
    JOURNAL OF SECOND LANGUAGE WRITING, 2025, 68
  • [5] The Effects of Task Complexity and L2 Proficiency on L2 Written Performance
    Lee, Jiyong
    JOURNAL OF ASIA TEFL, 2018, 15 (04): : 945 - 958
  • [6] Kolmogorov complexity metrics in assessing L2 proficiency: An information-theoretic approach
    Wang, Gui
    Wang, Hui
    Wang, Li
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [7] Complexity, accuracy, and fluency in L2 writing across proficiency levels: A matter of L1 background?
    Vo Dinh Phuoc
    Barrot, Jessie S.
    ASSESSING WRITING, 2022, 54
  • [8] Syntactic complexity across proficiency and languages: L2 and L1 writing in Dutch, Italian and Spanish
    Kuiken, Folkert
    Vedder, Ineke
    INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2019, 29 (02) : 192 - 210
  • [9] Assessing syntactic sophistication in L2 writing: A usage-based approach
    Kyle, Kristopher
    Crossley, Scott
    LANGUAGE TESTING, 2017, 34 (04) : 513 - 535
  • [10] Understanding the SSARC model of task sequencing: Assessing L2 writing development
    Tabari, Mahmoud Abdi
    Wang, Yizhou
    Miller, Michol
    ASSESSING WRITING, 2024, 62