Evaluating methodological enhancements to the Yes/No Angoff standard-setting method in language proficiency assessment

被引:0
作者
Fechter, Tia M. [1 ,2 ,3 ]
Yoon, Heeyeon [1 ,2 ]
机构
[1] Def Personnel Analyt Ctr, Alexandria, VA USA
[2] Def Language Inst Foreign Language Ctr, Monterey, CA USA
[3] Def Personnel Analyt Ctr, POB 1397, Seaside, CA 93955 USA
关键词
Internal validity; Mapmark method; modified Yes/No Angoff; procedural validity; proficiency test; standard setting;
D O I
10.1177/02655322231222600
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent mixed-methods study design. The study used the Yes/No ratings as the baseline method in two rounds of ratings, while differentiating the two methods by incorporating item maps and an Ordered Item Booklet, each of which is an integral tool of the Mapmark and the Bookmark methods. The results showed that the internal validity evidence is similar across both methods, especially after Round 2 ratings. When procedural validity evidence was considered, however, a preference emerged for the method where panelists conducted the initial ratings unbeknownst to the empirical item difficulty information, and then such information was provided on an item map as part of the Round 1 feedback. The findings highlight the importance of evaluating both internal and procedural validity evidence when considering standard-setting methods.
引用
收藏
页码:530 / 556
页数:27
相关论文
共 21 条
[1]  
Angoff W.H., 1971, ED MEASUREMENT, P508
[2]  
[Anonymous], 2014, Standards for educational and psychological testing
[3]  
Cizek G.J., 1996, Educational Measurement: Issues and Practice, V15, P20, DOI DOI 10.1111/j.1745-3992.1996.tb00809.x
[4]  
Cizek G. J., 2009, DEVELOPING PLAN SETT
[5]  
Creswell J W., 2007, Designing and conducting mixed methods research
[6]  
Hambleton R.K., 2006, ED MEASUREMENT, V4th, P433, DOI [10.1111/j.1745-3992.2006.00040.x, DOI 10.1111/J.1745-3992.2006.00040.X]
[7]  
Hambleton R.K., 2012, SETTING PERFORMANCE, P47
[8]   Standard setting: An alternative approach [J].
Impara, JC ;
Plake, BS .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1997, 34 (04) :353-366
[9]  
Interagency Language Roundtable, WELCOME ILR
[10]  
Kaftandjieva F., 2010, Methods for setting cut scores in criterion-referenced achievement tests: A comparative analysis of six recent methods with an application to tests of reading in EFL