Application of the Bookmark method: setting standard for the ninth-grade mathematics achievement test in China

被引:0
作者
Li, Guangming [1 ]
Wu, Yuelin [1 ]
机构
[1] South China Normal Univ, Ctr Studies Psychol Applicat, Sch Psychol, Guangdong Key Lab Mental Hlth & Cognit Sci, Guangzhou, Peoples R China
关键词
Standard setting; Bookmark method; Rasch model; Item response theory; Generalizability theory; ANGOFF;
D O I
10.1007/s12144-022-03992-1
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The purpose of this paper is to apply the Bookmark method to the standard setting. Based on the Rasch Model in item response theory, a ninth-grade mathematics achievement test in china has been taken as an example of the standard setting, and 2 cut scores have been established to distinguish students into different performance levels eventually, namely basic and proficient cut scores. In addition, based on the use of generalizability theory, the standard error of the cut scores and the practical standard error are used as indicators to explore the effect that panelists and the standard setting rounds have made on the precision of Bookmark standard setting results through a mixed design of (p: g) x r. Result shows that the cut scores of basic and proficient were respectively 52.25 and 67.53. Besides, increasing the number of panelists in the group or standard setting rounds will reduce the standard error of the cut scores and the practical standard error. In addition, practical standard error is a necessary reference index when applying generalizability theory to analyze the cut scores established by Bookmark method, while the standard error of cut scores also has a great reference value.
引用
收藏
页码:28941 / 28952
页数:12
相关论文
共 27 条
[1]  
Angoff W.H., 1971, ED MEASUREMENT, P508
[2]  
Brennan R.L., 1980, APPL PSYCH MEAS, P219, DOI [10.1177/014662168000400209, DOI 10.1177/014662168000400209]
[3]   Performance assessments from the perspective of generalizability theory [J].
Brennan, RL .
APPLIED PSYCHOLOGICAL MEASUREMENT, 2000, 24 (04) :339-353
[4]  
Buckendahl C.W., 2009, Evaluation of the National Assessment of Educational Progress
[5]   A comparison of Angoff and Bookmark standard setting methods [J].
Buckendahl, CW ;
Smith, RW ;
Impara, JC ;
Plake, BS .
JOURNAL OF EDUCATIONAL MEASUREMENT, 2002, 39 (03) :253-263
[6]  
[陈梦竹 CHEN Meng-Zhu], 2009, [心理科学进展, Advances in Psychological Science], V17, P1102
[7]  
Chen P., 2008, NATL ED PSYCHOL STAT, V36
[8]   An Experimental Study of the Internal Consistency of Judgments Made in Bookmark Standard Setting [J].
Clauser, Brian E. ;
Baldwin, Peter ;
Margolis, Melissa J. ;
Mee, Janet ;
Winward, Marcia .
JOURNAL OF EDUCATIONAL MEASUREMENT, 2017, 54 (04) :481-497
[9]   An Examination of the Replicability of Angoff Standard Setting Results Within a Generalizability Theory Framework [J].
Clauser, Jerome C. ;
Margolis, Melissa J. ;
Clauser, Brian E. .
JOURNAL OF EDUCATIONAL MEASUREMENT, 2014, 51 (02) :127-140
[10]  
Crick J.E., 1983, Manual for GENOVA: A Generalized Analysis of Variance System