TEST SCALING AND VALUE-ADDED MEASUREMENT

Cited by: 49
Author
Ballou, Dale [1]
Affiliation
[1] Vanderbilt Univ, Peabody Coll, Dept Leadership Policy & Org, Nashville, TN 37205 USA
DOI
10.1162/edfp.2009.4.4.351
Chinese Library Classification
F [Economics]
Discipline classification code
02
Abstract
Conventional value-added assessment requires that achievement be reported on an interval scale. While many metrics do not have this property, application of item response theory (IRT) is said to produce interval scales. However, it is difficult to confirm that the requisite conditions are met. Even when they are, the properties of the data that make a test IRT scalable may not be the properties we seek to represent in an achievement scale, as shown by the lack of surface plausibility of many scales resulting from the application of IRT. An alternative, ordinal data analysis, is presented. It is shown that value-added estimates are sensitive to the choice of ordinal methods over conventional techniques. Value-added practitioners should ask themselves whether they are so confident of the metric properties of these scales that they are willing to attribute differences to the superiority of the latter.
Pages: 351-383
Page count: 33