Impact of Rank-Based Normalizing Transformations on the Accuracy of Test Scores

被引:387
作者
Solomon, Shira R. [1 ]
Sawilowsky, Shlomo S. [2 ]
机构
[1] CNA Educ, New York, NY 10018 USA
[2] Wayne State Univ, Evaluat & Res, Detroit, MI USA
关键词
Normalization; normalizing transformations; T scores; test scoring; ranking methods; Rankit; Blom; Tukey; Van der Waerden; Monte Carlo;
D O I
10.22237/jmasm/1257034080
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The purpose of this article is to provide an empirical comparison of rank-based normalization methods for standardized test scores. A series of Monte Carlo simulations were performed to compare the Blom, Tukey, Van der Waerden and Rankit approximations in terms of achieving the T score's specified mean and standard deviation and unit normal skewness and kurtosis. All four normalization methods were accurate on the mean but were variably inaccurate on the standard deviation. Overall, deviation from the target moments was pronounced for the even moments but slight for the odd moments. Rankit emerged as the most accurate method among all sample sizes and distributions, thus it should be the default selection for score normalization in the social and behavioral sciences. However, small samples and skewed distributions degrade the performance of all methods, and practitioners should take these conditions into account when making decisions based on standardized test scores.
引用
收藏
页码:448 / 462
页数:15
相关论文
共 47 条
[1]   FORMULAS FOR EQUATING RATINGS ON DIFFERENT SCALES [J].
AIKEN, LR .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1987, 47 (01) :51-54
[2]   THE J-CURVE HYPOTHESIS OF CONFORMING BEHAVIOR [J].
Allport, Floyd H. .
JOURNAL OF SOCIAL PSYCHOLOGY, 1934, 5 (02) :141-183
[3]  
Angoff W. H., 1984, SCALES NORMS EQUIVAL
[4]  
[Anonymous], 1908, BIOMETRIKA, V6, P1
[5]  
[Anonymous], 1972, ROBUST ESTIMATES LOC
[6]   A RANKIT ANALYSIS OF PAIRED COMPARISONS FOR MEASURING THE EFFECT OF SPRAYS ON FLAVOR [J].
BLISS, CI ;
GREENWOOD, ML ;
WHITE, ES .
BIOMETRICS, 1956, 12 (04) :381-403
[7]  
BLOM G, 1954, BIOMETRIKA, V41, P302, DOI 10.2307/2332711
[8]  
Blom G., 1958, STAT ESTIMATES TRANS
[9]   ROBUSTNESS [J].
BRADLEY, JV .
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 1978, 31 (NOV) :144-152
[10]   Methods in scaling the basic competence test [J].
Chang, Shun-Wen .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2006, 66 (06) :907-929