Data Cleaning Basics: Best Practices in Dealing with Extreme Scores

被引:36
作者
Osborne, Jason W. [1 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27695 USA
关键词
Data cleaning; Extreme scores; Outliers; Parameter estimates;
D O I
10.1053/j.nainr.2009.12.009
中图分类号
R47 [护理学];
学科分类号
1011 ;
摘要
In quantitative research, it is critical to perform data cleaning to ensure that the conclusions drawn from the data are as generalizable as possible, yet few researchers report doing so (Osborne JW. Educ Psychol. 2008; 28: 1-10). Extreme scores are a significant threat to the validity and generalizability of the results. In this article, I argue that researchers need to examine extreme scores to determine which of many possible causes contributed to the extreme score. From this, researchers can take appropriate action, which has many laudatory effects, from reducing error variance and improving the accuracy of parameter estimates to reducing the probability of errors of inference.
引用
收藏
页码:37 / 43
页数:7
相关论文
共 27 条
[1]  
[Anonymous], PRACTICAL ASSESS RES
[2]  
Barnett V., 1994, OUTLIERS STAT DATA
[3]  
Brewer CS, 1998, NAT M ASS HLTH SERV
[4]  
Cole J. C., 2008, BEST PRACTICES QUANT
[5]   ANALYSIS OF EXTREME VALUES [J].
DIXON, WJ .
ANNALS OF MATHEMATICAL STATISTICS, 1950, 21 (04) :488-506
[6]  
Evans VP, 1999, ADV SOC SCI, V5, P213
[7]  
Hawkins D., 1980, IDENTIFICATION OUTLI, DOI [10.1007/978-94-015-3994-4, DOI 10.1007/978-94-015-3994-4]
[8]  
Hoaglin DC., 1993, DETECT HANDLE OUTLIE
[9]   SOME COMMENTS CONCERNING USE OF MONOTONIC TRANSFORMATIONS TO REMOVE INTERACTION IN 2-FACTOR ANOVAS [J].
HUCK, SW ;
SUTTON, CO .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1975, 35 (04) :789-791
[10]  
Jarrell M. G., 1994, RES SCH, V1, P49