Iterative stepwise regression imputation using standard and robust methods

被引:104
作者
Templ, Matthias [1 ,2 ]
Kowarik, Alexander [2 ]
Filzmoser, Peter [1 ]
机构
[1] Vienna Univ Technol, Dept Stat & Probabil Theory, A-1040 Vienna, Austria
[2] Stat Austria, Methods Unit, A-1110 Vienna, Austria
关键词
Regression imputation; Robustness; R;
D O I
10.1016/j.csda.2011.04.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Imputation of missing values is one of the major tasks for data pre-processing in many areas. Whenever imputation of data from official statistics comes into mind, several (additional) challenges almost always arise, like large data sets, data sets consisting of a mixture of different variable types, or data outliers. The aim is to propose an automatic algorithm called IRMI for iterative model-based imputation using robust methods, encountering for the mentioned challenges, and to provide a software tool in R. This algorithm is compared to the algorithm IVEWARE, which is the "recommended software" for imputations in international and national statistical institutions. Using artificial data and real data sets from official statistics and other fields, the advantages of IRMI over IVEWARE - especially with respect to robustness - are demonstrated. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:2793 / 2806
页数:14
相关论文
共 38 条
[1]  
ALFONS A, 2009, UNECE WORK SESS STAT, P10
[2]  
[Anonymous], ESTADISTICA
[3]  
[Anonymous], 2009, LANG ENV STAT COMP
[4]  
[Anonymous], 2000, SURV METHODOL
[5]  
Béguin C, 2008, SURV METHODOL, V34, P91
[6]   Robust inference for generalized linear models [J].
Cantoni, E ;
Ronchetti, E .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) :1022-1030
[7]  
CHAMBERS J, 2008, GRAPHICAL METHODS DA
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]  
Durrant G.B., 2005, NCRM METHODS REV PAP
[10]  
Eurostat, 2008, European Communities: Methodologies and working papers