Robust Regression Using Data Partitioning and M-Estimation

被引:8
作者
Park, Yousung [1 ]
Kim, Daeyoung [2 ]
Kim, Seongyong [3 ]
机构
[1] Korea Univ, Dept Stat, Seoul, South Korea
[2] Univ Massachusetts, Dept Math & Stat, Amherst, MA 01003 USA
[3] Korea Univ, Econ & Stat Inst, Chungnam, South Korea
关键词
Breakdown point; Data partition; Leverage points; Outlier; OUTLIERS;
D O I
10.1080/03610918.2011.598994
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a new robust regression estimator using data partition technique and M estimation (DPM). The data partition technique is designed to define a small fixed number of subsets of the partitioned data set and to produce corresponding ordinary least square (OLS) fits in each subset, contrary to the resampling technique of existing robust estimators such as the least trimmed squares estimator. The proposed estimator shares a common strategy with the median ball algorithm estimator that is obtained from the OLS trial fits only on a fixed number of subsets of the data. We examine performance of the DPM estimator in the eleven challenging data sets and simulation studies. We also compare the DPM with the five commonly used robust estimators using empirical convergence rates relative to the OLS for clean data, robustness through mean squared error and bias, masking and swamping probabilities, the ability of detecting the known outliers, and the regression and affine equivariances.
引用
收藏
页码:1282 / 1300
页数:19
相关论文
共 18 条