Unified methods for variable selection and outlier detection in a linear regression

被引:2
|
作者
Seo, Han Son [1 ]
机构
[1] Konkuk Univ, Dept Appl Stat, 120 Neungdong Ro, Seoul 05029, South Korea
关键词
outliers; regression diagnostics; robustness; variable selections; MULTIPLE OUTLIERS; MODEL SELECTION; IDENTIFICATION; TESTS;
D O I
10.29220/CSAM.2019.26.6.575
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The problem of selecting variables in the presence of outliers is considered. Variable selection and outlier detection are not separable problems because each observation affects the fitted regression equation differently and has a different influence on each variable. We suggest a simultaneous method for variable selection and outlier detection in a linear regression model. The suggested procedure uses a sequential method to detect outliers and uses all possible subset regressions for model selections. A simplified version of the procedure is also proposed to reduce the computational burden. The procedures are compared to other variable selection methods using real data sets known to contain outliers. Examples show that the proposed procedures are effective and superior to robust algorithms in selecting the best model.
引用
收藏
页码:575 / 582
页数:8
相关论文
共 50 条