Locally and globally robust Penalized Trimmed Squares regression

Cited by: 3
Authors
Avramidis, A. [1]
Zioutas, G. [1]
Affiliations
[1] Aristotle Univ Thessaloniki, Fac Technol, Gen Dept, Thessaloniki 54124, Greece
Keywords
Robust regression; Monte Carlo simulation; Penalized Trimmed Squares; Unmasking outliers; Bounded influence; HIGH BREAKDOWN-POINT; LINEAR-REGRESSION; SUPPORT VECTORS; FAST ALGORITHM; OUTLIERS; MODELS; SETS;
DOI
10.1016/j.simpat.2010.06.001
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Multiple outliers are frequently encountered in regression models used in business, economics, engineering, and applied studies. The ordinary least squares (OLS) estimator fails even in the presence of a single outlying observation. To overcome this problem, a class of high breakdown robust estimators (insensitive to outliers up to 50% of the data sample) has been introduced as an alternative to least squares regression. Among them, the Penalized Trimmed Squares (PTS) is a reasonable high breakdown estimator. This estimator is defined by the minimization of an objective function to which a penalty cost for deleting an outlier is added; the penalty serves as an upper bound on the residual error for any feasible regression line. Since the PTS does not require presetting the number of outliers to delete from the data set, it has better efficiency than other estimators. However, small outliers remain influential, causing bias in the regression line. In this work we present a new class of regression estimates called generalized PTS (GPTS). The new GPTS estimator is defined as the PTS but with penalties suitable for bounding the influence function of all observations. We show with some numerical examples and a Monte Carlo simulation study that the generalized PTS estimate has very good performance in terms of both robustness and efficiency. (C) 2010 Elsevier B.V. All rights reserved.
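The penalty idea in the abstract can be illustrated with a small sketch. It assumes the objective takes the form sum over i of min(r_i^2, p), where r_i is a residual and p is the cost of deleting one observation, which is one reading of "penalty cost for deleting an outlier" acting as an upper bound on the residual error. The function names, the single-predictor model, the alternating local search, and the penalty value 9.0 are all hypothetical choices made for illustration. This is not the authors' algorithm, and a search started from the OLS fit can still suffer from masking on harder data.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b*x; returns (a, b).
    Assumes the x values are not all identical."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def pts_objective(xs, ys, a, b, penalty):
    """Assumed PTS-style objective: each point contributes its squared
    residual if kept, or the fixed penalty if deleted, whichever is smaller."""
    return sum(min((y - a - b * x) ** 2, penalty) for x, y in zip(xs, ys))


def pts_local_search(xs, ys, penalty, max_iter=50):
    """Heuristic local search (illustrative only): start from OLS on all
    points, then alternate between deleting points whose squared residual
    exceeds the penalty and refitting OLS on the points that remain."""
    n = len(xs)
    keep = list(range(n))
    a, b = fit_line(xs, ys)
    for _ in range(max_iter):
        new_keep = [i for i in range(n)
                    if (ys[i] - a - b * xs[i]) ** 2 <= penalty]
        if new_keep == keep or len(new_keep) < 2:
            break
        keep = new_keep
        a, b = fit_line([xs[i] for i in keep], [ys[i] for i in keep])
    return a, b, keep


# Hypothetical data: points on y = 1 + 2x, with one gross outlier at x = 4.
xs = [float(x) for x in range(10)]
ys = [1.0 + 2.0 * x for x in xs]
ys[4] += 30.0

a, b, keep = pts_local_search(xs, ys, penalty=9.0)
print(a, b, keep, pts_objective(xs, ys, a, b, 9.0))
```

Note that the number of deleted points is never specified in advance: it falls out of comparing each squared residual with the penalty, which is the property the abstract contrasts with estimators that require presetting the number of outliers.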
Pages: 148-160
Number of pages: 13