Challenging the empirical mean and empirical variance: A deviation study

被引:245
作者
Catoni, Olivier [1 ,2 ]
机构
[1] Ecole Normale Super, Dept Math & Appl, CNRS UMR 8553, F-75230 Paris 05, France
[2] INRIA Paris Rocquencourt, CLASSIC Team, Paris, France
来源
ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES | 2012年 / 48卷 / 04期
关键词
Non-parametric estimation; M-estimators; PAC-Bayes bounds; BOUNDS;
D O I
10.1214/11-AIHP454
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We present new M-estimators of the mean and variance of real valued random variables, based on PAC-Bayes bounds. We analyze the non-asymptotic minimax properties of the deviations of those estimators for sample distributions having either a bounded variance or a bounded variance and a bounded kurtosis. Under those weak hypotheses, allowing for heavy-tailed distributions, we show that the worst case deviations of the empirical mean are suboptimal. We prove indeed that for any confidence level, there is some M-estimator whose deviations are of the same order as the deviations of the empirical mean of a Gaussian statistical sample, even when the statistical sample is instead heavy-tailed. Experiments reveal that these new estimators perform even better than predicted by our bounds, showing deviation quantile functions uniformly lower at all probability levels than the empirical mean for non-Gaussian sample distributions as simple as the mixture of two Gaussian measures.
引用
收藏
页码:1148 / 1185
页数:38
相关论文
共 15 条
[1]   PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers [J].
Alquier, P. .
MATHEMATICAL METHODS OF STATISTICS, 2008, 17 (04) :279-304
[2]  
[Anonymous], 905BIS U PAR 6 PAR 7
[3]  
[Anonymous], 2007, IMS LECT NOTES MONOG
[4]  
[Anonymous], ROBUST LINEAR UNPUB
[5]   ROBUST LINEAR LEAST SQUARES REGRESSION [J].
Audibert, Jean-Yves ;
Catoni, Olivier .
ANNALS OF STATISTICS, 2011, 39 (05) :2766-2794
[6]   EFFICIENT AND ROBUST ADAPTIVE ESTIMATOR OF LOCATION [J].
BERAN, R .
ANNALS OF STATISTICS, 1978, 6 (02) :292-313
[7]   ON ADAPTIVE ESTIMATION [J].
BICKEL, PJ .
ANNALS OF STATISTICS, 1982, 10 (03) :647-671
[8]  
Catoni O., 2004, Lecture Notes in Mathematics, V1851
[9]  
Huber P. J., 1981, Robust Statistics
[10]   ROBUST ESTIMATION OF LOCATION PARAMETER [J].
HUBER, PJ .
ANNALS OF MATHEMATICAL STATISTICS, 1964, 35 (01) :73-&