Robust estimation via robust gradient estimation

被引:48
作者
Prasad, Adarsh [1 ]
Suggala, Arun Sai [1 ]
Balakrishnan, Sivaraman [1 ]
Ravikumar, Pradeep [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Heavy tails; Huber contamination; Outliers; Robust gradients; Robustness; HIGH DIMENSIONS;
D O I
10.1111/rssb.12364
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We provide a new computationally efficient class of estimators for risk minimization. We show that these estimators are robust for general statistical models, under varied robustness settings, including in the classical Huber epsilon-contamination model, and in heavy-tailed settings. Our workhorse is a novel robust variant of gradient descent, and we provide conditions under which our gradient descent variant provides accurate estimators in a general convex risk minimization problem. We provide specific consequences of our theory for linear regression and logistic regression and for canonical parameter estimation in an exponential family. These results provide some of the first computationally tractable and provably robust estimators for these canonical statistical models. Finally, we study the empirical performance of our proposed methods on synthetic and real data sets, and we find that our methods convincingly outperform a variety of baselines.
引用
收藏
页码:601 / 627
页数:27
相关论文
共 47 条
[1]  
Alon N., 1996, Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, P20, DOI 10.1145/237814.237823
[2]   Convex Optimization: Algorithms and Complexity [J].
不详 .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2015, 8 (3-4) :232-+
[3]  
[Anonymous], 2011, Robust statistics
[4]  
[Anonymous], 2016, ARXIV161002581
[5]  
[Anonymous], P 30 C LEARN THEOR
[6]  
[Anonymous], ARXIV170204656
[7]  
[Anonymous], 2016, ARXIV160308315
[8]   STATISTICAL GUARANTEES FOR THE EM ALGORITHM: FROM POPULATION TO SAMPLE-BASED ANALYSIS [J].
Balakrishnan, Sivaraman ;
Wainwrightt, Martin J. ;
Yu, Bin .
ANNALS OF STATISTICS, 2017, 45 (01) :77-120
[9]  
Bhatia K, 2015, ARXIV150602428
[10]  
BOX GEP, 1953, BIOMETRIKA, V40, P318, DOI 10.1093/biomet/40.3-4.318