ROBUST WASSERSTEIN PROFILE INFERENCE AND APPLICATIONS TO MACHINE LEARNING

被引:145
作者
Blanchet, Jose [1 ]
Kang, Yang [2 ]
Murthy, Karthyek [3 ]
机构
[1] Stanford Univ, Management Sci & Engn, 475 Via Ortega, Stanford, CA 94305 USA
[2] Columbia Univ, 1255 Amsterdam Ave,Rm 1005, New York, NY 10027 USA
[3] Singapore Univ Technol & Design, 8 Somapah Rd, Singapore 487372, Singapore
基金
美国国家科学基金会;
关键词
Distributionally robust optimization; Wasserstein distance; regularization; square-root LASSO; logistic regression; support vector machine; limit characterization of optimal Wasserstein ball radius and regularization parameter; empirical likelihood; EMPIRICAL LIKELIHOOD; CONFIDENCE-INTERVALS; LASSO; REGULARIZATION; REGRESSION; DISTANCE;
D O I
10.1017/jpr.2019.49
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We show that several machine learning estimators, including square-root least absolute shrinkage and selection and regularized logistic regression, can be represented as solutions to distributionally robust optimization problems. The associated uncertainty regions are based on suitably defined Wasserstein distances. Hence, our representations allow us to view regularization as a result of introducing an artificial adversary that perturbs the empirical distribution to account for out-of-sample effects in loss estimation. In addition, we introduce RWPI (robust Wasserstein profile inference), a novel inference methodology which extends the use of methods inspired by empirical likelihood to the setting of optimal transport costs (of which Wasserstein distances are a particular case). We use RWPI to show how to optimally select the size of uncertainty regions, and as a consequence we are able to choose regularization parameters for these machine learning estimators without the use of cross validation. Numerical experiments are also given to validate our theoretical findings.
引用
收藏
页码:830 / 857
页数:28
相关论文
共 52 条
[1]  
[Anonymous], ARXIV160402199V1
[2]  
[Anonymous], CONFIDENCE REGIONS O
[3]  
[Anonymous], 2008, Grundlehren der mathematischen Wissenschaften Fundamental Principles of Mathematical Sciences
[4]  
[Anonymous], ARXIV150505116
[5]  
[Anonymous], 2016, ARXIV160401446
[6]  
[Anonymous], 2016, ARXIV160501340
[7]  
[Anonymous], ARXIV170208848
[8]  
[Anonymous], 2014, Advances in Neural Information Process- ing Systems
[9]  
[Anonymous], 2013, CONVERGE PROBAB MEAS
[10]  
[Anonymous], SUPPLEMENTARY MAT