ADMM for Penalized Quantile Regression in Big Data

被引:35
作者
Yu, Liqun [1 ]
Lin, Nan [1 ]
机构
[1] Washinton Univ, Dept Math, St Louis, MO 63130 USA
关键词
Penalized quantile regression; ADMM; large-scale; divide-and-conquer; Hadoop; MapReduce; ALTERNATING DIRECTION METHOD; VARIABLE SELECTION; ALGORITHM; OPTIMIZATION; CONVERGENCE; NONCONVEX;
D O I
10.1111/insr.12221
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Traditional linear programming algorithms for quantile regression, for example, the simplex method and the interior point method, work well for data of small to moderate sizes. However, these methods are difficult to generalize to high-dimensional big data for which penalization is usually necessary. Further, the massive size of contemporary big data calls for the development of large-scale algorithms on distributed computing platforms. The traditional linear programming algorithms are intrinsically sequential and not suitable for such frameworks. In this paper, we discuss how to use the popular ADMM algorithm to solve large-scale penalized quantile regression problems. The ADMM algorithm can be easily parallelized and implemented in modern distributed frameworks. Simulation results demonstrate that the ADMM is as accurate as traditional LP algorithms while faster even in the nonparallel case.
引用
收藏
页码:494 / 518
页数:25
相关论文
共 42 条
[1]  
[Anonymous], 2011, MATHEMATICS-BASEL
[2]  
[Anonymous], ARXIV14101390
[3]  
[Anonymous], 1983, STUDIES MATH ITS APP
[4]  
[Anonymous], 2014, P INT C MACH LEARN
[5]  
[Anonymous], 2012, Hadoop: The definitive guide
[6]  
[Anonymous], 2012, MATH PROGRAMMING
[7]   l1-PENALIZED QUANTILE REGRESSION IN HIGH-DIMENSIONAL SPARSE MODELS [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
ANNALS OF STATISTICS, 2011, 39 (01) :82-130
[8]   Distributed optimization and statistical learning via the alternating direction method of multipliers [J].
Boyd S. ;
Parikh N. ;
Chu E. ;
Peleato B. ;
Eckstein J. .
Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122
[9]  
Chen C, 2005, SANKHYA INDIAN J STA, V67, P399, DOI DOI 10.2307/25053439
[10]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137