Penalized Quantile Regression for Distributed Big Data Using the Slack Variable Representation

被引:12
作者
Fan, Ye [1 ]
Lin, Nan [2 ]
Yin, Xianjun [1 ]
机构
[1] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
[2] Washington Univ, Dept Math & Stat, St Louis, MO 63130 USA
关键词
ADMM; Big data; Nonconvex penalty; Quantile regression; SELECTION;
D O I
10.1080/10618600.2020.1840996
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Penalized quantile regression is a widely used tool for analyzing high-dimensional data with heterogeneity. Although its estimation theory has been well studied in the literature, its computation still remains a challenge in big data, due to the nonsmoothness of the check loss function and the possible nonconvexity of the penalty term. In this article, we propose the QPADM-slack method, a parallel algorithm formulated via the alternating direction method of multipliers (ADMM) that supports penalized quantile regression in big data. Our proposal is different from the recent QPADM algorithm and uses the slack variable representation of the quantile regression problem. Simulation studies demonstrate that this new formulation is significantly faster than QPADM, especially when the data volume n or the dimension p is large, and has favorable estimation accuracy in big data analysis for both nondistributed and distributed environments. We further illustrate the practical performance of QPADM-slack by analyzing a news popularity dataset.
引用
收藏
页码:557 / 565
页数:9
相关论文
共 23 条
[1]  
Battey H., 2015, ARXIV150905457
[2]  
Boyd W, 2011, TLS-TIMES LIT SUPPL, P3
[3]   ADAPTIVE ROBUST VARIABLE SELECTION [J].
Fan, Jianqing ;
Fan, Yingying ;
Barut, Emre .
ANNALS OF STATISTICS, 2014, 42 (01) :324-351
[4]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[5]   A Proactive Intelligent Decision Support System for Predicting the Popularity of Online News [J].
Fernandes, Kelwin ;
Vinagre, Pedro ;
Cortez, Paulo .
PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 :535-546
[6]  
Gabay D., 1976, Computers & Mathematics with Applications, V2, P17, DOI 10.1016/0898-1221(76)90003-1
[7]  
GLOWINSKI R, 1975, REV FR AUTOMAT INFOR, V9, P41
[8]   ADMM for High-Dimensional Sparse Penalized Quantile Regression [J].
Gu, Yuwen ;
Fan, Jun ;
Kong, Lingchen ;
Ma, Shiqian ;
Zou, Hui .
TECHNOMETRICS, 2018, 60 (03) :319-331
[9]   An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines [J].
Guan, Lei ;
Qiao, Linbo ;
Li, Dongsheng ;
Sun, Tao ;
Ge, Keshi ;
Lu, Xicheng .
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, :1209-1216
[10]  
Hunter DR, 2000, J COMPUT GRAPH STAT, V9, P60