Bi-level variable selection via adaptive sparse group Lasso

被引:23
作者
Fang, Kuangnan [1 ]
Wang, Xiaoyan [1 ]
Zhang, Shengwei [1 ]
Zhu, Jianping [1 ]
Ma, Shuangge [1 ,2 ]
机构
[1] Xiamen Univ, Sch Econ, Dept Stat, Xiamen, Peoples R China
[2] Yale Univ, Sch Publ Hlth, Dept Biostat, New Haven, CT 06520 USA
基金
中国国家自然科学基金;
关键词
adaptive sparse group Lasso; bi-level; penalized variable selection; COOPERATIVE MEDICAL SCHEME; RURAL CHINA; ORACLE PROPERTIES; MODEL SELECTION; ELASTIC-NET; REGRESSION; HEALTH;
D O I
10.1080/00949655.2014.938241
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Penalization has been extensively adopted for variable selection in regression. In some applications, covariates have natural grouping structures, where those in the same group have correlated measurements or related functions. Under such settings, variable selection should be conducted at both the group-level and within-group-level, that is, a bi-level selection. In this study, we propose the adaptive sparse group Lasso (adSGL) method, which combines the adaptive Lasso and adaptive group Lasso (GL) to achieve bi-level selection. It can be viewed as an improved version of sparse group Lasso (SGL) and uses data-dependent weights to improve selection performance. For computation, a block coordinate descent algorithm is adopted. Simulation shows that adSGL has satisfactory performance in identifying both individual variables and groups and lower false discovery rate and mean square error than SGL and GL. We apply the proposed method to the analysis of a household healthcare expenditure data set.
引用
收藏
页码:2750 / 2760
页数:11
相关论文
共 28 条
[1]   Privatization and its discontents - The evolving Chinese health care system [J].
Blumenthal, D ;
Hsiao, W .
NEW ENGLAND JOURNAL OF MEDICINE, 2005, 353 (11) :1165-1170
[2]  
Breheny P, 2009, STAT INTERFACE, V2, P369
[3]  
Breiman L, 1996, ANN STAT, V24, P2350
[4]   High-dimensional variable screening and bias in subsequent inference, with an empirical comparison [J].
Buehlmann, Peter ;
Mandozzi, Jacopo .
COMPUTATIONAL STATISTICS, 2014, 29 (3-4) :407-430
[5]  
Fan JQ, 2010, STAT SINICA, V20, P101
[6]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[7]   Health Insurance Coverage and Impact: A Survey in Three Cities in China [J].
Fang, Kuangnan ;
Shia, BenChang ;
Ma, Shuangge .
PLOS ONE, 2012, 7 (06)
[8]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[9]  
Friedman J., 2010, ARXIV
[10]   A Selective Review of Group Selection in High-Dimensional Models [J].
Huang, Jian ;
Breheny, Patrick ;
Ma, Shuangge .
STATISTICAL SCIENCE, 2012, 27 (04) :481-499