Variable Selection Using Adaptive Nonlinear Interaction Structures in High Dimensions

Cited by: 87
Authors
Radchenko, Peter [1 ]
James, Gareth M. [1 ]
Affiliations
[1] University of Southern California, Marshall School of Business, Los Angeles, CA 90089 USA
Funding
National Science Foundation (USA);
Keywords
Heredity structure; Interactions; Nonlinear regression; Regularization; Variable selection; DANTZIG SELECTOR; REGRESSION; SHRINKAGE; LASSO;
DOI
10.1198/jasa.2010.tm10130
Chinese Library Classification number
O21 [Probability and Mathematical Statistics]; C8 [Statistics];
Subject classification code
020208 ; 070103 ; 0714 ;
Abstract
Numerous penalization-based methods have been proposed for fitting a traditional linear regression model in which the number of predictors, p, is large relative to the number of observations, n. Most of these approaches assume sparsity in the underlying coefficients and perform some form of variable selection. Recently, some of this work has been extended to nonlinear additive regression models. However, in many contexts one wishes to allow for the possibility of interactions among the predictors. This poses serious statistical and computational difficulties when p is large, as the number of candidate interaction terms is of order p^2. We introduce a new approach, "Variable selection using Adaptive Nonlinear Interaction Structures in High dimensions" (VANISH), that is based on a penalized least squares criterion and is designed for high-dimensional nonlinear problems. Our criterion is convex and enforces the heredity constraint; in other words, if an interaction term is added to the model, then the corresponding main effects are automatically included. We provide theoretical conditions under which VANISH will select the correct main effects and interactions. These conditions suggest that VANISH should outperform certain natural competitors when the true interaction structure is sufficiently sparse. Detailed simulation results are also provided, demonstrating that VANISH is computationally efficient and can be applied to nonlinear models involving thousands of terms while producing superior predictive performance over other approaches.
Pages: 1541-1553
Number of pages: 13