Rodeo: Sparse, greedy nonparametric regression

被引:67
作者
Lafferty, John [1 ]
Wasserman, Larry [2 ]
机构
[1] Carnegie Mellon Univ, Dept Machine Learning, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
nonparametric regression; sparsity; local linear smoothing; bandwidth estimation; variable selection; minimax rates of convergence;
D O I
10.1214/009053607000000811
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We present a greedy method for simultaneously performing local bandwidth selection and variable selection in nonparametric regression. The method starts with a local linear estimator with large bandwidths, and incrementally decreases the bandwidth of variables for which the gradient of the estimator with respect to bandwidth is large. The method-called rodeo (regularization of derivative expectation operator)-conducts a sequence of hypothesis tests to threshold derivatives, and is easy to implement. Under certain assumptions on the regression function and sampling density, it is shown that the rodeo applied to local linear smoothing avoids the curse of dimensionality, achieving near optimal minimax rates of convergence in the number of relevant variables, as if these variables were isolated in advance.
引用
收藏
页码:28 / 63
页数:36
相关论文
共 31 条
[1]  
Bühlmann P, 2006, J MACH LEARN RES, V7, P1001
[2]   For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest solution [J].
Donoho, DL .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2006, 59 (06) :797-829
[3]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[4]   Nonconcave penalized likelihood with a diverging number of parameters [J].
Fan, JQ ;
Peng, H .
ANNALS OF STATISTICS, 2004, 32 (03) :928-961
[5]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[6]   DESIGN-ADAPTIVE NONPARAMETRIC REGRESSION [J].
FAN, JQ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1992, 87 (420) :998-1004
[7]  
Friedman J, 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
[8]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[9]  
George EI, 1997, STAT SINICA, V7, P339
[10]   An equivalence between sparse approximation and support vector machines [J].
Girosi, F .
NEURAL COMPUTATION, 1998, 10 (06) :1455-1480