NONASYMPTOTIC ANALYSIS OF SEMIPARAMETRIC REGRESSION MODELS WITH HIGH-DIMENSIONAL PARAMETRIC COEFFICIENTS

被引:9
作者
Zhu, Ying [1 ]
机构
[1] Michigan State Univ, Dept Econ, 486 W Circle Dr Rm 110, E Lansing, MI 48824 USA
关键词
High-dimensional statistics; Lasso; nonasymptotic analysis; partially linear models; sample selection; SELECTION PROCEDURES; VARIABLE SELECTION; SERIES ESTIMATION; SAMPLE SELECTION; LASSO; RECOVERY; RATES;
D O I
10.1214/16-AOS1528
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a two-step projection based Lasso procedure for estimating a partially linear regression model where the number of coefficients in the linear component can exceed the sample size and these coefficients belong to the l(q) -"balls" for q is an element of [0, 1]. Our theoretical results regarding the properties of the estimators are nonasymptotic. In particular, we establish a new nonasymptotic "oracle" result: Although the error of the nonparametric projection per se (with respect to the prediction norm) has the scaling t(n) in the first step, it only contributes a scaling t(n)(2) in the l(2)-error of the second-step estimator for the linear coefficients. This new "oracle" result holds for a large family of nonparametric least squares procedures and regularized nonparametric least squares procedures for the first-step estimation and the driver behind it lies in the projection strategy. We specialize our analysis to the estimation of a semiparametric sample selection model and provide a simple method with theoretical guarantees for choosing the regularization parameter in practice.
引用
收藏
页码:2274 / 2298
页数:25
相关论文
共 42 条
[1]   SEMIPARAMETRIC ESTIMATION OF CENSORED SELECTION MODELS WITH A NONPARAMETRIC SELECTION MECHANISM [J].
AHN, H ;
POWELL, JL .
JOURNAL OF ECONOMETRICS, 1993, 58 (1-2) :3-29
[2]  
Bartlett P. L., 2003, Journal of Machine Learning Research, V3, P463, DOI 10.1162/153244303321897690
[3]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[4]   From Brunn-Minkowski to Brascamp-Lieb and to logarithmic Sobolev inequalities [J].
Bobkov, SG ;
Ledoux, M .
GEOMETRIC AND FUNCTIONAL ANALYSIS, 2000, 10 (05) :1028-1052
[5]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[6]   Two-stage model selection procedures in partially linear regression [J].
Bunea, F ;
Wegkamp, MH .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2004, 32 (02) :105-118
[7]   Consistent covariate selection and post model selection inference in semiparametric regression [J].
Bunea, F .
ANNALS OF STATISTICS, 2004, 32 (03) :898-927
[8]   CONVERGENCE-RATES FOR PARAMETRIC COMPONENTS IN A PARTLY LINEAR-MODEL [J].
CHEN, H .
ANNALS OF STATISTICS, 1988, 16 (01) :136-146
[9]   SERIES ESTIMATION OF SEMILINEAR MODELS [J].
DONALD, SG ;
NEWEY, WK .
JOURNAL OF MULTIVARIATE ANALYSIS, 1994, 50 (01) :30-40
[10]   SEMIPARAMETRIC ESTIMATES OF THE RELATION BETWEEN WEATHER AND ELECTRICITY SALES [J].
ENGLE, RF ;
GRANGER, CWJ ;
RICE, J ;
WEISS, A .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1986, 81 (394) :310-320