SHARP ORACLE INEQUALITIES FOR LEAST SQUARES ESTIMATORS IN SHAPE RESTRICTED REGRESSION

被引:54
作者
Bellec, Pierre C. [1 ,2 ,3 ]
机构
[1] CNRS, UMR 9194, ENSAE, Paris, France
[2] Rutgers State Univ, Dept Stat & Biostat, 501 Hill Ctr,Busch Campus,110 Frelinghuysen Rd, Piscataway, NJ 08854 USA
[3] ENSAE, 3 Ave Pierre Larousse, F-92245 Malakoff, France
关键词
Shape restricted regression; convexity; minimax rates; Gaussian width; concentration; CONVEX REGRESSION; RISK BOUNDS;
D O I
10.1214/17-AOS1566
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The performance of Least Squares (LS) estimators is studied in shape-constrained regression models under Gaussian and sub-Gaussian noise. General bounds on the performance of LS estimators over closed convex sets are provided. These results have the form of sharp oracle inequalities that account for the model misspecification error. In the presence of misspecification, these bounds imply that the LS estimator estimates the projection of the true parameter at the same rate as in the well-specified case. In isotonic and unimodal regression, the LS estimator achieves the non-parametric rate n(-2/3) as well as a parametric rate of order k/n up to logarithmic factors, where k is the number of constant pieces of the true parameter. In univariate convex regression, the LS estimator satisfies an adaptive risk bound of order q/n up to logarithmic factors, where q is the number of affine pieces of the true regression function. This adaptive risk bound holds for any collection of design points. While Guntuboyina and Sen [Probab. Theory Related Fields 163 (2015) 379-411] established that the nonparametric rate of convex regression is of order n(-4/5) for equispaced design points, we show that the nonparametric rate of convex regression can be as slow as n(-2/3) for some worst-case design points. This phenomenon can be explained as follows: Although convexity brings more structure than unimodality, for some worstcase design points this extra structure is uninformative and the nonparametric rates of unimodal regression and convex regression are both n(-2/3). Higher order cones, such as the cone of beta-monotone sequences, are also studied.
引用
收藏
页码:745 / 780
页数:36
相关论文
共 25 条
[11]  
CHATTERJEE S., 2015, MATRIX ESTIMATION MO
[12]  
CHATTERJEE S., 2017, IN PRESS
[13]   An improved global risk bound in concave regression [J].
Chatterjee, Sabyasachi .
ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (01) :1608-1629
[14]   A NEW PERSPECTIVE ON LEAST SQUARES UNDER CONVEX CONSTRAINT [J].
Chatterjee, Sourav .
ANNALS OF STATISTICS, 2014, 42 (06) :2340-2381
[15]   ON RISK BOUNDS IN ISOTONIC AND OTHER SHAPE RESTRICTED REGRESSION PROBLEMS [J].
Chatterjee, Yasachi ;
Guntuboyina, Adityanand ;
Sen, Bodhisattva .
ANNALS OF STATISTICS, 2015, 43 (04) :1774-1800
[16]  
FLAMMARION N., 2016, OPTIMAL RATES STAT S
[17]   Entropy estimate for high-dimensional monotonic functions [J].
Gao, Fuchang ;
Wellner, Jon A. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2007, 98 (09) :1751-1764
[18]   Global risk bounds and adaptation in univariate convex regression [J].
Guntuboyina, Adityanand ;
Sen, Bodhisattva .
PROBABILITY THEORY AND RELATED FIELDS, 2015, 163 (1-2) :379-411
[19]  
Meyer M, 2000, ANN STAT, V28, P1083
[20]   Sharp MSE Bounds for Proximal Denoising [J].
Oymak, Samet ;
Hassibi, Babak .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2016, 16 (04) :965-1029