Most-predictive design points for functional data predictors

被引:70
作者
Ferraty, F. [1 ]
Hall, P. [2 ]
Vieu, P. [1 ]
机构
[1] Inst Math Toulouse, F-31062 Toulouse 9, France
[2] Univ Melbourne, Dept Math & Stat, Melbourne, Vic 3010, Australia
基金
美国国家科学基金会;
关键词
Boosting; Chemometrics; Design point; Functional data; Functional regression; Local linear regression; Model selection; Spectrometric curve; REGRESSION; ESTIMATORS; SELECTION;
D O I
10.1093/biomet/asq058
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We suggest a way of reducing the very high dimension of a functional predictor, X, to a low number of dimensions chosen so as to give the best predictive performance. Specifically, if X is observed on a fine grid of design points t(1),..., t(r), we propose a method for choosing a small subset of these, say t(i1),..., t(ik), to optimize the prediction of a response variable, Y. The values t(ij) are referred to as the most predictive design points, or covariates, for a given value of k, and are computed using information contained in a set of independent observations (X-i, Y-i) of (X, Y). The algorithm is based on local linear regression, and calculations can be accelerated using linear regression to preselect the design points. Boosting can be employed to further improve the predictive performance. We illustrate the usefulness of our ideas through simulations and examples drawn from chemometrics, and we develop theoretical arguments showing that the methodology can be applied successfully in a range of settings.
引用
收藏
页码:807 / 824
页数:18
相关论文
共 38 条
[1]   Cross-validated estimations in the single-functional index model [J].
Ait-Saidi, Ahmed ;
Ferraty, Frederic ;
Kassa, Rabah ;
Vieu, Philippe .
STATISTICS, 2008, 42 (06) :475-494
[2]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267
[3]  
[Anonymous], 1996, Local polynomial modelling and its applications
[4]  
[Anonymous], 2002, Model selection and multimodel inference: a practical informationtheoretic approach
[5]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[6]   OPTIMAL MINIMAL NEURAL INTERPRETATION OF SPECTRA [J].
BORGGAARD, C ;
THODBERG, HH .
ANALYTICAL CHEMISTRY, 1992, 64 (05) :545-551
[7]   Boosting for high-dimensional linear models [J].
Buhlmann, Peter .
ANNALS OF STATISTICS, 2006, 34 (02) :559-583
[8]  
Cardot H, 2003, STAT SINICA, V13, P571
[9]   SMOOTHING SPLINES ESTIMATORS FOR FUNCTIONAL LINEAR REGRESSION [J].
Crambes, Christophe ;
Kneip, Alois ;
Sarda, Pascal .
ANNALS OF STATISTICS, 2009, 37 (01) :35-72
[10]  
Davidian M, 2004, STAT SINICA, V14, P613