Penalized Partial Least Squares with applications to B-spline transformations and functional data

被引:59
作者
Kraemer, Nicole [1 ]
Boulesteix, Anne-Laure [2 ,3 ]
Tutz, Gerhard [4 ]
机构
[1] TU Berlin, Dept Elect Engn & Comp Sci, D-10587 Berlin, Germany
[2] Sylvia Lawry Ctr MS Res, Munich, Germany
[3] Tech Univ Munich, Dept Med Stat & Epidemiol, Munich, Germany
[4] Univ Munich, Dept Stat, D-80539 Munich, Germany
关键词
NIR spectroscopy; Additive model; Dimensionality reduction; Nonlinear regression; Conjugate gradient; Krylov spaces;
D O I
10.1016/j.chemolab.2008.06.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel framework that combines penalization techniques with Partial Least Squares (PLS). We focus on two important applications. (1) We combine PLS with a roughness penalty to estimate high-imensional regression problems with functional predictors and scalar response. (2) Starting with an additive model, we expand each variable in terms of a generous number of B-spline basis functions. To prevent overfitting, we estimate the model by applying a penalized version of PLS. We gain additional model flexibility by incorporating a sparsity penalty. Both applications can be formulated in terms of a unified algorithm called Penalized Partial Least Squares, which can be computed virtually as fast as PLS using the kernel trick. Furthermore, we prove a close connection of penalized PLS to preconditioned linear systems. In experiments. we show the benefits of our method to noisy functional data and to sparse nonlinear regression models. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:60 / 69
页数:10
相关论文
共 46 条
[1]  
[Anonymous], 2003, Proc. 20th Int. Conf. Machine Learning
[2]   Comments on: Augmenting the bootstrap to analyze high dimensional genomic data [J].
Anne-Laure Boulesteix ;
Athanassios Kondylis ;
Nicole Krämer .
TEST, 2008, 17 (1) :31-35
[3]   Partial least squares: a versatile tool for the analysis of high-dimensional genomic data [J].
Boulesteix, Anne-Laure ;
Strimmer, Korbinian .
BRIEFINGS IN BIOINFORMATICS, 2007, 8 (01) :32-44
[4]   Bayesian wavelet regression on curves with application to a spectroscopic calibration problem [J].
Brown, PJ ;
Fearn, T ;
Vannucci, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (454) :398-408
[5]  
Cardot H, 2003, STAT SINICA, V13, P571
[6]  
Chapelle O., 2009, IEEE Trans. Neural Netw., V20, P542, DOI [10.1109/TNN.2009.2015974, DOI 10.1109/TNN.2009.2015974]
[7]  
de Boor C., 2001, A practical guide to splines
[8]  
rev. ed. Applied mathematical sciences, DOI DOI 10.1007/978-1-4612-6333-3
[9]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[10]   Local polynomial additive regression through PLS and splines: PLSS [J].
Durand, JF .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 58 (02) :235-246