Marginal Screening for Partial Least Squares Regression

被引:8
作者
Zhao, Naifei [1 ]
Xu, Qingsong [1 ]
Wang, Hong [1 ]
机构
[1] Cent S Univ, Sch Math & Stat, Changsha 410083, Hunan, Peoples R China
来源
IEEE ACCESS | 2017年 / 5卷
基金
中国国家自然科学基金;
关键词
Marginal screening; partial least squares; variable selection; VARIABLE SELECTION METHODS; NONCONCAVE PENALIZED LIKELIHOOD; NEAR-INFRARED SPECTRA; UVE-PLS METHOD; MULTIVARIATE CALIBRATION; DIMENSION REDUCTION; LINEAR-MODELS; ELIMINATION; PREDICTION; LASSO;
D O I
10.1109/ACCESS.2017.2728532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partial least squares (PLS) regression is a versatile modeling approach for high-dimensional data analysis. Recently, PLS-based variable selection has attracted great attention due to high-throughput data reduction and modeling interpretability. In this paper, a class of variable selection methods for PLS, which employs marginal screening approaches to select relevant variables, is proposed. The proposed methods select variables in two steps: first, a solution path of all predictors is generated by sorting the sequence of marginal correlations between each predictor and response, and second, variable selection is carried out by screening the solution path with PLS. We provide three marginal screening methods for PLS in this paper, namely, sure independence screening (SIS), profiled independence screening ( PIS), and high-dimensional ordinary least-squares projection (HOLP). The promising performance of our methods is illustrated via three near-infrared (NIR) spectral data sets. Compared with SIS and PIS, HOLP for PLS is more suitable for selecting important wavelengths and enhances the prediction accuracy in the NIR spectral data.
引用
收藏
页码:14047 / 14055
页数:9
相关论文
共 59 条
[1]   Comparison of different variable selection methods conducted on NIR transmission measurements on intact tablets [J].
Abrahamsson, C ;
Johansson, J ;
Sparén, A ;
Lindgren, F .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2003, 69 (1-2) :3-12
[2]   Using basis expansions for estimating functional PLS regression Applications with chemometric data [J].
Aguilera, Ana M. ;
Escabias, Manuel ;
Preda, Cristian ;
Saporta, Gilbert .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 104 (02) :289-305
[3]   A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra [J].
Cai, Wensheng ;
Li, Yankun ;
Shao, Xueguang .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 90 (02) :188-194
[4]   Elimination of uninformative variables for multivariate calibration [J].
Centner, V ;
Massart, DL ;
deNoord, OE ;
deJong, S ;
Vandeginste, BM ;
Sterna, C .
ANALYTICAL CHEMISTRY, 1996, 68 (21) :3851-3858
[5]  
CHO H, 2011, J ROYAL STAT SOC B, V74, P593
[6]   Performance of some variable selection methods when multicollinearity is present [J].
Chong, IG ;
Jun, CH .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 78 (1-2) :103-112
[7]   Sparse partial least squares regression for simultaneous dimension reduction and variable selection [J].
Chun, Hyonho ;
Keles, Suenduez .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2010, 72 :3-25
[8]   METHODOLOGY AND THEORY FOR PARTIAL LEAST SQUARES APPLIED TO FUNCTIONAL DATA [J].
Delaigle, Aurore ;
Hall, Peter .
ANNALS OF STATISTICS, 2012, 40 (01) :322-352
[9]   Compressed sensing [J].
Donoho, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (04) :1289-1306
[10]  
Eriksson L., 2013, Multi-and megavariate data analysis basic principles and applications, V3rd ed.