An Efficient Elastic Net with Regression Coefficients Method for Variable Selection of Spectrum Data

被引:42
作者
Liu, Wenya [1 ]
Li, Qi [1 ]
机构
[1] Dalian Univ Technol, Sch Control Sci & Engn, Dalian, Peoples R China
来源
PLOS ONE | 2017年 / 12卷 / 02期
基金
中国国家自然科学基金;
关键词
PLS; REGULARIZATION; LASSO;
D O I
10.1371/journal.pone.0171122
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Using the spectrum data for quality prediction always suffers from noise and colinearity, so variable selection method plays an important role to deal with spectrum data. An efficient elastic net with regression coefficients method (Enet-BETA) is proposed to select the significant variables of the spectrum data in this paper. The proposed Enet-BETA method can not only select important variables to make the quality easy to interpret, but also can improve the stability and feasibility of the built model. Enet-BETA method is not prone to overfitting because of the reduction of redundant variables realized by elastic net method. Hypothesis testing is used to further simplify the model and provide a better insight into the nature of process. The experimental results prove that the proposed Enet-BETA method outperforms the other methods in terms of prediction performance and model interpretation.
引用
收藏
页数:13
相关论文
共 21 条
[1]   Elimination of uninformative variables for multivariate calibration [J].
Centner, V ;
Massart, DL ;
deNoord, OE ;
deJong, S ;
Vandeginste, BM ;
Sterna, C .
ANALYTICAL CHEMISTRY, 1996, 68 (21) :3851-3858
[2]   Multiway elastic net (MEN) for final product quality prediction and quality-related analysis of batch processes [J].
Chiu, Chih-Chiun ;
Yao, Yuan .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 125 :153-165
[3]   Performance of some variable selection methods when multicollinearity is present [J].
Chong, IG ;
Jun, CH .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 78 (1-2) :103-112
[4]   COMPARISON OF STOPPING RULES IN FORWARD STEPWISE DISCRIMINANT-ANALYSIS [J].
COSTANZA, MC ;
AFIFI, AA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :777-785
[5]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[6]   Input variable selection for PLS modeling using nearest correlation spectral clustering [J].
Fujiwara, Koichi ;
Sawada, Hiroshi ;
Kano, Manabu .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 118 :109-119
[7]   A Bootstrap-VIP approach for selecting wavelength intervals in spectral imaging applications [J].
Gosselin, Ryan ;
Rodrigue, Denis ;
Duchesne, Carl .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 100 (01) :12-21
[8]   A modified principal component technique based on the LASSO [J].
Jolliffe, IT ;
Trendafilov, NT ;
Uddin, M .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (03) :531-547
[9]   Variable selection via RIVAL (removing irrelevant variables amidst Lasso iterations) and its application to nuclear material detection [J].
Kump, Paul ;
Bai, Er-Wei ;
Chan, Kung-sik ;
Eichinger, Bill ;
Li, Kang .
AUTOMATICA, 2012, 48 (09) :2107-2115
[10]   Genetic algorithms applied to feature selection in PLS regression: how and when to use them [J].
Leardi, R ;
Gonzalez, AL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1998, 41 (02) :195-207