Input variable selection for PLS modeling using nearest correlation spectral clustering

被引:35
作者
Fujiwara, Koichi [1 ]
Sawada, Hiroshi [1 ]
Kano, Manabu [2 ]
机构
[1] NTT Commun Sci Labs, Kyoto 6190237, Japan
[2] Kyoto Univ, Dept Syst Sci, Sakyo Ku, Kyoto 6068501, Japan
关键词
Soft-senor; Variable selection; Spectral clustering; Regression; Graph theory; Modeling; PARTIAL-LEAST-SQUARES; DISTILLATION COMPOSITIONS; REGRESSION;
D O I
10.1016/j.chemolab.2012.08.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Soft-sensors have been widely used for estimating product quality or other key variables, and partial least squares (PLS) regression is accepted as a useful technique for soft-sensor design. To achieve high estimation performance, it is important to select appropriate input or explanatory variables. The present work proposes a new systematic methodology to select input variables for PLS using nearest correlation spectral clustering (NCSC), which is a clustering method based on the correlation among variables. The proposed method, referred to as NCSC-based variable selection (NCSC-VS), clusters the variables into some variable classes by using NCSC, and selects a few variable classes according to their contribution to estimates. That is, the input variables are not selected individually but some variables that have similar correlation are selected together. The usefulness of the proposed NCSC-VS is demonstrated through an application to soft-sensor design for an industrial chemical process. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:109 / 119
页数:11
相关论文
共 26 条
[1]  
[Anonymous], 2001, SPECTRAL CLUSTERING
[2]   Genetic algorithm-based wavelength selection method for spectral calibration [J].
Arakawa, Masamoto ;
Yamashita, Yosuke ;
Funatsu, Kimito .
JOURNAL OF CHEMOMETRICS, 2011, 25 (01) :10-19
[3]  
Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014
[4]  
Bontempi G, 1999, INT J CONTROL, V72, P643, DOI 10.1080/002071799220830
[5]   A new data-based methodology for nonlinear process modeling [J].
Cheng, C ;
Chiu, MS .
CHEMICAL ENGINEERING SCIENCE, 2004, 59 (13) :2801-2810
[6]   Performance of some variable selection methods when multicollinearity is present [J].
Chong, IG ;
Jun, CH .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 78 (1-2) :103-112
[7]  
Ding C.H.Q., 2001, IEEE INT C DAT MIN S
[8]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[9]   Correlation-based spectral clustering for flexible process monitoring [J].
Fujiwara, Koichi ;
Kano, Manabu ;
Hasebe, Shinji .
JOURNAL OF PROCESS CONTROL, 2011, 21 (10) :1438-1448
[10]   Development of correlation-based clustering method and its application to software sensing [J].
Fujiwara, Koichi ;
Kano, Manabu ;
Hasebe, Shinji .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 101 (02) :130-138