Process Knowledge Discovery Using Sparse Principal Component Analysis

被引:11
作者
Gao, Huihui [1 ]
Gajjar, Shriram [2 ]
KulahciP, Murat [3 ,4 ]
Zhu, Qunxiong [1 ]
Palazoglu, Ahmet [2 ]
机构
[1] Beijing Univ Chem Technol, Sch Informat Sci & Technol, Beijing 100029, Peoples R China
[2] Univ Calif Davis, Dept Chem Engn, Davis, CA 95616 USA
[3] Tech Univ Denmark, Dept Appl Math & Comp Sci, DK-2800 Lyngby, Denmark
[4] Lulea Univ Technol, Dept Business Adm Technol & Social Sci, S-97187 Lulea, Sweden
关键词
ROTATION; PCA; DATABASES;
D O I
10.1021/acs.iecr.6b03045
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
As the goals of ensuring process safety and energy efficiency become ever more challenging, engineers increasingly rely on data collected from such processes for informed decision making. During recent decades, extracting and interpreting valuable process information from large historical data sets have been an active area of research. Among the methods used, principal component analysis (PCA) is a well-established technique that allows for dimensionality reduction for large data sets by finding new uncorrelated variables, namely principal components (PCs). However, it is difficult to interpret the derived PCs, as each PC is a linear combination of all of the original variables and the loadings are typically nonzero. Sparse principal component analysis (SPCA) is a relatively recent technique proposed for producing PCs with sparse loadings via the variance sparsity trade-off. We propose a forward SPCA approach that helps uncover the underlying process knowledge regarding variable relations. This approach systematically determines the optimal sparse loadings for each sparse PC while improving interpretability and minimizing information loss. The salient features of the proposed approach are demonstrated through the Tennessee Eastman process simulation. The results indicate how knowledge and process insight can be discovered through a systematic analysis of sparse loadings.
引用
收藏
页码:12046 / 12059
页数:14
相关论文
共 27 条
[1]  
[Anonymous], 2002, Principal components analysis
[2]  
Banerjee O, 2008, J MACH LEARN RES, V9, P485
[3]  
Cios K.J., 2012, Data mining methods for knowledge discovery, V458
[4]   Merging model-based two-dimensional principal component analysis [J].
Cui, Kai ;
Gao, Quanxue ;
Zhang, Hailin ;
Gao, Xinbo ;
Xie, Deyan .
NEUROCOMPUTING, 2015, 168 :1198-1206
[5]   A direct formulation for sparse PCA using semidefinite programming [J].
d'Aspremont, Alexandre ;
El Ghaoui, Laurent ;
Jordan, Michael I. ;
Lanckriet, Gert R. G. .
SIAM REVIEW, 2007, 49 (03) :434-448
[6]   A PLANT-WIDE INDUSTRIAL-PROCESS CONTROL PROBLEM [J].
DOWNS, JJ ;
VOGEL, EF .
COMPUTERS & CHEMICAL ENGINEERING, 1993, 17 (03) :245-255
[7]  
FRAWLEY WJ, 1992, AI MAG, V13, P57
[8]   A data-driven multidimensional visualization technique for process fault detection and diagnosis [J].
Gajjar, Shriram ;
Palazoglu, Ahmet .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2016, 154 :122-136
[9]   Temporal evolution of groundwater composition in an alluvial aquifer (Pisuerga River, Spain) by principal component analysis [J].
Helena, B ;
Pardo, R ;
Vega, M ;
Barrado, E ;
Fernandez, JM ;
Fernandez, L .
WATER RESEARCH, 2000, 34 (03) :807-816
[10]   Principal component analysis: a review and recent developments [J].
Jolliffe, Ian T. ;
Cadima, Jorge .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2016, 374 (2065)