Efficient Dynamic Latent Variable Analysis for High-Dimensional Time Series Data

被引:60
作者
Dong, Yining [1 ,2 ]
Liu, Yingxiang [3 ]
Qin, S. [4 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[3] Univ Southern Calif, Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
[4] Univ Hong Kong, Sch Data Sci, Kowloon, Hong Kong, Peoples R China
关键词
Canonical correlation analysis; dynamic feature extraction; high-dimensional time series; latent dynamic modeling; numerical implementation; MODELS;
D O I
10.1109/TII.2019.2958074
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic-inner canonical correlation analysis (DiCCA) extracts dynamic latent variables from high-dimensional time series data with a descending order of predictability in terms of R-2. The reduced dimensional latent variables with rank-ordered predictability capture the dynamic features in the data, leading to easy interpretation and visualization. In this article, numerically efficient algorithms for DiCCA are developed to extract dynamic latent components from high-dimensional time series data. The numerically improved DiCCA algorithms avoid repeatedly inverting a covariance matrix inside the iteration loop of the numerical DiCCA algorithms. A further improvement using singular value decomposition converts the generalized eigenvector problem into a standard eigenvector problem for the DiCCA solution. Another improvement in model efficiency in this article is the dynamic model compaction of the extracted latent scores using autoregressive integrated moving average (ARIMA) models. Integrating factors, if existed in the latent variable scores, are made explicit in the ARIMA models. Numerical tests on two industrial datasets are provided to illustrate the improvements.
引用
收藏
页码:4068 / 4076
页数:9
相关论文
共 33 条
[1]  
Akaike H., 1976, Mathematics in Science and Engineering, V126, P27, DOI 10.1016/S0076-5392(08)60869-3
[2]  
Anandkumar A, 2014, J MACH LEARN RES, V15, P2773
[3]  
[Anonymous], 1999, SYSTEM IDENTIFICATIO
[4]   An Efficient Industrial Big-Data Engine [J].
Basanta-Val, Pablo .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (04) :1361-1369
[5]   KARHUNEN-LOEVE ANALYSIS OF HISTORICAL TIME-SERIES WITH AN APPLICATION TO PLANTATION BIRTHS IN JAMAICA [J].
BASILEVSKY, A ;
HUM, DPJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) :284-290
[6]   AUTO-ASSOCIATION BY MULTILAYER PERCEPTRONS AND SINGULAR VALUE DECOMPOSITION [J].
BOURLARD, H ;
KAMP, Y .
BIOLOGICAL CYBERNETICS, 1988, 59 (4-5) :291-294
[7]   Dynamic-Inner Canonical Correlation and Causality Analysis for High Dimensional Time Series Data [J].
Dong, Yining ;
Qin, S. Joe .
IFAC PAPERSONLINE, 2018, 51 (18) :476-481
[8]   DiCCA with Discrete-Fourier Transforms for Power System Events Detection and Localization [J].
Dong, Yining ;
Liu, Yingxiang ;
Qin, S. Joe .
IFAC PAPERSONLINE, 2018, 51 (18) :726-731
[9]   Regression on dynamic PLS structures for supervised learning of dynamic data [J].
Dong, Yining ;
Qin, S. Joe .
JOURNAL OF PROCESS CONTROL, 2018, 68 :64-72
[10]   Dynamic latent variable analytics for process operations and control [J].
Dong, Yining ;
Qin, S. Joe .
COMPUTERS & CHEMICAL ENGINEERING, 2018, 114 :69-80