Monitoring covariance in multivariate time series: Comparing machine learning and statistical approaches

被引:0
|
作者
Weix, Derek [1 ]
Cath, Tzahi Y. [2 ]
Hering, Amanda S. [1 ]
机构
[1] Baylor Univ, Dept Stat Sci, One Bear Pl 97140, Waco, TX 76798 USA
[2] Colorado Sch Mines, Dept Civil & Environm Engn, Golden, CO USA
基金
美国国家科学基金会;
关键词
fault detection; machine learning; MEWMA; real-time; time series data; CONTROL CHARTS; PROCESS VARIABILITY; MATRIX;
D O I
10.1002/qre.3551
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In complex systems with multiple variables monitored at high-frequency, variables are not only temporally autocorrelated, but they may also be nonlinearly related or exhibit nonstationarity as the inputs or operation changes. One approach to handling such variables is to detrend them prior to monitoring and then apply control charts that assume independence and stationarity to the residuals. Monitoring controlled systems is even more challenging because the control strategy seeks to maintain variables at prespecified mean levels, and to compensate, correlations among variables may change, making monitoring the covariance essential. In this paper, a vector autoregressive model (VAR) is compared with a multivariate random forest (MRF) and a neural network (NN) for detrending multivariate time series prior to monitoring the covariance of the residuals using a multivariate exponentially weighted moving average (MEWMA) control chart. Machine learning models have an advantage when the data's structure is unknown or may change. We design a novel simulation study with nonlinear, nonstationary, and autocorrelated data to compare the different detrending models and subsequent covariance monitoring. The machine learning models have superior performance for nonlinear and strongly autocorrelated data and similar performance for linear data. An illustration with data from a reverse osmosis process is given.
引用
收藏
页码:2822 / 2840
页数:19
相关论文
共 50 条
  • [41] An extreme learning machine for unsupervised online anomaly detection in multivariate time series
    Peng, Xinggan
    Li, Hanhui
    Yuan, Feng
    Razul, Sirajudeen Gulam
    Chen, Zhebin
    Lin, Zhiping
    NEUROCOMPUTING, 2022, 501 : 596 - 608
  • [42] Multivariate Time Series Prediction based on Multiple Kernel Extreme Learning Machine
    Wang, Xinying
    Han, Min
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 198 - 201
  • [43] Comparing Boosting and Deep Learning Methods on Multivariate Time Series for Retail Demand Forecasting
    Theodoridis, Georgios
    Tsadiras, Athanasios
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART II, 2022, 647 : 375 - 386
  • [44] A comprehensive evaluation of statistical, machine learning and deep learning models for time series prediction
    Xuan, Ang
    Yin, Mengmeng
    Li, Yupei
    Chen, Xiyu
    Ma, Zhenliang
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 55 - 60
  • [45] A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
    Smith, Hayley
    Sweeting, Michael
    Morris, Tim
    Crowther, Michael
    DIAGNOSTIC AND PROGNOSTIC RESEARCH, 2022, 6 (01)
  • [46] A Method for Comparing Multivariate Time Series with Different Dimensions
    Tapinos, Avraam
    Mendes, Pedro
    PLOS ONE, 2013, 8 (02):
  • [47] Comparing alternative approaches for multivariate statistical analysis of batch process data
    Westerhuis, JA
    Kourti, T
    Macgregor, JF
    JOURNAL OF CHEMOMETRICS, 1999, 13 (3-4) : 397 - 413
  • [48] Nature-inspired Approaches for Distance Metric Learning in Multivariate Time Series Classification
    Oregi, Izaskun
    Del Ser, Javier
    Perez, Aritz
    Lozano, Jose A.
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 1992 - 1998
  • [49] Monitoring the mean of multivariate financial time series
    Garthoff, Robert
    Golosnoy, Vasyl
    Schmid, Wolfgang
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2014, 30 (03) : 328 - 340
  • [50] Discussion on "Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models"
    Maranzano, Paolo
    Parker, Paul A.
    ENVIRONMETRICS, 2025, 36 (02)