Semisupervised anomaly detection of multivariate time series based on a variational autoencoder

被引:0
作者
Ningjiang Chen
Huan Tu
Xiaoyan Duan
Liangqing Hu
Chengxiang Guo
机构
[1] Guangxi University,College of Computer and Electronic Information
[2] Guangxi Colleges and Universities Key Laboratory of Parallel and Distributed Computing,undefined
[3] Guangxi Key Laboratory of Multimedia Communications and Network Technology,undefined
[4] Guangxi University of Chinese Medicine,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Multivariate time series; Anomaly detection; Semisupervised learning; VAE; LSTM; Attention mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
In a large-scale cloud environment, many key performance indicators (KPIs) of entities are monitored in real time. These multivariate time series consist of high-dimensional, high-noise, random and time-dependent data. As a common method implemented in artificial intelligence for IT operations (AIOps), time series anomaly detection has been widely studied and applied. However, the existing detection methods cannot fully consider the influence of multiple factors and cannot quickly and accurately detect anomalies in multivariate KPIs of entities. Concurrently, fine-grained root cause locations cannot be determined for detected anomalies and often require abundant normal data that are difficult to obtain for model training. To solve these problems, we propose a long short-term memory (LSTM)-based semisupervised variational autoencoder (VAE) anomaly detection strategy called LR-SemiVAE. First, LR-SemiVAE uses VAE to perform feature dimension reduction and reconstruction of multivariate time series data and judges whether the entity is abnormal by calculating the reconstruction probability score. Second, by introducing an LSTM network into the VAE encoder and decoder, the model can fully learn the time dependence of multivariate time series. Then, LR-SemiVAE predicts the data labels by introducing a classifier to reduce the dependence on the original labeled data during model training. Finally, by proposing a new evidence lower bound (ELBO) loss function calculation method, LR-SemiVAE pays attention to the normal pattern and ignores the abnormal pattern during training to reduce the time cost of removing random anomaly and noise data. However, due to the limitations of LSTM in learning the long-term dependence of time series data, based on LR-SemiVAE, we propose a transformer-based semisupervised VAE anomaly detection and location strategy called RT-SemiVAE for cluster systems with complex service dependencies. This method learns the long-term dependence of multivariate time series by introducing a parallel multihead attention mechanism transformer, while LSTM is used to capture short-term dependence, and the introduction of parallel computing also markedly reduces model training time. After RT-SemiVAE detects entity anomalies, it traces the root entities according to the obtained service dependence graph and locates the root causes at the indicator level. We verify the strategies by using public data sets and constructing a system prototype. Experimental results show that compared with existing baseline methods, the LR-SemiVAE and RT-SemiVAE strategies can detect anomalies more quickly and accurately and perform fine-grained and accurate localization of the root causes of anomalies.
引用
收藏
页码:6074 / 6098
页数:24
相关论文
共 26 条
[1]  
Borghesi A(2019)A semisupervised autoencoder-based approach for anomaly detection in high performance computing systems Eng Appl Artif Intell 85 634-644
[2]  
Notaro P(2021)A survey of AIOps methods for failure management ACM Trans on Intell Sys and Tech (TIST) 12.6 1-45
[3]  
Cardoso J(2021)A survey on automated log analysis for reliability engineering ACM Comp Surveys (CSUR) 54.6 1-37
[4]  
Gerndt M(2021)A review on outlier/anomaly detection in time series data ACM Comp Surveys (CSUR) 54.3 1-33
[5]  
He S(2018)A multimodal anomaly detector for robot-assisted feeding using an lstm-based variational autoencoder IEEE Robotics and Automation Lett 3.3 1544-1551
[6]  
Blázquez-García A(2020)LSTM-Based VAE-GAN for time-series anomaly detection Sensors 20.13 3738-6342
[7]  
Park D(2018)Information fusion and semi-supervised deep learning scheme for diagnosing gear faults in induction machine systems IEEE Trans on Industrial Elect 66.8 6331-3141
[8]  
Hoshi Y(2021)A survey on anomaly detection for technical systems using LSTM networks Comp in Industry 131 103498-3477
[9]  
Kemp CC(2019)Unsupervised anomaly detection with LSTM neural networks IEEE Trans on Neural Networks and Learning Sys 31.8 3127-178
[10]  
Niu Z(2020)Variational LSTM enhanced anomaly detection for industrial big data IEEE Trans on Industrial Informatics 17.5 3469-127764