PAFormer: Anomaly Detection of Time Series With Parallel-Attention Transformer

Cited by: 11
Authors
Bai, Ningning [1]
Wang, Xiaofeng [1]
Han, Ruidong [2, 3]
Wang, Qin [2]
Liu, Zinian [2]
Affiliations
[1] Xian Univ Technol, Dept Math, Xian 710048, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Yuncheng Univ, Sch Math & Informat Technol, Yuncheng 044000, Peoples R China
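Funding
National Natural Science Foundation of China
Keywords
Anomaly detection; parallel-attention (PA); time series; transformer
DOI
10.1109/TNNLS.2023.3337876
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract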
Time-series anomaly detection is a critical task with significant impact as it serves a pivotal role in the field of data mining and quality management. Current anomaly detection methods are typically based on reconstruction or forecasting algorithms, as these methods have the capability to learn compressed data representations and model time dependencies. However, most methods rely on learning normal distribution patterns, which can be difficult to achieve in real-world engineering applications. Furthermore, real-world time-series data is highly imbalanced, with a severe lack of representative samples for anomalous data, which can lead to model learning failure. In this article, we propose a novel end-to-end unsupervised framework called the parallel-attention transformer (PAFormer), which discriminates anomalies by modeling both the global characteristics and local patterns of time series. Specifically, we construct parallel-attention (PA), which includes two core modules: the global enhanced representation module (GERM) and the local perception module (LPM). GERM consists of two pattern units and a normalization module, with attention weights that indicate the relationship of each data point to the whole series (global). Due to the rarity of anomalous points, they have strong associations with adjacent data points. LPM is composed of a learnable Laplace kernel function that learns the neighborhood relevancies through the distributional properties of the kernel function (local). We employ the PA to learn the global-local distributional differences for each data point, which enables us to discriminate anomalies. Finally, we propose a two-stage adversarial loss to optimize the model. We conduct experiments on five public benchmark datasets (real-world datasets) and one synthetic dataset. The results show that PAFormer outperforms state-of-the-art baselines.
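The global-local comparison described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names are hypothetical, the Laplace scale is fixed rather than learned, plain dot-product self-attention stands in for GERM, and the symmetric KL divergence is an assumed choice of discrepancy measure that the abstract does not specify.

```python
import numpy as np

def laplace_local_weights(length, scale):
    """Row-normalized Laplace kernel over time-index distance (stand-in for LPM).

    `scale` is fixed here; the abstract says the kernel is learnable.
    """
    idx = np.arange(length)
    dist = np.abs(idx[:, None] - idx[None, :])
    kernel = np.exp(-dist / scale) / (2.0 * scale)
    return kernel / kernel.sum(axis=1, keepdims=True)

def global_attention_weights(x):
    """Softmax dot-product self-attention weights (assumed stand-in for GERM)."""
    scores = x @ x.T / np.sqrt(x.shape[1])
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    return weights / weights.sum(axis=1, keepdims=True)

def global_local_discrepancy(x, scale=1.0, eps=1e-12):
    """Per-point symmetric KL between global and local weight rows (assumption)."""
    g = global_attention_weights(x)
    l = laplace_local_weights(x.shape[0], scale)
    log_ratio = np.log(g + eps) - np.log(l + eps)
    kl_gl = np.sum(g * log_ratio, axis=1)
    kl_lg = np.sum(l * -log_ratio, axis=1)
    return kl_gl + kl_lg

# Toy input: 64 time steps, 3 channels of random data.
rng = np.random.default_rng(0)
series = rng.normal(size=(64, 3))
scores = global_local_discrepancy(series, scale=2.0)
```

Each entry of `scores` measures how much one time step's global weight distribution differs from its neighborhood-based one. How PAFormer turns such a difference into an anomaly decision is not stated in the abstract.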
Pages: 3315-3328
Number of pages: 14