Learning Graph Structures With Transformer for Multivariate Time-Series Anomaly Detection in IoT

被引:283
作者
Chen, Zekai [1 ]
Chen, Dingshuo [2 ]
Zhang, Xiao [2 ]
Yuan, Zixuan [3 ]
Cheng, Xiuzhen [2 ]
机构
[1] George Washington Univ, Dept Comp Sci, Washington, DC 20052 USA
[2] Shandong Univ, Sch Comp Sci & Technol, Qingdao 250100, Peoples R China
[3] Rutgers State Univ, Sch Business, New Brunswick, NJ 08901 USA
关键词
Time series analysis; Anomaly detection; Sensors; Convolution; Predictive models; Internet of Things; Correlation; graph learning; multivariate time series; self-attention;
D O I
10.1109/JIOT.2021.3100509
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many real-world Internet of Things (IoT) systems, which include a variety of Internet-connected sensory devices, produce substantial amounts of multivariate time-series data. Meanwhile, vital IoT infrastructures, such as smart power grids and water distribution networks are frequently targeted by cyberattacks, making anomaly detection an important study topic. Modeling such relatedness is, nevertheless, unavoidable for any efficient and effective anomaly detection system, given the intricate topological and nonlinear connections that are originally unknown among sensors. Furthermore, detecting anomalies in multivariate time series is difficult due to their temporal dependency and stochasticity. This article presented GTA, a new framework for multivariate time-series anomaly detection that involves automatically learning a graph structure, graph convolution, and modeling temporal dependency using a transformer-based architecture. The connection learning policy, which is based on the Gumbel-softmax sampling approach to learn bidirected links among sensors directly, is at the heart of the learning graph structure. To describe the anomaly information flow between network nodes, we introduced a new graph convolution called influence propagation convolution. In addition, to tackle the quadratic complexity barrier, we suggested a multibranch attention mechanism to replace the original multihead self-attention method. Extensive experiments on four publicly available anomaly detection benchmarks further demonstrate the superiority of our approach over alternative state of the arts.
引用
收藏
页码:9179 / 9189
页数:11
相关论文
共 57 条
[1]  
Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P1
[2]  
Ahmed C.M., 2017, P 3 INT WORKSH CYB S, P25
[3]   Data Improving in Time Series Using ARX and ANN Models [J].
Akouemo, Hermine N. ;
Povinelli, Richard J. .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2017, 32 (05) :3352-3359
[4]  
Angiulli F., 2002, Principles of Data Mining and Knowledge Discovery. 6th European Conference, PKDD 2002. Proceedings (Lecture Notes in Artificial Intelligence Vol.2431), P15
[5]  
[Anonymous], 2005, IMC
[6]  
[Anonymous], 2016, ICLR
[7]  
[Anonymous], 2005, KDD
[8]   A Review on Outlier/Anomaly Detection in Time Series Data [J].
Blazquez-Garcia, Ane ;
Conde, Angel ;
Mori, Usue ;
Lozano, Jose A. .
ACM COMPUTING SURVEYS, 2022, 54 (03)
[9]  
Cai Z., 2021, Generative Adversarial Networks: A Survey Toward Private and Secure Applications
[10]   Trading Private Range Counting over Big IoT Data [J].
Cai, Zhipeng ;
He, Zaobo .
2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, :144-153