Anomaly Detection in Distributed Systems via Variational Autoencoders

被引:0
作者
Qian, Yun [1 ]
Ying, Shi [1 ]
Wang, Bingming [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2020年
关键词
Anomaly detection; deep learning; distributed systems; log data analysis;
D O I
10.1109/smc42975.2020.9283078
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed systems have been widely used in the information technology industry. However, with the increasing scale and complexity of distributed systems, the efficiency and accuracy of manual anomaly detection in system logs have decreased. Therefore, there is a great demand for a highly accurate and efficient automatic anomaly detection method based on system log analysis to ensure the reliability and the stability of large-scale distributed systems. In this paper, we propose VeLog, an automatic anomaly detection method based on variational autoencoders (VAEs). In the offline training phase, VeLog learns the patterns of normal log sequences and then generates normal intervals. In the online detection phase, VeLog detects an anomaly by automatically evaluating whether the distance between the input vector and its estimated vector matches these normal intervals. We evaluate VeLog on log datasets collected from representative distributed systems. The experimental results demonstrate that VeLog can detect anomalies with high accuracy and good efficiency.
引用
收藏
页码:2822 / 2829
页数:8
相关论文
共 31 条
[1]  
Akidau T, 2015, PROC VLDB ENDOW, V8, P1792
[2]  
An J., 2015, Technical Report, V2, P1
[3]  
[Anonymous], 2016, TUTORIAL VARIATIONAL
[4]  
[Anonymous], **DATA OBJECT**, DOI DOI 10.5281/ZENODO.3549604
[5]  
Bodík P, 2010, EUROSYS'10: PROCEEDINGS OF THE EUROSYS 2010 CONFERENCE, P111
[6]  
Borghesi A, 2019, AAAI CONF ARTIF INTE, P9428
[7]   Recurrent Neural Network Attention Mechanisms for Interpretable System Log Anomaly Detection [J].
Brown, Andy ;
Tuor, Aaron ;
Hutchinson, Brian ;
Nichols, Nicole .
PROCEEDINGS OF THE 1ST WORKSHOP ON MACHINE LEARNING FOR COMPUTING SYSTEMS (MLCS 2018), 2018,
[8]  
Chalapathy R., 2019, Deep Learning for Anomaly Detection: A Survey, DOI DOI 10.48550/ARXIV.1901.03407
[9]   Failure diagnosis using decision trees [J].
Chen, M ;
Zheng, AX ;
Lloyd, J ;
Jordan, MI ;
Brewer, E .
INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2004, :36-43
[10]   Event Logs for the Analysis of Software Failures: A Rule-Based Approach [J].
Cinque, Marcello ;
Cotroneo, Domenico ;
Pecchia, Antonio .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (06) :806-821