An Unsupervised Gradient-Based Approach for Real-Time Log Analysis From Distributed Systems

被引:0
作者
Wang, Minquan [1 ]
Lu, Siyang [2 ]
Xiao, Sizhe [3 ]
Wang, Dong Dong [4 ]
Wei, Xiang [1 ]
Han, Ningning [2 ]
Wang, Liqiang [4 ]
机构
[1] Beijing Jiaotong Univ, Sch Software Engn, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[3] Beijing Res Inst Automat Machinery Ind, Beijing, Peoples R China
[4] Univ Cent Florida, Dept Comp Sci, Orlando, FL USA
基金
中国国家自然科学基金;
关键词
Log anomaly detection; log analysis; distributed system; unsupervised learning; deep neural network; ANOMALY DETECTION;
D O I
10.1142/S0218843023500181
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of real-time log anomaly detection for distributed system with deep neural networks by unsupervised learning. There are two challenges in this problem, including detection accuracy and analysis efficacy. To tackle these two challenges, we propose GLAD, a simple yet effective approach mining for anomalies in distributed systems. To ensure detection accuracy, we exploit the gradient features in a well-calibrated deep neural network and analyze anomalous pattern within log files. To improve the analysis efficacy, we further integrate one-class support vector machine (SVM) into anomalous analysis, which significantly reduces the cost of anomaly decision boundary delineation. This effective integration successfully solves both accuracy and efficacy in real-time log anomaly detection. Also, since anomalous analysis is based upon unsupervised learning, it significantly reduces the extra data labeling cost. We conduct a series of experiments to justify that GLAD has the best comprehensive performance balanced between accuracy and efficiency, which implies the advantage in tackling practical problems. The results also reveal that GLAD enables effective anomaly mining and consistently outperforms state-of-the-art methods on both recall and F1 scores.
引用
收藏
页数:18
相关论文
共 26 条
  • [1] Borghesi A, 2019, AAAI CONF ARTIF INTE, P9428
  • [2] Failure diagnosis using decision trees
    Chen, M
    Zheng, AX
    Lloyd, J
    Jordan, MI
    Brewer, E
    [J]. INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2004, : 36 - 43
  • [3] LogTransfer: Cross-System Log Anomaly Detection for Software Systems with Transfer Learning
    Chen, Rui
    Zhang, Shenglin
    Li, Dongwen
    Zhang, Yuzhe
    Guo, Fangrui
    Meng, Weibin
    Pei, Dan
    Zhang, Yuzhi
    Chen, Xu
    Liu, Yuqing
    [J]. 2020 IEEE 31ST INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2020), 2020, : 37 - 47
  • [4] Chung J., ARXIV
  • [5] Real-Time Anomaly Detection in Data Centers for Log-based Predictive Maintenance using an Evolving Fuzzy-Rule-Based Approach
    Decker, Leticia
    Leite, Daniel
    Giommi, Luca
    Bonacorsi, Daniele
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [6] Devlin J., ARXIV
  • [7] DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning
    Du, Min
    Li, Feifei
    Zheng, Guineng
    Srikumar, Vivek
    [J]. CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 1285 - 1298
  • [8] Guo H., ARXIV
  • [9] Log-Based Anomaly Detection With Robust Feature Extraction and Online Learning
    Han, Shangbin
    Wu, Qianhong
    Zhang, Han
    Qin, Bo
    Hu, Jiankun
    Shi, Xingang
    Liu, Linfeng
    Yin, Xia
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 2300 - 2311
  • [10] Experience Report: System Log Analysis for Anomaly Detection
    He, Shilin
    Zhu, Jieming
    He, Pinjia
    Lyu, Michael R.
    [J]. 2016 IEEE 27TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2016, : 207 - 218