LogBERT: Log Anomaly Detection via BERT

被引:152
作者
Guo, Haixuan [1 ]
Yuan, Shuhan [1 ]
Wu, Xintao [2 ]
机构
[1] Utah State Univ, Logan, UT 84322 USA
[2] Univ Arkansas, Fayetteville, AR 72701 USA
来源
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2021年
基金
美国国家科学基金会;
关键词
D O I
10.1109/IJCNN52387.2021.9534113
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting anomalous events in online computer systems is crucial to protect the systems from malicious attacks or malfunctions. System logs, which record detailed information of computational events, are widely used for system status analysis. In this paper, we propose LogBERT, a self-supervised framework for log anomaly detection based on Bidirectional Encoder Representations from Transformers (BERT). LogBERT learns the patterns of normal log sequences by two novel self-supervised training tasks, masked log message prediction and volume of hypersphere minimization. After training, LogBERT is able to capture the patterns of normal log sequences and further detect anomalies where the underlying patterns deviate from expected patterns. The experimental results on three log datasets show that LogBERT outperforms state-of-the-art approaches for anomaly detection.
引用
收藏
页数:8
相关论文
共 24 条
[11]   Improving one-class SVM for anomaly detection [J].
Li, KL ;
Huang, HK ;
Tian, SF ;
Xu, W .
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, :3077-3081
[12]   Log Clustering based Problem Identification for Online Service Systems [J].
Lin, Qingwei ;
Zhang, Hongyu ;
Lou, Jian-Guang ;
Zhang, Yu ;
Chen, Xuewei .
2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C), 2016, :102-111
[13]   Isolation Forest [J].
Liu, Fei Tony ;
Ting, Kai Ming ;
Zhou, Zhi-Hua .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :413-+
[14]   Affinity maturation of human botulinum neurotoxin antibodies by light chain shuffling via yeast mating [J].
Lou, J. ;
Geren, I. ;
Garcia-Rodriguez, C. ;
Forsyth, C. M. ;
Wen, W. ;
Knopp, K. ;
Brown, J. ;
Smith, T. ;
Smith, L. A. ;
Marks, J. D. .
PROTEIN ENGINEERING DESIGN & SELECTION, 2010, 23 (04) :311-319
[15]   What supercomputers say: A study of five system logs [J].
Oliner, Adam ;
Stearley, Jon .
37TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2007, :575-+
[16]   Nonlinear dimensionality reduction by locally linear embedding [J].
Roweis, ST ;
Saul, LK .
SCIENCE, 2000, 290 (5500) :2323-+
[17]  
Ruff L, 2018, PR MACH LEARN RES, V80
[18]   Estimating the support of a high-dimensional distribution [J].
Schölkopf, B ;
Platt, JC ;
Shawe-Taylor, J ;
Smola, AJ ;
Williamson, RC .
NEURAL COMPUTATION, 2001, 13 (07) :1443-1471
[19]  
Vaswani A, 2017, ADV NEUR IN, V30
[20]  
Wang YX, 2004, PROCEEDINGS FROM THE FIFTH IEEE SYSTEMS, MAN AND CYBERNETICS INFORMATION ASSURANCE WORKSHOP, P358