LogFiT: Log Anomaly Detection Using Fine-Tuned Language Models

被引：10

作者：

Almodovar, Crispin ^{[1
]}

Sabrina, Fariza ^{[1
]}

Karimi, Sarvnaz ^{[2
]}

Azad, Salahuddin ^{[1
]}

机构：

[1] Cent Queensland Univ, Sch Engn & Technol, Rockhampton, Qld 4701, Australia

[2] CSIRO, Data61, Sydney, NSW 2015, Australia

来源：

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2024年 / 21卷 / 02期

关键词：

Service monitoring; fault management; log anomaly detection; deep learning; natural language processing; language modeling;

D O I：

10.1109/TNSM.2024.3358730

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

System logs are a valuable source of information for monitoring and maintaining the security and stability of computer systems. Techniques based on Deep Learning and Natural Language Processing have demonstrated effectiveness in detecting abnormal behaviour from these system logs. However, existing anomaly detection approaches have limitations in terms of flexibility and practicality. Techniques that rely on log templates such as DeepLog and LogBERT fail to capture semantic information and are unable to handle variability in log content. On the other hand, classification-based approaches such as LogSy, LogRobust and HitAnomaly require time-consuming data labelling for supervised training. In this paper, a novel log anomaly detection model named LogFiT is proposed. The LogFiT model doesn't make use of a vocabulary of log templates and it doesn't require any labeled data as the model only requires self-supervised training. The LogFiT model uses a pretrained Bidirectional Encoder Representations from Transformers (BERT)-based language model fine-tuned to recognise the linguistic patterns of the normal log data. The LogFiT model is trained using masked sentence prediction on the normal log data only. Consequently, when presented with the new log data, the model's top- ${k}$ token prediction accuracy serves as a threshold for determining whether the new log data deviates from the normal log data. Experimental results show that LogFiT's F1 score exceeds that of baselines on the HDFS, BGL, and Thunderbird datasets. Critically, when variability is introduced in the log data during evaluation, LogFiT retains its effectiveness compared to that of baselines.

引用

页码：1715 / 1723

页数：9

共 51 条

[1] Almodovar C., 2023, arXiv
[2] Almodovar Crispin, 2022, P 20 ANN WORKSH AUST, P139
[3] Alsaheel A, 2021, PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, P3005
[4] Ananya Kumar, 2022, arXiv
[5] Australia Dept. Home Affairs Belconnen Australia, 2020, Australia's Cyber Security Strategy 2020
[6] Beltagy I, 2020, Arxiv, DOI arXiv:2004.05150
[7] Bommasani R., 2021, arXiv
[8] Chalapathy R, 2019, Arxiv, DOI arXiv:1901.03407
[9] Chen Zeming, 2022, arXiv
[10] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

← 1 2 3 4 5 6 →