RAGLog: Log Anomaly Detection using Retrieval Augmented Generation

被引:0
|
作者
Pan, Jonathan [1 ]
Liang, Wong Swee [1 ]
Yidi, Yuan [1 ]
机构
[1] Home Team Sci & Technol Agcy, Singapore, Singapore
关键词
Log analysis; Retrieval Augmented Generation; Large Language Model;
D O I
10.1109/WEPST58552.2024.00034
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ability to detect log anomalies from system logs is a vital activity needed to ensure cyber resiliency of systems. It is applied for fault identification or facilitate cyber investigation and digital forensics. However, as logs belonging to different systems and components differ significantly, the challenge to perform such analysis is humanly challenging from the volume, variety and velocity of logs. This is further complicated by the lack or unavailability of anomalous log entries to develop trained machine learning or artificial intelligence models for such purposes. In this research work, we explore the use of a Retrieval Augmented Large Language Model that leverages a vector database to detect anomalies from logs. We used a Question and Answer configuration pipeline. To the best of our knowledge, our experiment which we called RAGLog is a novel one and the experimental results show much promise.
引用
收藏
页码:169 / 174
页数:6
相关论文
共 50 条
  • [1] anomalyDetection: Implementation of Augmented Network Log Anomaly Detection Procedures
    Gutierrez, Robert J.
    Boehmke, Bradley C.
    Bauer, Kenneth W.
    Saie, Cade M.
    Bihl, Trevor J.
    R JOURNAL, 2017, 9 (02): : 354 - 365
  • [2] Computer Log Anomaly Detection Using Frequent Episodes
    Halonen, Perttu
    Miettinen, Markus
    Hatonen, Kimmo
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS III, 2009, : 417 - 422
  • [3] A Study on Enhancing Anomaly Detection Technology with Synthetic-Log Generation
    Yamamoto, Takumi
    Iwasaki, Aiko
    Kobayashi, Hajime
    Kawauchi, Kiyoto
    Yoshimura, Ayako
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 2, 2022, 450 : 528 - 538
  • [4] Retrieval augmented generation using engineering design knowledge
    Siddharth, L.
    Luo, Jianxi
    KNOWLEDGE-BASED SYSTEMS, 2024, 303
  • [5] Access Log Anomaly Detection
    Tharshini, Ma
    Ragavinodini, M.
    Senthilkumar, Radha
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 375 - 381
  • [6] Anomaly detection using augmented negative selection algorithm
    Zeng, Jinquan
    JOURNAL OF BIOTECHNOLOGY, 2008, 136 : S112 - S112
  • [7] Active Retrieval Augmented Generation
    Jiang, Zhengbao
    Xu, Frank F.
    Gao, Luyu
    Sun, Zhiqing
    Liu, Qian
    Dwivedi-Yu, Jane
    Yang, Yiming
    Callan, Jamie
    Neubig, Graham
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7969 - 7992
  • [8] Anomaly Detection for Big Log Data Using a Hadoop Ecosystem
    Son, Siwoon
    Gil, Myeong-Seon
    Moon, Yang-Sae
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2017, : 377 - 380
  • [9] Unsupervised log message anomaly detection
    Farzad, Amir
    Gulliver, T. Aaron
    ICT EXPRESS, 2020, 6 (03): : 229 - 237
  • [10] Log anomaly detection based on BERT
    Tang, Pan
    Guan, Yepeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 6431 - 6441