Incremental Analysis of Large-Scale System Logs for Anomaly Detection

被引:0
|
作者
Astekin, Merve [1 ]
Ozcan, Selim [1 ]
Sozer, Hasan [2 ]
机构
[1] TUBITAK BILGEM, Inst Informat Technol, Kocaeli, Turkey
[2] Ozyegin Univ, Dept Comp Sci, Istanbul, Turkey
来源
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2019年
关键词
log analysis; distributed systems; parallel processing; anomaly detection; big data; machine learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomalies during system execution can be detected by automated analysis of logs generated by the system. However, large scale systems can generate tens of millions of lines of logs within days. Centralized implementations of traditional machine learning algorithms are not scalable for such data. Therefore, we recently introduced a distributed log analysis framework for anomaly detection. In this paper, we introduce an extension of this framework, which can detect anomalies earlier via incremental analysis instead of the existing offline analysis approach. In the extended version, we periodically process the log data that is accumulated so far. We conducted controlled experiments based on a benchmark dataset to evaluate the effectiveness of this approach. We repeated our experiments with various periods that determine the frequency of analysis as well as the size of the data processed each time. Results showed that our online analysis can improve anomaly detection time significantly while keeping the accuracy level same as that is obtained with the offline approach. The only exceptional case, where the accuracy is compromised, rarely occurs when the analysis is triggered before all the log data associated with a particular session of events are collected.
引用
收藏
页码:2119 / 2127
页数:9
相关论文
共 50 条
  • [1] DILAF: A framework for distributed analysis of large-scale system logs for anomaly detection
    Astekin, Merve
    Zengin, Harun
    Sozer, Hasan
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (02): : 153 - 170
  • [2] Evaluation of Distributed Machine Learning Algorithms for Anomaly Detection from Large-Scale System Logs: A Case Study
    Astekin, Merve
    Zengin, Harun
    Sozer, Hasan
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2071 - 2077
  • [3] LogFlash: Real-time Streaming Anomaly Detection and Diagnosis from System Logs for Large-scale Software Systems
    Jia, Tong
    Wu, Yifan
    Hou, Chuanjia
    Li, Ying
    2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, : 80 - 90
  • [4] A novel multi-modal incremental tensor decomposition for anomaly detection in large-scale networks
    Fan, Rongqiao
    Fan, Qiyuan
    Li, Xue
    Wang, Puming
    Xu, Jing
    Jin, Xin
    Yao, Shaowen
    Liu, Peng
    INFORMATION SCIENCES, 2024, 681
  • [5] Large Scale Anomaly Detection in Data Center Logs and Metrics
    Martinez-Alvarez, Rafael P.
    Giraldo-Rodriguez, Carlos
    Chaves-Dieguez, David
    ECSA 2018: PROCEEDINGS OF THE 12TH EUROPEAN CONFERENCE ON SOFTWARE ARCHITECTURE: COMPANION PROCEEDINGS, 2018,
  • [6] Hybrid Anomaly Detection and Prioritization for Network Logs at Cloud Scale
    Ohana, David
    Wassermann, Bruno
    Dupuis, Nicolas
    Kolodner, Elliot
    Raichstein, Eran
    Malka, Michal
    PROCEEDINGS OF THE SEVENTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS '22), 2022, : 236 - 250
  • [7] Performance Anomaly and Change Point Detection For Large-Scale System Management
    Trubin, Igor
    ICPE'20: COMPANION OF THE ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, 2020, : 7 - 7
  • [8] Anomaly Detection in a Large-scale Cloud Platform
    Islam, Mohammad S.
    Pourmajidi, William
    Zhang, Lei
    Steinbacher, John
    Erwin, Tony
    Miranskyy, Andriy
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP 2021), 2021, : 150 - 159
  • [9] A Survey of Deep Anomaly Detection for System Logs
    Zhao, Xiaoqing
    Jiang, Zhongyuan
    Ma, Jianfeng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] Hierarchical Anomaly Detection and Multimodal Classification in Large-Scale Photovoltaic Systems
    Zhao, Yingying
    Liu, Qi
    Li, Dongsheng
    Kang, Dahai
    Lv, Qin
    Shang, Li
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2019, 10 (03) : 1351 - 1361