Incremental Analysis of Large-Scale System Logs for Anomaly Detection

被引:0
作者
Astekin, Merve [1 ]
Ozcan, Selim [1 ]
Sozer, Hasan [2 ]
机构
[1] TUBITAK BILGEM, Inst Informat Technol, Kocaeli, Turkey
[2] Ozyegin Univ, Dept Comp Sci, Istanbul, Turkey
来源
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2019年
关键词
log analysis; distributed systems; parallel processing; anomaly detection; big data; machine learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomalies during system execution can be detected by automated analysis of logs generated by the system. However, large scale systems can generate tens of millions of lines of logs within days. Centralized implementations of traditional machine learning algorithms are not scalable for such data. Therefore, we recently introduced a distributed log analysis framework for anomaly detection. In this paper, we introduce an extension of this framework, which can detect anomalies earlier via incremental analysis instead of the existing offline analysis approach. In the extended version, we periodically process the log data that is accumulated so far. We conducted controlled experiments based on a benchmark dataset to evaluate the effectiveness of this approach. We repeated our experiments with various periods that determine the frequency of analysis as well as the size of the data processed each time. Results showed that our online analysis can improve anomaly detection time significantly while keeping the accuracy level same as that is obtained with the offline approach. The only exceptional case, where the accuracy is compromised, rarely occurs when the analysis is triggered before all the log data associated with a particular session of events are collected.
引用
收藏
页码:2119 / 2127
页数:9
相关论文
共 50 条
[41]   DeepEAD: Explainable Anomaly Detection from System Logs [J].
Wang, Xinda ;
Kim, Kyeong Jin ;
Wang, Ye ;
Koike-Akino, Toshiaki ;
Parsons, Kieran .
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, :771-776
[42]   AutoLog: Anomaly detection by deep autoencoding of system logs [J].
Catillo, Marta ;
Pecchia, Antonio ;
Villano, Umberto .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
[43]   Robust KPI Anomaly Detection for Large-Scale Software Services with Partial Labels [J].
Zhang, Shenglin ;
Zhao, Chenyu ;
Sui, Yicheng ;
Su, Ya ;
Sun, Yongqian ;
Zhang, Yuzhi ;
Pei, Dan ;
Wang, Yizhe .
2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, :103-114
[44]   A Hybrid Approach for Anomaly Detection on Large-scale Networks using HWDS and Entropy [J].
de Assis, Marcos V. O. ;
Rodrigues, Joel J. P. C. ;
Proenca, Mario Lemes, Jr. .
2013 21ST INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM 2013), 2013, :295-299
[45]   Anomaly Detection for Data Streams in Large-Scale Distributed Heterogeneous Computing Environments [J].
Dang, Yue ;
Wang, Bin ;
Brant, Ryan ;
Zhang, Zhiping ;
Alqallaf, Maha ;
Wu, Zhiqiang .
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS 2017), 2017, :121-130
[46]   SafeDrive: Online Driving Anomaly Detection From Large-Scale Vehicle Data [J].
Zhang, Mingming ;
Chen, Chao ;
Wo, Tianyu ;
Xie, Tao ;
Bhuiyan, Md Zakirul Alam ;
Lin, Xuelian .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (04) :2087-2096
[47]   Detection and analysis of real-time anomalies in large-scale complex system [J].
Chen, Siya ;
Jin, G. ;
Ma, Xinyu .
MEASUREMENT, 2021, 184
[48]   A new online anomaly learning and detection for large-scale service of Internet of Thing [J].
JunPing Wang ;
Qiuming Kuang ;
ShiHui Duan .
Personal and Ubiquitous Computing, 2015, 19 :1021-1031
[49]   Anomaly detection in large-scale networks: A state-space decision process [J].
Alghuried, Abdullah ;
Moghaddass, Ramin .
JOURNAL OF QUALITY TECHNOLOGY, 2022, 54 (01) :65-92
[50]   Fast clustering and anomaly detection technique for large-scale power data stream [J].
Wang G. ;
Zhou G. ;
Zhao H. ;
Mi Z. .
Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2016, 40 (24) :27-33