Software Logs for Machine Learning in a DevOps Environment

被引:4
作者
Bosch, Nathan [1 ]
Bosch, Jan [2 ]
机构
[1] Ericsson AB, Dev Unit, Radio Serv & Infra Analyt DSI Analyt, Gothenburg, Sweden
[2] Chalmers Univ Technol, Dept Comp Sci & Engn, Gothenburg, Sweden
来源
2020 46TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2020) | 2020年
关键词
System logs; machine learning; data preprocessing; data generation; DevOps;
D O I
10.1109/SEAA51224.2020.00016
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
System logs perform a critical function in software-intensive systems as logs record the state of the system and significant events in the system at important points in time. Unfortunately, log entries are typically created in an ad-hoc, unstructured and uncoordinated fashion, limiting their usefulness for analytics and machine learning. In a DevOps environment, especially, unmanaged evolution in log data structure causes frequent disruption of operations in automated data pipelines, dashboards and analytics. In this paper, we present the main challenges of contemporary approaches to generating, storing and managing the evolution of system logs data for large, complex, software-intensive systems based on an in-depth case study at a world-leading telecommunications company. Second, we present an approach for generating and managing the evolution of log data in a DevOps environment that does not suffer from the aforementioned challenges and is optimized for use in machine learning. Third, we provide validation of the approach based on expert interviews that confirm that the approach addresses the identified challenges and problems.
引用
收藏
页码:29 / 33
页数:5
相关论文
共 15 条
[1]  
[Anonymous], 2008, WORKSH TACKL COMP PR
[2]  
Bosch Jan, 2017, SPEED DAT EC EXC SOF
[3]   Recurrent Neural Network Attention Mechanisms for Interpretable System Log Anomaly Detection [J].
Brown, Andy ;
Tuor, Aaron ;
Hutchinson, Brian ;
Nichols, Nicole .
PROCEEDINGS OF THE 1ST WORKSHOP ON MACHINE LEARNING FOR COMPUTING SYSTEMS (MLCS 2018), 2018,
[4]   Event Logs for the Analysis of Software Failures: A Rule-Based Approach [J].
Cinque, Marcello ;
Cotroneo, Domenico ;
Pecchia, Antonio .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (06) :806-821
[5]   DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning [J].
Du, Min ;
Li, Feifei ;
Zheng, Guineng ;
Srikumar, Vivek .
CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1285-1298
[6]  
Du M, 2016, IEEE DATA MINING, P859, DOI [10.1109/ICDM.2016.160, 10.1109/ICDM.2016.0103]
[7]  
Gunther C. W., 2014, XES STANDARD DEFINIT, DOI [10.1111/j.1477-2574.2011.00330.x, DOI 10.1111/J.1477-2574.2011.00330.X]
[8]   Drain: An Online Log Parsing Approach with Fixed Depth Tree [J].
He, Pinjia ;
Zhu, Jieming ;
Zheng, Zibin ;
Lyu, Michael R. .
2017 IEEE 24TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2017), 2017, :33-40
[9]  
Lou J.G., 2010, P 16 ACM SIGKDD INT, P613, DOI DOI 10.1145/1835804.1835883
[10]   A Search-based Approach for Accurate Identification of Log Message Formats [J].
Messaoudi, Salma ;
Panichella, Annibale ;
Bianculli, Domenico ;
Briand, Lionel ;
Sasnauskas, Raimondas .
2018 IEEE/ACM 26TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2018), 2018, :167-177