The SLOGERT Framework for Automated Log Knowledge Graph Construction

被引:14
作者
Ekelhart, Andreas [1 ]
Ekaputra, Fajar J. [2 ]
Kiesling, Elmar [1 ]
机构
[1] WU Vienna Univ Econ & Business, Welthandelspl 1, A-1020 Vienna, Austria
[2] TU Wien Vienna Univ Technol, Favoritenstr 9-11-194, A-1040 Vienna, Austria
来源
SEMANTIC WEB, ESWC 2021 | 2021年 / 12731卷
基金
奥地利科学基金会;
关键词
Knowledge graphs; Log analysis; Log vocabularies; Graph modelling patterns;
D O I
10.1007/978-3-030-77385-4_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Log files are a vital source of information for keeping systems running and healthy. However, analyzing raw log data, i.e., textual records of system events, typically involves tedious searching for and inspecting clues, as well as tracing and correlating them across log sources. Existing log management solutions ease this process with efficient data collection, storage, and normalization mechanisms, but identifying and linking entities across log sources and enriching them with background knowledge is largely an unresolved challenge. To facilitate a knowledge-based approach to log analysis, this paper introduces SLOGERT, a flexible framework and workflow for automated construction of knowledge graphs from arbitrary raw log messages. At its core, it automatically identifies rich RDF graph modelling patterns to represent types of events and extracted parameters that appear in a log stream. We present the workflow, the developed vocabularies for log integration, and our prototypical implementation. To demonstrate the viability of this approach, we conduct a performance analysis and illustrate its application on a large public log dataset in the security domain.
引用
收藏
页码:631 / 646
页数:16
相关论文
共 42 条
[1]  
[Anonymous], 2019, INT JOINT C ART INT
[2]  
[Anonymous], 2011, P 9 WORKSH MIN LEARN
[3]  
[Anonymous], 2015, P 24 ACM INT C INFOR, DOI DOI 10.1145/2806416.2806498
[4]  
[Anonymous], 2010, 2010 INT C INF EM TE
[5]  
[Anonymous], 2016, IAICT, DOI DOI 10.1007/978-3-319-41763-921
[6]   Statistical Log Differencing [J].
Bao, Lingfeng ;
Busany, Nimrod ;
Lo, David ;
Maoz, Shahar .
34TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2019), 2019, :851-862
[7]  
Chang A X., 2014, 201402 CSTR STANF U
[8]  
Cui WD, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P17
[9]  
Donato D, 2010, LECT NOTES COMPUT SC, V6158, P126, DOI 10.1007/978-3-642-13962-8_14
[10]   DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning [J].
Du, Min ;
Li, Feifei ;
Zheng, Guineng ;
Srikumar, Vivek .
CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1285-1298