A Lightweight Algorithm for Message Type Extraction in System Application Logs

被引:98
作者
Makanju, Adetokunbo [1 ]
Zincir-Heywood, A. Nur [1 ]
Milios, Evangelos E. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 4R2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Algorithms; experimentation; event log mining; fault management; clustering; MINING FREQUENT PATTERNS;
D O I
10.1109/TKDE.2011.138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Message type or message cluster extraction is an important task in the analysis of system logs in computer networks. Defining these message types automatically facilitates the automatic analysis of system logs. When the message types that exist in a log file are represented explicitly, they can form the basis for carrying out other automatic application log analysis tasks. In this paper, we introduce a novel algorithm for carrying out message type extraction from event log files. IPLoM, which stands for Iterative Partitioning Log Mining, works through a 4-step process. The first three steps hierarchically partition the event log into groups of event log messages or event clusters. In its fourth and final stage, IPLoM produces a message type description or line format for each of the message clusters. IPLoM is able to find clusters in data irrespective of the frequency of its instances in the data, it scales gracefully in the case of long message type patterns and produces message type descriptions at a level of abstraction, which is preferred by a human observer. Evaluations show that IPLoM outperforms similar algorithms statistically significantly.
引用
收藏
页码:1921 / 1936
页数:16
相关论文
共 34 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]  
Agrawal R., 1998, P ACM SIGMOD INT C M
[3]  
[Anonymous], 2009, US COMP FAIL DAT REP
[4]  
Bellec J. -H., 2007, PIKM 07, P55
[5]  
Cohen I., 2005, Proceedings of the twentieth ACM symposium on Operating systems principles, SOSP '05, (New York, NY, USA), P105, DOI [10.1145/1095810.1095821, DOI 10.1145/1095810.1095821]
[6]   Web Services Navigator: Visualizing the execution of Web Services [J].
De Pauw, W ;
Lei, M ;
Pring, E ;
Villard, L ;
Arnold, M ;
Morar, JF .
IBM SYSTEMS JOURNAL, 2005, 44 (04) :821-845
[7]   Execution Anomaly Detection in Distributed Systems through Unstructured Log Analysis [J].
Fu, Qiang ;
Lou, Jian-Guang ;
Wang, Yi ;
Li, Jiang .
2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, :149-+
[8]  
Goil S., 1999, ORAL HLTH STATUS ORA
[9]   Generic adapter logging toolkit [J].
Grabarnik, G ;
Salahshour, A ;
Subramanian, B ;
Ma, S .
INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2004, :308-309
[10]  
Guha S., 1998, CURE, P73, DOI DOI 10.1145/276305.276312