Correlating Events with Time Series for Incident Diagnosis

被引:81
作者
Luo, Chen [1 ]
Lou, Jian-Guang [2 ]
Lin, Qingwei [2 ]
Fu, Qiang [2 ]
Ding, Rui [2 ]
Zhang, Dongmei [2 ]
Wang, Zhe [1 ]
机构
[1] Jilin Univ, Changchun, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14) | 2014年
关键词
Correlation; Incident Diagnosis; Two-sample Problem;
D O I
10.1145/2623330.2623374
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As online services have more and more popular, incident diagnosis has emerged as a critical task in minimizing the service downtime and ensuring high quality of the services provided. For most online services, incident diagnosis is mainly conducted by analyzing a large amount of telemetry data collected from the services at runtime. Time series data and event sequence data are two major types of telemetry data. Techniques of correlation analysis are important tools that are widely used by engineers for data-driven incident diagnosis. Despite their importance, there has been little previous work addressing the correlation between two types of heterogeneous data for incident diagnosis: continuous time series data and temporal event data. In this paper, we propose an approach to evaluate the correlation between time series data and event data. Our approach is capable of discovering three important aspects of event-timeseries correlation in the context of incident diagnosis: existence of correlation, temporal order, and monotonic effect. Our experimental results on simulation data sets and two real data sets demonstrate the effectiveness of the algorithm.
引用
收藏
页码:1583 / 1592
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 2008, INFORM WEEK
[2]  
[Anonymous], 1993, DETECTION ABRUPT CHA
[3]   Towards highly reliable enterprise network services via inference of multi-level dependencies [J].
Bahl, Paramvir ;
Chandra, Ranveer ;
Greenberg, Albert ;
Kandula, Srikanth ;
Maltz, David A. ;
Zhang, Ming .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2007, 37 (04) :13-24
[4]  
Berndt DJ., 1994, USING DYNAMIC TIME W, DOI DOI 10.5555/3000850.3000887
[5]  
Chen YP, 2013, 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), P383
[6]  
Cohen I, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P231
[7]  
Cohen I., 2005, Proceedings of the twentieth ACM symposium on Operating systems principles, SOSP '05, (New York, NY, USA), P105, DOI [10.1145/1095810.1095821, DOI 10.1145/1095810.1095821]
[8]  
Cohen J., 1988, Statistical power analysis for the behavioral sciences, VSecond
[9]  
Fu Q., 2012, SRDS
[10]  
Gretton A., 2007, KERNEL METHOD 2 SAMP, V19, P513