Correlating Events with Time Series for Incident Diagnosis

Cited by: 81
Authors
Luo, Chen [1]
Lou, Jian-Guang [2]
Lin, Qingwei [2]
Fu, Qiang [2]
Ding, Rui [2]
Zhang, Dongmei [2]
Wang, Zhe [1]
Affiliations
[1] Jilin Univ, Changchun, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
Source
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14) | 2014
Keywords
Correlation; Incident Diagnosis; Two-sample Problem
DOI
10.1145/2623330.2623374
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As online services have become more and more popular, incident diagnosis has emerged as a critical task in minimizing service downtime and ensuring high quality of the services provided. For most online services, incident diagnosis is mainly conducted by analyzing a large amount of telemetry data collected from the services at runtime. Time series data and event sequence data are two major types of telemetry data. Techniques of correlation analysis are important tools that are widely used by engineers for data-driven incident diagnosis. Despite their importance, there has been little previous work addressing the correlation between two types of heterogeneous data for incident diagnosis: continuous time series data and temporal event data. In this paper, we propose an approach to evaluate the correlation between time series data and event data. Our approach is capable of discovering three important aspects of event-time series correlation in the context of incident diagnosis: existence of correlation, temporal order, and monotonic effect. Our experimental results on simulation data sets and two real data sets demonstrate the effectiveness of the algorithm.
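The framing named in the keywords (a two-sample problem) can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract does not spell out. It assumes that windows of the time series taken just before and just after each event are compared against randomly placed windows, and it substitutes a simple permutation test on window means for whatever test the paper uses. All function names, the window width, and the synthetic data are hypothetical.

```python
import random
from statistics import mean


def window_means(series, starts, width):
    """Mean of each length-`width` window that fits entirely inside the series."""
    return [mean(series[s:s + width])
            for s in starts if 0 <= s and s + width <= len(series)]


def permutation_p_value(sample_a, sample_b, n_perm=2000, seed=0):
    """Two-sided permutation test on the difference of sample means."""
    rng = random.Random(seed)
    observed = abs(mean(sample_a) - mean(sample_b))
    pooled = list(sample_a) + list(sample_b)
    n_a = len(sample_a)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if abs(mean(pooled[:n_a]) - mean(pooled[n_a:])) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


def correlate(series, event_times, width=5, n_random=200, seed=0):
    """Compare pre-event and post-event windows against random windows."""
    rng = random.Random(seed)
    random_starts = [rng.randrange(0, len(series) - width + 1)
                     for _ in range(n_random)]
    baseline = window_means(series, random_starts, width)
    before = window_means(series, [t - width for t in event_times], width)
    after = window_means(series, list(event_times), width)
    return {
        # Existence and temporal order: which side of the event differs?
        "p_before": permutation_p_value(before, baseline, seed=seed),
        "p_after": permutation_p_value(after, baseline, seed=seed),
        # Sign of the shift (increase or decrease) after the event.
        "effect_after": mean(after) - mean(baseline),
    }


# Synthetic data (assumption): noise with a level shift right after each event.
data_rng = random.Random(1)
series = [data_rng.gauss(0.0, 1.0) for _ in range(2000)]
events = list(range(100, 1900, 100))
for t in events:
    for i in range(t, t + 5):
        series[i] += 5.0

result = correlate(series, events)
```

In this sketch, a small `p_after` together with an unremarkable `p_before` would suggest that the series changes after the event rather than before it, and the sign of `effect_after` indicates the direction of that change. A test on window means only detects shifts in level, so a real analysis would likely need a test sensitive to changes in the shape of the subseries as well.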
Pages: 1583-1592
Page count: 10