Anomaly Detection in Streams with Extreme Value Theory

被引:341
作者
Siffer, Alban [1 ]
Fouque, Pierre-Alain [2 ]
Termier, Alexandre [3 ]
Largouet, Christine [4 ]
机构
[1] INRIA, IRISA, Amossys, Villers Les Nancy, France
[2] Univ Rennes 1, IUF, IRISA, Rennes, France
[3] Univ Rennes 1, INRIA, IRISA, Rennes, France
[4] INRIA, AgroCampus, IRISA, Villers Les Nancy, France
来源
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2017年
关键词
Outliers in time series; Extreme Value Theory; Streaming;
D O I
10.1145/3097983.3098144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection in time series has attracted considerable attention due to its importance in many real-world applications including intrusion detection, energy management and finance. Most approaches for detecting outliers rely on either manually set thresholds or assumptions on the distribution of data according to Chandola, Banerjee and Kumar. Here, we propose a new approach to detect outliers in streaming univariate time series based on Extreme Value Theory that does not require to hand-set thresholds and makes no assumption on the distribution: the main parameter is only the risk, controlling the number of false positives. Our approach can be used for outlier detection, but more generally for automatically setting thresholds, making it useful in wide number of situations. We also experiment our algorithms on various real-world datasets which confirm its soundness and efficiency.
引用
收藏
页码:1067 / 1075
页数:9
相关论文
共 27 条
[1]  
Agarwal Deepak, 2005, ICDM
[2]  
Angiulli F., 2007, P 16 ACM C C INF KNO
[3]  
Angiulli Fabrizio, 2006, IEEE T KNOWLEDGE DAT
[4]  
[Anonymous], 2000, ICML
[5]  
[Anonymous], 1995, SIAM J SCI COMPUTING
[6]  
[Anonymous], 2010, P 2010 ACM C EM NETW, DOI DOI 10.1145/1921168.1921179
[7]   RESIDUAL LIFE TIME AT GREAT AGE [J].
BALKEMA, AA ;
DEHAAN, L .
ANNALS OF PROBABILITY, 1974, 2 (05) :792-804
[8]  
Beirlant J., 2006, Statistics of extremes: theory and applications
[9]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[10]   Anomaly Detection: A Survey [J].
Chandola, Varun ;
Banerjee, Arindam ;
Kumar, Vipin .
ACM COMPUTING SURVEYS, 2009, 41 (03)