Local Evaluation of Time Series Anomaly Detection Algorithms

被引:47
作者
Huet, Alexis [1 ]
Navarro, Jose Manuel [1 ]
Rossi, Dario [1 ]
机构
[1] Huawei Technol Co Ltd, Boulogne Billancourt, France
来源
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年
关键词
time series; anomaly detection; evaluation; metrics; precision; recall;
D O I
10.1145/3534678.3539339
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, specific evaluation metrics for time series anomaly detection algorithms have been developed to handle the limitations of the classical precision and recall. However, such metrics are heuristically built as an aggregate of multiple desirable aspects, introduce parameters and wipe out the interpretability of the output. In this article, we first highlight the limitations of the classical precision/recall, as well as the main issues of the recent eventbased metrics s for instance, we show that an adversary algorithm can reach high precision and recall on almost any dataset under weak assumption. To cope with the above problems, we propose a theoretically grounded, robust, parameter-free and interpretable extension to precision/recall metrics, based on the concept of laffiliationz between the ground truth and the prediction sets. Our metrics leverage measures of duration between ground truth and predictions, and have thus an intuitive interpretation. By further comparison against random sampling, we obtain a normalized precision/recall, quantifying how much a given set of results is better than a random baseline prediction. By construction, our approach keeps the evaluation local regarding ground truth events, enabling fine-grained visualization and interpretation of algorithmic results. We compare our proposal against various public time series anomaly detection datasets, algorithms and metrics. We further derive theoretical properties of the affiliation metrics that give explicit expectations about their behavior and ensure robustness against adversary strategies.
引用
收藏
页码:635 / 645
页数:11
相关论文
共 31 条
[1]   Unsupervised real-time anomaly detection for streaming data [J].
Ahmad, Subutai ;
Lavin, Alexander ;
Purdy, Scott ;
Agha, Zuha .
NEUROCOMPUTING, 2017, 262 :134-147
[2]  
[Anonymous], 2014, LWA
[3]  
Deza M.M., 2009, Encyclopedia of distances, DOI [DOI 10.1007/978-3-642-00234-2, 10.1007/978-3-642-00234-2, DOI 10.1007/978-3-642-00234-2_1, 10.1007/978-3-662-44342-2]
[4]  
DUBUISSON MP, 1994, INT C PATT RECOG, P566, DOI 10.1109/ICPR.1994.576361
[5]   Metrics based on average distance between sets [J].
Fujita, Osamu .
JAPAN JOURNAL OF INDUSTRIAL AND APPLIED MATHEMATICS, 2013, 30 (01) :1-19
[6]  
Garg A., 2021, An evaluation of anomaly detection and diagnosis in multivariate time series
[7]   Matrix Profile VIII: Domain Agnostic Online Semantic Segmentation at Superhuman Performance Levels [J].
Gharghabi, Shaghayegh ;
Ding, Yifei ;
Yeh, Chin-Chia Michael ;
Kamgar, Kaveh ;
Ulanova, Liudmila ;
Keogh, Eamonn .
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, :117-126
[8]   A Dataset to Support Research in the Design of Secure Water Treatment Systems [J].
Goh, Jonathan ;
Adepu, Sridhar ;
Junejo, Khurum Nazir ;
Mathur, Aditya .
CRITICAL INFORMATION INFRASTRUCTURES SECURITY (CRITIS 2016), 2018, 10242 :88-99
[9]  
Huet Alexis, 2022, Affiliation precision/recall library, DOI DOI 10.6084/M9.FIGSHARE.19131425
[10]   Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic Thresholding [J].
Hundman, Kyle ;
Constantinou, Valentino ;
Laporte, Christopher ;
Colwell, Ian ;
Soderstrom, Tom .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :387-395