MERLIN: Parameter-Free Discovery of Arbitrary Length Anomalies in Massive Time Series Archives

被引:79
作者
Nakamura, Takaaki [1 ]
Imamura, Makoto [2 ]
Mercer, Ryan [3 ]
Keogh, Eamonn [3 ]
机构
[1] Mitsubishi Electr Corp, Tokyo, Japan
[2] Tokai Univ, Hiratsuka, Kanagawa, Japan
[3] Univ Calif Riverside, Riverside, CA 92521 USA
来源
20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020) | 2020年
关键词
Time Series; Anomaly detection; Multi-Scale;
D O I
10.1109/ICDM50108.2020.00147
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series anomaly detection remains a perennially important research topic. If anything, it is a task that has become increasingly important in the burgeoning age of IoT. While there are hundreds of anomaly detection methods in the literature, one definition, time series discords, has emerged as a competitive and popular choice for practitioners. Time series discords are subsequences of a time series that are maximally far away from their nearest neighbors. Perhaps the most attractive feature of discords is their simplicity. Unlike many parameter laden methods, discords require only a single parameter to be set by the user: the subsequence length. In this work we argue that the utility of discords is reduced by sensitivity to this single user choice. The obvious solution to this problem, computing discords of all lengths then selecting the best anomalies (under some measure), seems to be computationally untenable. However, in this work we introduce MERLIN, an algorithm that can efficiently and exactly find discords of all lengths in massive time series archives.
引用
收藏
页码:1190 / 1195
页数:6
相关论文
共 15 条
[1]  
[Anonymous], 2017, NEUROCOMPUTING
[2]   The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances [J].
Bagnall, Anthony ;
Lines, Jason ;
Bostrom, Aaron ;
Large, James ;
Keogh, Eamonn .
DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) :606-660
[3]  
Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
[4]  
Filonov P., 2016, Multivariate industrial time series with cyber-attack simulation: Fault detection using an lstmbased predictive data model
[5]   Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic Thresholding [J].
Hundman, Kyle ;
Constantinou, Valentino ;
Laporte, Christopher ;
Colwell, Ian ;
Soderstrom, Tom .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :387-395
[6]   HOT SAX: Efficiently finding the most unusual time series subsequence [J].
Keogh, E ;
Lin, J ;
Fu, AD .
Fifth IEEE International Conference on Data Mining, Proceedings, 2005, :226-233
[7]  
Laptev N., 2015, S5-A Labeled Anomaly Detection Dataset, version 1.0(16M)
[8]   Approximations to magic: Finding unusual medical time series [J].
Lin, J ;
Keogh, E ;
Fu, A ;
Van Herle, H .
18th IEEE Symposium on Computer-Based Medical Systems, Proceedings, 2005, :329-334
[9]  
Yeh CCM, 2016, IEEE DATA MINING, P1317, DOI [10.1109/ICDM.2016.0179, 10.1109/ICDM.2016.89]
[10]   ON PROBLEMS IN WHICH A CHANGE IN A PARAMETER OCCURS AT AN UNKNOWN POINT [J].
PAGE, ES .
BIOMETRIKA, 1957, 44 (1-2) :248-252