Time series anomaly detection based on shapelet learning

被引:39
作者
Beggel, Laura [1 ,2 ]
Kausler, Bernhard X. [1 ]
Schiegg, Martin [1 ]
Pfeiffer, Michael [1 ]
Bischl, Bernd [2 ]
机构
[1] Robert Bosch GmbH, Bosch Ctr Artificial Intelligence, Robert Bosch Campus 1, D-71272 Renningen, Germany
[2] Ludwig Maximilians Univ Munchen, Dept Stat, Munich, Germany
关键词
Unsupervised learning; Feature learning; Support vector data description; Block-coordinate descent;
D O I
10.1007/s00180-018-0824-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of learning to detect anomalous time series from an unlabeled data set, possibly contaminated with anomalies in the training data. This scenario is important for applications in medicine, economics, or industrial quality control, in which labeling is difficult and requires expensive expert knowledge, and anomalous data is difficult to obtain. This article presents a novel method for unsupervised anomaly detection based on the shapelet transformation for time series. Our approach learns representative features that describe the shape of time series stemming from the normal class, and simultaneously learns to accurately detect anomalous time series. An objective function is proposed that encourages learning of a feature representation in which the normal time series lie within a compact hypersphere of the feature space, whereas anomalous observations will lie outside of a decision boundary. This objective is optimized by a block-coordinate descent procedure. Our method can efficiently detect anomalous time series in unseen test data without retraining the model by reusing the learned feature representation. We demonstrate on multiple benchmark data sets that our approach reliably detects anomalous time series, and is more robust than competing methods when the training instances contain anomalous time series.
引用
收藏
页码:945 / 976
页数:32
相关论文
共 34 条
  • [1] [Anonymous], 2016, The analysis of time series: an introduction. s.l
  • [2] Bagnall A, 2017, UEA UCR TIME SERIES
  • [3] The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
    Bagnall, Anthony
    Lines, Jason
    Bostrom, Aaron
    Large, James
    Keogh, Eamonn
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) : 606 - 660
  • [4] Boyd S., 2004, Lecture notes of EE392o
  • [5] Boyd Stephen, 2004, Convex Optimization, DOI 10.1017/CBO9780511804441
  • [6] Brodersen Kay H., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3121, DOI 10.1109/ICPR.2010.764
  • [7] Chandola V., 2009, THESIS
  • [8] Chuah MC, 2007, LECT NOTES COMPUT SC, V4743, P123
  • [9] Emmott A. F., 2013, P ACM SIGKDD WORKSH, P16, DOI [10.1145/2500853.2500858, DOI 10.1145/2500853.2500858]
  • [10] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    [J]. ACM COMPUTING SURVEYS, 2012, 45 (01)