Novel semi-metrics for multivariate change point analysis and anomaly detection

被引:25
作者
James, Nick [1 ]
Menzies, Max [2 ]
Azizi, Lamiae [1 ]
Chan, Jennifer [1 ]
机构
[1] Univ Sydney, Sch Math & Stat, Sydney, NSW, Australia
[2] Tsinghua Univ, Yau Math Sci Ctr, Beijing, Peoples R China
关键词
Semi-metrics; Change-point detection; Multivariate analysis; Time series; Anomaly detection; HAUSDORFF-LIKE METRICS; DISTANCE; SETS; ALGORITHM; SEQUENCE; SHIFT;
D O I
10.1016/j.physd.2020.132636
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper proposes a new method for determining similarity and anomalies between time series, most practically effective in large collections of (likely related) time series, by measuring distances between structural breaks within such a collection. We introduce a class of semi-metric distance measures, which we term MJ distances. These semi-metrics provide an advantage over existing options such as the Hausdorff and Wasserstein metrics. We prove they have desirable properties, including better sensitivity to outliers, while experiments on simulated data demonstrate that they uncover similarity within collections of time series more effectively. Semi-metrics carry a potential disadvantage: without the triangle inequality, they may not satisfy a "transitivity property of closeness.'' We analyse this failure with proof and introduce an computational method to investigate, in which we demonstrate that our semi-metrics violate transitivity infrequently and mildly. Finally, we apply our methods to cryptocurrency and measles data, introducing a judicious application of eigenvalue analysis. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 39 条
[1]  
[Anonymous], 2015, J STAT SOFTW, DOI DOI 10.18637/JSS.V066.I03
[2]   Mesh: Measuring errors between surfaces using the Hausdorff distance [J].
Aspert, N ;
Santa-Cruz, D ;
Ebrahimi, T .
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, :705-708
[3]   A LINEAR TIME ALGORITHM FOR THE HAUSDORFF DISTANCE BETWEEN CONVEX POLYGONS [J].
ATALLAH, MJ .
INFORMATION PROCESSING LETTERS, 1983, 17 (04) :207-209
[4]   COMPUTING SOME DISTANCE FUNCTIONS BETWEEN POLYGONS [J].
ATALLAH, MJ ;
RIBEIRO, CC ;
LIFSCHITZ, S .
PATTERN RECOGNITION, 1991, 24 (08) :775-781
[5]  
Axler S., 2015, LINEAR ALGEBRA DONE, DOI DOI 10.1007/978-3-319-11080-6
[6]  
Baddeley A., 1992, Nieuw Archief voor Wiskunde, V10, P157
[7]   Precise Hausdorff distance computation between polygonal meshes [J].
Barton, Michael ;
Hanniel, Iddo ;
Elber, Gershon ;
Kim, Myung-Soo .
COMPUTER AIDED GEOMETRIC DESIGN, 2010, 27 (08) :580-591
[8]   On Hausdorff-like metrics for fuzzy sets [J].
Boxer, L .
PATTERN RECOGNITION LETTERS, 1997, 18 (02) :115-118
[9]   On the nonexistence of Hausdorff-like metrics for fuzzy sets [J].
Brass, P .
PATTERN RECOGNITION LETTERS, 2002, 23 (1-3) :39-43
[10]   A modified Hausdorff distance between fuzzy sets [J].
Chaudhuri, BB ;
Rosenfeld, A .
INFORMATION SCIENCES, 1999, 118 (1-4) :159-171