A fast sorting-based aggregation method for symbolic time series representation

被引:0
作者
Chen, Xinye [1 ]
Guttel, Stefan [1 ]
机构
[1] Univ Manchester, Dept Math, Manchester, Lancs, England
来源
21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021 | 2021年
关键词
time series mining; algorithms; symbolic aggregation;
D O I
10.1109/ICDMW53433.2021.00131
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A new variant of the adaptive Brownian bridge-based approximation (ABBA) for the symbolic representation of time series is presented. This variant, called fABBA, utilizes a new aggregation approach tailored to the piecewise representation of time series. By replacing the k-means clustering used in ABBA with a sorting-based aggregation technique, and thereby avoiding repeated within-cluster-sum-of-squares computations, the computational complexity is significantly reduced. Also, in contrast to the original variant, the new approach does not require the number of time series symbols to be specified in advance. Through extensive tests and using performance profiles, we demonstrate that the new method significantly outperforms ABBA. In terms of reconstruction quality, we find that fABBA remains competitive to ABBA, despite the significant reduction in computation time, while it outperforms the popular SAX and Id-SAX representations of time series.
引用
收藏
页码:1009 / 1016
页数:8
相关论文
共 40 条
[1]  
[Anonymous], 2007, ACM Transactions on Knowledge Discovery from Data (TKDD), DOI [DOI 10.1145/1217299.1217300, DOI 10.1145/1297332.1297338]
[2]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[3]  
Bettaiah V, 2014, P 2014 ACM SE REG C, P1
[4]   Symbolic Representation of Time Series: A Hierarchical Coclustering Formalization [J].
Bondu, Alexis ;
Boulle, Marc ;
Cornuejols, Antoine .
ADVANCED ANALYSIS AND LEARNING ON TEMPORAL DATA, AALTD 2015, 2016, 9785 :3-16
[5]   Locally adaptive dimensionality reduction for indexing large time series databases [J].
Chakrabarti, K ;
Keogh, E ;
Mehrotra, S ;
Pazzani, M .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2002, 27 (02) :188-228
[6]   Robust path-based spectral clustering [J].
Chang, Hong ;
Yeung, Dit-Yan .
PATTERN RECOGNITION, 2008, 41 (01) :191-203
[7]   Benchmarking optimization software with performance profiles [J].
Dolan, ED ;
Moré, JJ .
MATHEMATICAL PROGRAMMING, 2002, 91 (02) :201-213
[8]  
Elsworth S., 2020, TIME SERIES FORECOST
[9]   ABBA: adaptive Brownian bridge-based symbolic aggregation of time series [J].
Elsworth, Steven ;
Guttel, Stefan .
DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (04) :1175-1200
[10]   Time-Series Data Mining [J].
Esling, Philippe ;
Agon, Carlos .
ACM COMPUTING SURVEYS, 2012, 45 (01)