An Empirical Approach for Clustering-Based Time Series Summarisation Assessment

被引:1
作者
Bianchini, Devis [1 ]
Garda, Massimiliano [1 ]
机构
[1] Univ Brescia, Dept Informat Engn, Via Branze 38, I-25123 Brescia, Italy
来源
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024 | 2024年
关键词
Time Series; Data Summarisation; Incremental Clustering; Data Stream Analysis; DATA STREAM;
D O I
10.1109/COMPSAC61105.2024.00046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last decades, the rise of Big Data solutions has significantly advanced the analysis of time series data as representation of dynamic phenomena through sequences of observations. Recent research efforts have advocated for the adoption of data summarisation techniques, such as incremental clustering, to promptly capture data evolution, thus facilitating domain experts in making informed and proactive decisions, capitalising on a compact representation of time series. Nevertheless, while incremental clustering effectively reduces data volume, thus preserving relevant statistical information, it is crucial to estimate the degree of approximation between the original time series data and its summarised version. This evaluation is pivotal whenever the summarisation output is the starting point to set up complex analytical pipelines (e.g., for pattern recognition and anomaly detection purposes). Stemming from practical and empirical considerations made upon both a synthetic and a realworld dataset, we propose in this paper a variant of a renowned quality metric for incremental clustering, to assess the extent to which the time series summary accurately captures the dynamics of the original data.
引用
收藏
页码:279 / 284
页数:6
相关论文
共 20 条
[1]  
Aggarwal Charu C., 2003, P 2003 VLDB C, P81, DOI DOI 10.1016/B978-012722442-8/50016-1
[2]   Data summarization: a survey [J].
Ahmed, Mohiuddin .
KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 58 (02) :249-273
[3]   Data stream analysis: Foundations, major tasks and tools [J].
Bahri, Maroua ;
Bifet, Albert ;
Gama, Joao ;
Gomes, Heitor Murilo ;
Maniu, Silviu .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 11 (03)
[4]   A big data exploration approach to exploit in-vehicle data for smart road maintenance [J].
Bianchini, Devis ;
De Antonellis, Valeria ;
Garda, Massimiliano .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 149 :701-716
[5]   Density-Based Clustering over an Evolving Data Stream with Noise [J].
Cao, Feng ;
Ester, Martin ;
Qian, Weining ;
Zhou, Aoying .
PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, :328-+
[6]   Optimizing Data Stream Representation: An Extensive Survey on Stream Clustering Algorithms [J].
Carnein, Matthias ;
Trautmann, Heike .
BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2019, 61 (03) :277-297
[7]  
Chen YX, 2007, KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P133
[8]   A single pass algorithm for clustering evolving data streams based on swarm intelligence [J].
Forestiero, Agostino ;
Pizzuti, Clara ;
Spezzano, Giandomenico .
DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 26 (01) :1-26
[9]  
Ghosal Attri, 2020, Emerging Technology in Modelling and Graphics. Proceedings of IEM Graph 2018. Advances in Intelligent Systems and Computing (AISC 937), P69, DOI 10.1007/978-981-13-7403-6_9
[10]   Big data stream analysis: a systematic literature review [J].
Kolajo, Taiwo ;
Daramola, Olawande ;
Adebiyi, Ayodele .
JOURNAL OF BIG DATA, 2019, 6 (01)