Time-series data dynamic density clustering

被引:0
作者
Chen, Hao [1 ,2 ]
Xia, Yu [1 ]
Pan, Yuekai [1 ]
Yang, Qing [3 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Shaanxi, Peoples R China
[2] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent P, Xian, Shaanxi, Peoples R China
[3] Xian Phys Educ Univ, Sch Sport & Hlth Sci, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Time-series data; dynamic clustering; density clustering;
D O I
10.3233/IDA-205459
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many clustering problems, the whole data is not always static. Over time, part of it is likely to be changed, such as updated, erased, etc. Suffer this effect, the timeline can be divided into multiple time segments. And, the data at each time slice is static. Then, the data along the timeline shows a series of dynamic intermediate states. The union set of data from all time slices is called the time-series data. Obviously, the traditional clustering process does not apply directly to the time-series data. Meanwhile, repeating the clustering process at every time slices costs tremendous. In this paper, we analyze the transition rules of the data set and cluster structure when the time slice shifts to the next. We find there is a distinct correlation of data set and succession of cluster structure between two adjacent ones, which means we can use it to reduce the cost of the whole clustering process. Inspired by it, we propose a dynamic density clustering method (DDC) for time-series data. In the simulations, we choose 6 representative problems to construct the time-series data for testing DDC. The results show DDC can get high accuracy results for all 6 problems while reducing the overall cost markedly.
引用
收藏
页码:1487 / 1506
页数:20
相关论文
共 29 条
[1]  
Agrawal R., 1993, Foundations of Data Organization and Algorithms. 4th International Conference. FODO '93 Proceedings, P69
[2]   Dynamic clustering of residential electricity consumption time series data based on Hausdorff distance [J].
Benitez, Ignacio ;
Diez, Jose-Luis ;
Quijano, Alfredo ;
Delgado, Ignacio .
ELECTRIC POWER SYSTEMS RESEARCH, 2016, 140 :517-526
[3]  
Ester M., 1996, PROC 2 INT C KNOWLED, P226, DOI DOI 10.5555/3001460.3001507
[4]  
Fan Zhongxin, 2019, Journal of Computer Applications, V39, P1027, DOI 10.11772/j.issn.1001-9081.2018081790
[5]   A review on time series data mining [J].
Fu, Tak-chung .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (01) :164-181
[6]   KmL: k-means for longitudinal data [J].
Genolini, Christophe ;
Falissard, Bruno .
COMPUTATIONAL STATISTICS, 2010, 25 (02) :317-328
[7]  
[韩利钊 Han Lizhao], 2018, [计算机应用研究, Application Research of Computers], V35, P1668
[8]  
He X.X., 2014, THESIS U SCI TECHNOL
[9]   Agreement-based fuzzy C-means for clustering data with blocks of features [J].
Izakian, Hesam ;
Pedrycz, Witold .
NEUROCOMPUTING, 2014, 127 :266-280
[10]  
[嵇敏 Ji Min], 2016, [系统科学与数学, Journal of Systems Science and Mathematical Sciences], V36, P53