Trendlets: A novel probabilistic representational structures for clustering the time series data

被引:27
作者
Johnpaul, C., I [1 ,2 ]
Prasad, Munaga V. N. K. [2 ]
Nickolas, S. [1 ]
Gangadharan, G. R. [1 ]
机构
[1] Natl Inst Technol, Tiruchirappalli, Tamil Nadu, India
[2] Inst Dev & Res Banking Technol, Hyderabad, India
关键词
Time series; Clustering; Pattern matching; Trend; Seasonality; SELECTION;
D O I
10.1016/j.eswa.2019.113119
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series data is a sequence of values recorded systematically over a period which are mostly used for prediction, clustering, and analysis. The two essential features of a time series data are trend and seasonality. Preprocessing of the time series data is necessary for performing prediction tasks. In most of the cases, the trend and the seasonality are removed before applying the regression algorithms. The accuracy of such algorithms depends upon the functions used for the removal of trend and seasonality. Clustering of an unlabeled time series data with the presence of trend and seasonality is challenging. In this paper, we propose a probabilistic representational learning method for grouping the time series data. We introduce five terminologies in our method of clustering namely the trendlets, uplets, downlets, equalets and trendlet string. These elements are the representational building blocks of our proposed method. Experiments on the proposed algorithm are performed with the renewable energy data on the electricity supply system of continental Europe which includes the demand and inflow of renewable energy for the term 2012 to 2014 and UCR-2018 time series archive containing 128 datasets. We compared our proposed representational method with various clustering algorithms using the silhouette score. Mini-batch k-means and agglomerative hierarchical clustering algorithms show better performance in terms of quality, logical accordance with data and time taken for clustering. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:21
相关论文
共 37 条
  • [1] Cross-domain Meta-learning for Time-series Forecasting
    Ali, Abbas Raza
    Gabrys, Bogdan
    Budka, Marcin
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 9 - 18
  • [2] [Anonymous], EUR WORKSH VIS AN EU
  • [3] [Anonymous], 2017, SCI COMPUTING ALGORI
  • [4] [Anonymous], ARXIV14100095V1
  • [5] [Anonymous], 28 INT C SOFTW ENG K
  • [6] [Anonymous], TIME SERIES DEFNITIO
  • [7] [Anonymous], ARXIV181011624
  • [8] [Anonymous], DETERMINING CLUSTERS
  • [9] Using Empirical Recurrence Rates Ratio for Time Series Data Similarity
    Bhaduri, Moinak
    Zhan, Justin
    [J]. IEEE ACCESS, 2018, 6 : 30855 - 30864
  • [10] A time series clustering approach for Building Automation and Control Systems
    Bode, Gerrit
    Schreiber, Thomas
    Baranski, Marc
    Mueller, Dirk
    [J]. APPLIED ENERGY, 2019, 238 : 1337 - 1345