Graft: A graph based time series data mining framework

被引:8
作者
Mishra, Kakuli [1 ]
Basu, Srinka [2 ]
Maulik, Ujjwal [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Univ Kalyani, Dept Engn & Technol Studies, Nadia, India
关键词
Clustering; Time series similarity; Graph based networks; Directed and undirected graphs; Eigen values; COMPLEX NETWORKS; NEURAL-NETWORKS; UNIFYING VIEW; PREDICTION; DISCORDS; MOTIFS; SIGNAL; JOINS;
D O I
10.1016/j.engappai.2022.104695
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rapid technology integration causes a high dimensional time series data accumulation in multiple domains and applying the classical data mining tools and techniques becomes a challenging task. Hence, the time series data representation have gained popularity over the years, which ease the task of mining, analysis and visualization. Graph based representation is one such emerging tool in which the time series data is represented as nodes and edges of graph. The current graph based representation is designed either to mine the motif or discords from a single time series or cluster the time series where each node represents a time series sample. Such representation technique causes information loss and also no further analysis could be performed other than clustering. To address these challenges, we propose a unique graph representation for time series dataset that works on multiple domains. Novelty of the graph representation is that it is unique for multiple time series and it acts as a framework for whole time series clustering, temporal pattern extraction from each cluster and temporally dependent rare event discovery. A new research direction for the proposed graph based framework is shown. Comparative analysis reveal the superiority of the proposed framework particularly as a clustering technique. The key contributions of the paper are: (i) transformation strategy of time series database from time domain to graph structure in topological domain (ii) time series clustering using path level analysis (iii) identification of temporally dependent co-occurring patterns (iv) rare event detection using component level analysis
引用
收藏
页数:18
相关论文
共 80 条
[1]  
Alfke D., 2021, ARXIV210408153
[2]   A new hybrid financial time series prediction model [J].
Alhnaity, Bashar ;
Abbod, Maysam .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 95
[3]   Time series trend detection and forecasting using complex network topology analysis [J].
Anghinoni, Leandro ;
Zhao, Liang ;
Ji, Donghong ;
Pan, Heng .
NEURAL NETWORKS, 2019, 117 :295-306
[4]  
Ausgrid, 2020, SOL HOM EL DAT
[5]   A time series clustering approach for Building Automation and Control Systems [J].
Bode, Gerrit ;
Schreiber, Thomas ;
Baranski, Marc ;
Mueller, Dirk .
APPLIED ENERGY, 2019, 238 :1337-1345
[6]   Series2Graph: Graph-based Subsequence Anomaly Detection for Time Series [J].
Boniol, Paul ;
Palpanas, Themis .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (11) :1821-1834
[7]   An efficient approach to mine flexible periodic patterns in time series databases [J].
Chanda, Ashis Kumar ;
Saha, Swapnil ;
Nishi, Manziba Akanda ;
Samiullah, Md. ;
Ahmed, Chowdhury Farhan .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 44 :46-63
[8]   Shortest paths algorithms: Theory and experimental evaluation [J].
Cherkassky, BV ;
Goldberg, AV ;
Radzik, T .
MATHEMATICAL PROGRAMMING, 1996, 73 (02) :129-174
[9]   A generalized matrix profile framework with support for contextual series analysis [J].
De Paepe, Dieter ;
Vanden Hautte, Sander ;
Steenwinckel, Bram ;
De Turck, Filip ;
Ongenae, Femke ;
Janssens, Olivier ;
Van Hoecke, Sofie .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 90
[10]   A review on time series forecasting techniques for building energy consumption [J].
Deb, Chirag ;
Zhang, Fan ;
Yang, Junjing ;
Lee, Siew Eang ;
Shah, Kwok Wei .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2017, 74 :902-924