DSCo-NG: A Practical Language Modeling Approach for Time Series Classification

被引:8
作者
Li, Daoyuan [1 ]
Bissyande, Tegawende F. [1 ]
Klein, Jacques [1 ]
Le Traon, Yves [1 ]
机构
[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust SnT, Luxembourg, Luxembourg
来源
ADVANCES IN INTELLIGENT DATA ANALYSIS XV | 2016年 / 9897卷
关键词
REPRESENTATION; DISTANCE;
D O I
10.1007/978-3-319-46349-0_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The abundance of time series data in various domains and their high dimensionality characteristic are challenging for harvesting useful information from them. To tackle storage and processing challenges, compression-based techniques have been proposed. Our previous work, Domain Series Corpus (DSCo), compresses time series into symbolic strings and takes advantage of language modeling techniques to extract from the training set knowledge about different classes. However, this approach was flawed in practice due to its excessive memory usage and the need for a priori knowledge about the dataset. In this paper we propose DSCo-NG, which reduces DSCo's complexity and offers an efficient (linear time complexity and low memory footprint), accurate (performance comparable to approaches working on uncompressed data) and generic (so that it can be applied to various domains) approach for time series classification. Our confidence is backed with extensive experimental evaluation against publicly accessible datasets, which also offers insights on when DSCo-NG can be a better choice than others.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 25 条
[1]  
[Anonymous], 1994, USING DYNAMIC TIME W
[2]  
Batista G. E., 2011, P 2011 SIAM INT C DA, P699, DOI DOI 10.1137/1.9781611972818.60
[3]   A Bag-of-Features Framework to Classify Time Series [J].
Baydogan, Mustafa Gokce ;
Runger, George ;
Tuv, Eugene .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2796-2802
[4]   Statistical language model adaptation: review and perspectives [J].
Bellegarda, JR .
SPEECH COMMUNICATION, 2004, 42 (01) :93-108
[5]  
Chung F., 2001, INT JOINT C ART INT, P1
[6]   A review on time series data mining [J].
Fu, Tak-chung .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (01) :164-181
[7]   Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases [J].
Eamonn Keogh ;
Kaushik Chakrabarti ;
Michael Pazzani ;
Sharad Mehrotra .
Knowledge and Information Systems, 2001, 3 (3) :263-286
[8]   Fast similarity search in the presence of longitudinal scaling in time series databases [J].
Keogh, E .
NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, :578-584
[9]  
Keogh E. J., 2004, P 10 ACM SIGKDD INT, P206, DOI [DOI 10.1145/1014052.1014077, 10.1145/1014052.1014077]
[10]  
Li D., 2016, 28 INT C SOFTW ENG K