Classification System for Time Series Data Based on Feature Pattern Extraction

被引:0
作者
Sugimura, Hiroshi [1 ]
Matsumoto, Kazunori [1 ]
机构
[1] Kanagawa Inst Technol, Grad Sch Engn, Atsugi, Japan
来源
2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2011年
关键词
datamining; time series; classification; TF*IDF; dynamic time warping; decision tree; genetic algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a system which acquires feature patterns and makes classifiers for time series data without using background knowledge given by a user. Time series data are widely appeared in finance, medical research, industrial sensors, etc. The system acquires the feature patterns that characterize similar data in database. We focus on two aspects of the feature pattern: global and local frequency. Our purpose is to acquire features of each data by extracting these patterns. The system cut out subsequences from time series data. Several representative sequences are extracted from these subsequences by using clustering. Feature patterns are acquired from these representative sequences. For this purpose, we develop a method that applies TF*IDF weight technique, which is often used in text mining, to time series data. The time series data are classified by using the acquired feature patterns. In accordance with a criterion that is based on the entropy theory, feature patterns are improved by the automatic process, generation by generation, using the genetic algorithm. By using the final and optimized feature patterns, we build a decision tree that determines future behaviors. We explain how these two tools are combinatory applied in the entire knowledge discovery process.
引用
收藏
页码:1340 / 1345
页数:6
相关论文
共 11 条
[1]  
Abe H., 2007, 3 INT WORKSH MIN COM, P49
[2]  
Agrawal R., 2002, DAT ENG 1995 P 11 IN, P3
[3]  
[Anonymous], 2014, C4. 5: programs for machine learning
[4]  
[Anonymous], 1975, Ann Arbor
[5]  
[Anonymous], 2007, Principles of Data Mining: Undergraduate Topics in Computer Science
[6]  
Berndt DJ., 1994, USING DYNAMIC TIME W, DOI DOI 10.5555/3000850.3000887
[7]   A Genetic Algorithm for Constructing Compact Binary Decision Trees [J].
Cha, Sung-Hyuk ;
Tappert, Charles .
JOURNAL OF PATTERN RECOGNITION RESEARCH, 2009, 4 (01) :1-13
[8]  
Haigh K.Z., 2004, P 7 WORKSH MIN SCI E
[9]   Heartbeat Time Series Classification With Support Vector Machines [J].
Kampouraki, Argyro ;
Manis, George ;
Nikou, Christophoros .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2009, 13 (04) :512-518
[10]   An efficient k-means clustering algorithm:: Analysis and implementation [J].
Kanungo, T ;
Mount, DM ;
Netanyahu, NS ;
Piatko, CD ;
Silverman, R ;
Wu, AY .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) :881-892