An evolutionary K-means algorithm for clustering time series data

被引:0
作者
Zhang, H [1 ]
Ho, TB [1 ]
Lin, MS [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Tatsunokuchi, Ishikawa 9231292, Japan
来源
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2004年
关键词
time series; clustering; genetic algorithms; K-means;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that the K-means clustering algorithm is easy to get stuck at locally optimal points for high dimensional data. Many initialization techniques have been proposed to attack this problem, but with only limited success. In this paper we propose an evolutionary K-means algorithm to attack this problem. The proposed algorithm combines Genetic Algorithms and K-means algorithm together for improving the search ability of the K-means algorithm. We rearrange the clusters in crossover operation based on the distance of clustering centers to avoid generating meaningless offspring. A new genetic operator called swap is proposed to replace the traditional mutation operator for avoiding producing invalid offspring. Experiments performed on some publicly available time series data sets demonstrate the effectiveness and efficiency of the proposed algorithm.
引用
收藏
页码:1282 / 1287
页数:6
相关论文
共 9 条
  • [1] An evolutionary technique based on K-Means algorithm for optimal clustering in RN
    Bandyopadhyay, S
    Maulik, U
    [J]. INFORMATION SCIENCES, 2002, 146 (1-4) : 221 - 237
  • [2] Bradley P. S., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P9
  • [3] BRADLEY PS, 1998, P 15 INT C MACH LEAR, P91
  • [4] Ho T.B., 2003, P 9 ACM SIGKDD INT C, P369
  • [5] Distance measures for effective clustering of ARIMA time-series
    Kalpakis, K
    Gada, D
    Puttagunta, V
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 273 - 280
  • [6] Keogh E., 2008, UCR TIME SERIES DATA
  • [7] Michalewicz Z., 1992, GENETIC ALGORITHM DA
  • [8] In search of optimal clusters using genetic algorithms
    Murthy, CA
    Chowdhury, N
    [J]. PATTERN RECOGNITION LETTERS, 1996, 17 (08) : 825 - 832
  • [9] PERUSE: An unsupervised algorithm for finding recurring patterns in time series
    Oates, T
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 330 - 337