HMM-based hybrid meta-clustering ensemble for temporal data

被引:25
作者
Yang, Yun [1 ]
Jiang, Jianmin [1 ,2 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
[2] Univ Surrey, Dept Comp, Guildford GU2 7XH, Surrey, England
关键词
Temporal data clustering; Ensemble learning; Hidden Markov Model; Model selection; Time series; TIME-SERIES; MODEL;
D O I
10.1016/j.knosys.2013.12.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal data have many distinct characteristics, including high dimensionality, complex time dependency, and large volume, all of which make the temporal data clustering more challenging than conventional static datasets. In this paper, we propose a HMM-based partitioning ensemble based on hierarchical clustering refinement to solve the problems of initialization and model selection for temporal data clustering. Our approach results four major benefits, which can be highlighted as: (i) the model initialization problem is solved by associating the ensemble technique; (ii) the appropriate cluster number can be automatically determined by applying proposed consensus function on the multiple partitions obtained from the target dataset during clustering ensemble phase; (iii) no parameter re-estimation is required for the new merged pair of cluster, which significantly reduces the computing cost of its final refinement process based on HMM agglomerative clustering; and finally (iv) the composite model is better in characterizing the complex structure of clusters. Our approach has been evaluated on synthetic data and time series benchmark, and yields promising results for clustering tasks. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:299 / 310
页数:12
相关论文
共 56 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] [Anonymous], 1997, Statistical methods for speech recognition
  • [3] [Anonymous], J BIOMED INFORM
  • [4] [Anonymous], J MACHINE LEARNING R
  • [5] [Anonymous], COMPUT STAT DATA ANA
  • [6] [Anonymous], NEURAL NETW
  • [7] [Anonymous], 2006, 11 IEEE S COMP COMM, DOI DOI 10.1109/ISCC.2006.73
  • [8] [Anonymous], P INT FLOR ART INT R
  • [9] [Anonymous], 2010 10 INT C INT SY
  • [10] [Anonymous], 1996, KDD, DOI DOI 10.1023/A:1009745219419