Class Discovery Based on K-means Clustering and Perturbation Analysis

被引:0
作者
Ru, Xiaohu [1 ]
Liu, Zheng [1 ]
Huang, Zhitao [1 ]
Jiang, Wenli [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Engn, Changsha 410073, Hunan, Peoples R China
来源
2015 8TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP) | 2015年
关键词
pattern recognition; class discovery; k-means clustering; perturbation; disagreement/agreement index (DAI);
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Class discovery, which aims to identify the underlying category structure, is an important issue in pattern recognition and knowledge discovery. The key task in class discovery is to estimate the number of classes. Classical estimation approaches usually face the problems of low accuracy, high complexity, or difficulty in choosing an appropriate penalty function. In this paper, an effective class discovery method is proposed. The method first utilizes the characteristics of the mean-square-error produced by k-means clustering, giving a coarse estimate of the number of classes, and then calculates the difference between the clustering results obtained from the original dataset and the perturbed dataset to further determine the real number of classes. Experiments on simulated and real-world data demonstrate that the proposed method has satisfactory performance in different situations. Moreover, this method relies loosely on artificially selected parameters, thus can be reliably used in wide applications.
引用
收藏
页码:1236 / 1240
页数:5
相关论文
共 12 条
[11]   Class Discovery From Gene Expression Data Based on Perturbation and Cluster Ensemble [J].
Yu, Zhiwen ;
Wong, Hau-San .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2009, 8 (02) :147-160
[12]   An efficient k′-means clustering algorithm [J].
Zalik, Krista Rizman .
PATTERN RECOGNITION LETTERS, 2008, 29 (09) :1385-1391