The research on selective clustering ensemble algorithm based on fractal dimension and projection

被引:0
作者
机构
[1] School of Management, Hefei University of Technology, Hefei
来源
Wu, Xiaoxuan | 2015年 / Binary Information Press卷 / 11期
基金
中国国家自然科学基金;
关键词
Fractal dimension; Projection clustering; Reference partition; Selective clustering ensemble;
D O I
10.12733/jcis14352
中图分类号
学科分类号
摘要
Selective clustering ensemble algorithm can eliminate the inferior quality clustering member's influence and can achieve a better clustering solution relative to the clustering ensemble algorithm. For high dimensional data clustering, in this paper, a novel selective clustering ensemble algorithm based on fractal dimension and projection is proposed. Firstly, the clustering members are generated by the clustering algorithm based on fractal dimension and projection to realize dimension reduction and clustering; then the selection strategy based on the best reference partition is used to produce the components of the ensemble system in order to pick part of high quality cluster members to realize ensemble by using weighted Co-association matrix, and get the final clustering results at last. The experimental results on UCI data set verify the validity of the proposed algorithm for dealing with high dimensional data clustering. The new algorithm is able to achieve statistically significant performance improvement over other clustering algorithms. ©, 2015, Binary Information Press. All right reserved.
引用
收藏
页码:4025 / 4035
页数:10
相关论文
共 22 条
  • [1] Ni Z., Ni L.-P., Liu H., Jia R., Dynamic Data Mining, (2010)
  • [2] Yang L.-Y., Zhou H.-J., Zhuo Q., Wang W.-Y., Weighted cluster ensemble based on significance of attribute, Computer Science, 4, (2009)
  • [3] Jiang S., Cluster fusion algorithm based on majority voting mechanism, Journal of Chinese Computer Systems, 2, pp. 306-309, (2007)
  • [4] Yang L.-Y., Wang W.-Y., Clustering ensemble approaches: An overview, Application Research of Computers, 12, pp. 8-14, (2005)
  • [5] Zhao Z., Zou Y., Li G., New clustering algorithm based on combination of genetic algorithm and ant colony algorithm, Science Technology and Engineering, 6, (2006)
  • [6] Fern X.Z., Brodley C.E., Random projection for high dimensional data clustering: A cluster ensemble approach, ICM, 3, pp. 186-193, (2003)
  • [7] Strehl A., Ghosh J., Cluster ensembles - A knowledge reuse framework for combining multiple partitions, The Journal of Machine Learning Research, 3, pp. 583-617, (2003)
  • [8] Fred A.L.N., Jain A.K., Data clustering using evidence accumulation, In Pattern Recognition, Proceedings. 16th International Conference on, 4, pp. 276-280, (2002)
  • [9] Monti S., Tamayo P., Mesirov J., Golub T., Consensus clustering: A resampling-based method for class discovery and visualization of gene expression microarray data, Machine learning, 52, 1-2, pp. 91-118, (2003)
  • [10] Topchy A., Jain A.K., Punch W., Combining multiple weak clusterings, In Data Mining, ICDM 2003. Third IEEE International Conference on, pp. 331-338, (2003)