Clustering Aggregation Based on Genetic Algorithm for Documents Clustering

被引:13
作者
Zhang, Zhenya [1 ]
Cheng, Hongmei [2 ]
Zhang, Shuguang [3 ]
Chen, Wanli [1 ]
Fang, Qiansheng [1 ]
机构
[1] AIAI, Sch Elect & Informat Engn, Hefei 230022, Peoples R China
[2] AIAI, Dept Engn Management, Hefei 230022, Peoples R China
[3] Univ Sci & Technol China, Dept Stat & Finance, Hefei, Peoples R China
来源
2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8 | 2008年
关键词
D O I
10.1109/CEC.2008.4631225
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering aggregation problem is a kind of formal description for clustering ensemble problem and technologies for the solving of clustering aggregation problem can be used to construct clustering division with better clustering performance when the clustering performances of each original clustering division are fluctuant or weak. In this paper, an approach based on genetic algorithm for clustering aggregation problem, named as GeneticCA, is presented. To estimate the clustering performance of a clustering division, clustering precision is defined and features of clustering precision are discussed. In our experiments about clustering performances of GeneticCA for document clustering, hamming neural network is used to construct clustering divisions with fluctuant and weak clustering performances. Experimental results show that the clustering performance of clustering division constructed by GeneticCA is better than clustering performance of original clustering divisions with clustering precision as criterion.
引用
收藏
页码:3156 / +
页数:2
相关论文
共 14 条
  • [1] Ailon Nir, 2005, P 37 ANN ACM S THEOR, P684, DOI [10.1145/1060590.1060692, DOI 10.1145/1060590.1060692]
  • [2] Andritsos P, 2004, LECT NOTES COMPUT SC, V2992, P123
  • [3] [Anonymous], 2007, ACM Transactions on Knowledge Discovery from Data
  • [4] Correlation clustering
    Bansal, N
    Blum, A
    Chawla, S
    [J]. MACHINE LEARNING, 2004, 56 (1-3) : 89 - 113
  • [5] BOUTIS C, 2004, P EUR C PRINC PRACT, P63
  • [6] Clustering with qualitative information
    Charikar, M
    Guruswami, V
    Wirth, A
    [J]. 44TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2003, : 524 - 533
  • [7] Chen G L, 1996, Genetic algorithm and its application
  • [8] Correlation clustering in general weighted graphs
    Demaine, Erik D.
    Emanuel, Dotan
    Fiat, Amos
    Immorlica, Nicole
    [J]. THEORETICAL COMPUTER SCIENCE, 2006, 361 (2-3) : 172 - 187
  • [9] Dong Lehong, 2006, COMPUTER APPL SOFTWA, V23, P27
  • [10] Shen X., 2000, Comput. Eng. Appl, V36, P31