An Evolutionary Approach for Document Clustering

被引:14
作者
Akter, Ruksana [1 ]
Chung, Yoojin [1 ]
机构
[1] Hankuk Univ Foreign Studies, Dept Comp Sci & Engn, Mohyun 449791, Yongin, South Korea
来源
2013 INTERNATIONAL CONFERENCE ON ELECTRONIC ENGINEERING AND COMPUTER SCIENCE (EECS 2013) | 2013年 / 4卷
关键词
Document clustering; Genetic Algorithm; Local minima; Cluster Evaluation; GENETIC ALGORITHM;
D O I
10.1016/j.ieri.2013.11.053
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose an evolutionary approach based on genetic algorithm for text document clustering. Instead of applying genetic algorithm on the whole dataset, we partition the dataset into some groups and apply genetic algorithm to each of the partitions separately. Finally, we apply another genetic algorithm phase on the outcomes of the earlier ones. This allows to get rid of the local minima, which is one of the major problems of using genetic algorithms. Another good feature of our proposal is that we do not require specifying the total clusters to be made in advance as most of the available methods. Experimental results also show the superior performance of our approach as compared to the previous approaches. (C) 2013 The Authors. Published by Elsevier B.V.
引用
收藏
页码:370 / 375
页数:6
相关论文
共 15 条
  • [1] Andre J, 2000, ADV ENG SOFTW, V32, P49
  • [2] [Anonymous], 1988, ALGORITHMS CLUSTERIN
  • [3] [Anonymous], 1988, Information processing management
  • [4] [Anonymous], ACM COMPUTING SURVEY
  • [5] [Anonymous], 1973, Pattern Classification and Scene Analysis
  • [6] Nonparametric genetic clustering: Comparison of validity indices
    Bandyopadhyay, S
    Maulik, U
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2001, 31 (01): : 120 - 125
  • [7] Cha S. M, 2001, NEW MIGRATION METHOD
  • [8] Choi LC, 2011, COMM COM INF SC, V257, P212
  • [9] CLUSTER SEPARATION MEASURE
    DAVIES, DL
    BOULDIN, DW
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) : 224 - 227
  • [10] Fragoudis D., 2005, KNOWLWDGE INFORM SYS