Application of Ant Colony Optimization for Feature Selection in Text Categorization

被引:20
作者
Aghdam, Mehdi Hosseinzadeh [1 ]
Ghasem-Aghaee, Nasser [1 ]
Basiri, Mohammad Ehsan [1 ]
机构
[1] Univ Isfahan, Dept Comp Engn, Esfahan, Iran
来源
2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8 | 2008年
关键词
D O I
10.1109/CEC.2008.4631182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is commonly used to reduce dimensionality of datasets with tens or hundreds of thousands of features. A major problem of text categorization is the high dimensionality of the feature space; therefore, feature selection is the most important step in text categorization. This paper presents a novel feature selection algorithm that is based on ant colony optimization. Ant colony optimization algorithm is inspired by observation on real ants in their search for the shortest paths to food sources. Proposed algorithm is easily implemented and because of use of a simple classifier in that, its computational complexity is very low. The performance of proposed algorithm is compared to the performance of information gain and CHI algorithms on the task of feature selection in Reuters-21578 dataset. Simulation results on Reuters-21578 dataset show the superiority of the proposed algorithm.
引用
收藏
页码:2867 / 2873
页数:7
相关论文
共 33 条
[1]  
Al-Ani A, 2005, PROC WRLD ACAD SCI E, V4, P35
[2]  
[Anonymous], REUTERS 21578 TEXT C
[3]  
[Anonymous], 2004, IEEE Comput. Intell. Bull.
[4]  
Bins J, 2000, THESIS COLORADO STAT
[5]  
Blum C., 2001, P MIC 2001 MET INT C, V2, P399
[6]  
Bonabeau E., 1999, Swarm Intelligence: From Natural to Artificial Systems
[7]  
CAROPRESO MF, 2006, BAG WORDS TEXT REPRE, P324
[8]   Ant colony optimization theory: A survey [J].
Dorigo, M ;
Blum, C .
THEORETICAL COMPUTER SCIENCE, 2005, 344 (2-3) :243-278
[9]   Ant system: Optimization by a colony of cooperating agents [J].
Dorigo, M ;
Maniezzo, V ;
Colorni, A .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1996, 26 (01) :29-41
[10]  
DORIGO M, 1994, 1994 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOL 1-7, P887, DOI 10.1109/ICNN.1994.374297