A two-stage hybrid ant colony optimization for high-dimensional feature selection

被引:111
作者
Ma, Wenping [1 ,2 ]
Zhou, Xiaobo [1 ,2 ]
Zhu, Hao [1 ,2 ]
Li, Longwei [1 ,2 ]
Jiao, Licheng [1 ,2 ]
机构
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Sch Artificial Intelligence, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Feature selection; Ant colony optimization; High-dimensional data; Classification; Optimal feature subset size; FEATURE SUBSET; ALGORITHM; CLASSIFICATION;
D O I
10.1016/j.patcog.2021.107933
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ant colony optimization (ACO) is widely used in feature selection owing to its excellent global/local search capabilities and flexible graph representation. However, the current ACO-based feature selection methods are mainly applied to low-dimensional datasets. For thousands of dimensional datasets, the search for the optimal feature subset (OFS) becomes extremely difficult due to the exponential increase of the search space. In this paper, we propose a two-stage hybrid ACO for high-dimensional feature se-lection (TSHFS-ACO). As an additional stage, it uses the interval strategy to determine the size of OFS for the following OFS search. Compared to the traditional one-stage methods that determine the size of OFS and search for OFS simultaneously, the stage of checking the performance of partial feature number endpoints in advance helps to reduce the complexity of the algorithm and alleviate the algorithm from getting into a local optimum. Moreover, the advanced ACO algorithm embeds the hybrid model, which uses the features' inherent relevance attributes and the classification performance to guide OFS search. The test results on eleven high-dimensional public datasets show that TSHFS-ACO is suitable for high-dimensional feature selection. The obtained OFS has state-of-the-art performance on most datasets. And compared with other ACO-based feature selection methods, TSHFS-ACO has a shorter running time. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 43 条
[1]   Group search optimizer: a nature-inspired meta-heuristic optimization algorithm with its results, variants, and applications [J].
Abualigah, Laith .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07) :2949-2972
[2]   A novel hybrid antlion optimization algorithm for multi-objective task scheduling problems in cloud computing environments [J].
Abualigah, Laith ;
Diabat, Ali .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (01) :205-223
[3]   A combination of objective functions and hybrid Krill herd algorithm for text document clustering analysis [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 73 :111-125
[4]   A new feature selection method to improve the document clustering using particle swarm optimization algorithm [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 25 :456-466
[5]   Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin .
JOURNAL OF SUPERCOMPUTING, 2017, 73 (11) :4773-4795
[6]  
Abualigah LMQ., 2015, INT J COMPUT SCI ENG, V5, P19, DOI DOI 10.5121/IJCSEA.2015.5102
[7]  
Abualigah LMQ, 2019, FEATURE SELECTION EN, DOI [DOI 10.1007/978-3-030-10674-4, 10.1007/978-3-030-10674-4]
[8]   Text feature selection using ant colony optimization [J].
Aghdam, Mehdi Hosseinzadeh ;
Ghasem-Aghaee, Nasser ;
Basiri, Mohammad Ehsan .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :6843-6853
[9]   Object-based classification of hyperspectral data using Random Forest algorithm [J].
Amini, Saeid ;
Homayouni, Saeid ;
Safari, Abdolreza ;
Darvishsefat, Ali A. .
GEO-SPATIAL INFORMATION SCIENCE, 2018, 21 (02) :127-138
[10]   Genetic programming for multiple-feature construction on high-dimensional classification [J].
Binh Tran ;
Xue, Bing ;
Zhang, Mengjie .
PATTERN RECOGNITION, 2019, 93 :404-417