An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection

被引:0
作者
Pirgazi, Jamshid [1 ]
Kallehbasti, Mohammad Mehdi Pourhashem [1 ]
Sorkhi, Ali Ghanbari [1 ]
Kermani, Ali [1 ]
机构
[1] Univ Sci & Technol Mazandaran, Dept Elect & Comp Engn, Behshahr, Iran
关键词
Feature selection; High-dimensional data; Harris Hawks optimization; Global search; PARTICLE SWARM OPTIMIZATION; ALGORITHM; WOLF;
D O I
10.34172/bi.30340
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Introduction: High-dimensional datasets often contain an abundance of features, many of which are irrelevant to the subject of interest. This issue is compounded by the frequently low number of samples and imbalanced class samples. These factors can negatively impact the performance of classification algorithms, necessitating feature selection before classification. The primary objective of feature selection algorithms is to identify a minimal subset of features that enables accurate classification. Methods: In this paper, we propose a two-stage hybrid method for the optimal selection of relevant features. In the first stage, a filter method is employed to assign weights to the features, facilitating the removal of redundant and irrelevant features and reducing the computational cost of classification algorithms. A subset of high-weight features is retained for further processing in the second stage. In this stage, an enhanced Harris Hawks Optimization algorithm and GRASP, augmented with crossover and mutation operators from genetic algorithms, are utilized based on the weights calculated in the first stage to identify the optimal feature set. Results: Experimental results demonstrate that the proposed algorithm successfully identifies the optimal subset of features. Conclusion: The two-stage hybrid method effectively selects the optimal subset of features, improving the performance of classification algorithms on high-dimensional datasets. This approach addresses the challenges posed by the abundance of features, low number of samples, and imbalanced class samples, demonstrating its potential for application in various fields.
引用
收藏
页数:14
相关论文
共 50 条
[41]   An Improved Harris Hawks Optimization Algorithm With Simulated Annealing for Feature Selection in the Medical Field [J].
Elgamal, Zenab Mohamed ;
Yasin, Norizan Binti Mohd ;
Tubishat, Mohammad ;
Alswaitti, Mohammed ;
Mirjalili, Seyedali .
IEEE ACCESS, 2020, 8 :186638-186652
[42]   Filter-Wrapper Approach to Feature Selection Using RST-DPSO for Mining Protein Function [J].
Rahman, Shuzlina Abdul ;
Abu Bakar, Azuraliza ;
Hussein, Zeti Azura Mohamed .
2009 2ND CONFERENCE ON DATA MINING AND OPTIMIZATION, 2009, :78-+
[43]   Improved Harris Hawks Algorithm and Its Application in Feature Selection [J].
Zhang, Qianqian ;
Li, Yingmei ;
Zhan, Jianjun ;
Chen, Shan .
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (01) :1251-1273
[44]   An efficient malware detection approach with feature weighting based on Harris Hawks optimization [J].
Alzubi, Omar A. ;
Alzubi, Jafar A. ;
Al-Zoubi, Ala' M. ;
Hassonah, Mohammad A. ;
Kose, Utku .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04) :2369-2387
[45]   An efficient malware detection approach with feature weighting based on Harris Hawks optimization [J].
Omar A. Alzubi ;
Jafar A. Alzubi ;
Ala’ M. Al-Zoubi ;
Mohammad A. Hassonah ;
Utku Kose .
Cluster Computing, 2022, 25 :2369-2387
[46]   Improved swarm-optimization-based filter-wrapper gene selection from microarray data for gene expression tumor classification [J].
Ke, Lin ;
Li, Min ;
Wang, Lei ;
Deng, Shaobo ;
Ye, Jun ;
Yu, Xiang .
PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (02) :455-472
[47]   A novel filter-wrapper hybrid gene selection approach for microarray data based on multi-objective forest optimization algorithm [J].
Nouri-Moghaddam, Babak ;
Ghazanfari, Mehdi ;
Fathian, Mohammad .
DECISION SCIENCE LETTERS, 2020, 9 (03) :271-290
[48]   Efficient Multi-Swarm Binary Harris Hawks Optimization as a Feature Selection Approach for Software Fault Prediction [J].
Thaher, Thaer ;
Arman, Nabil .
2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2020, :249-254
[49]   Hybrid filter-wrapper attribute selection with alpha-level fuzzy rough sets [J].
Nguyen Ngoc Thuy ;
Wongthanavasu, Sartra .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
[50]   A Hybrid Filter/Wrapper Approach of Feature Selection for Gene Expression Data [J].
Ke, Chao-Hsuan ;
Yang, Cheng-Hong ;
Chuang, Li-Yeh ;
Yang, Cheng-San .
2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, :2663-+