A parallel metaheuristic approach for ensemble feature selection based on multi-core architectures

被引:28
作者
Hijazi, Neveen Mohammed [1 ]
Faris, Hossam [1 ,2 ]
Aljarah, Ibrahim [1 ]
机构
[1] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan
[2] Al Hussein Tech Univ, Sch Comp & Informat, Amman, Jordan
关键词
Meta-heuristics; Evolutionary computation; Parallel processing; Feature selection; Ensemble learning; DIFFERENTIAL EVOLUTION; GENE SELECTION; ALGORITHM; CLASSIFICATION; OPTIMIZATION; MACHINE; SCHEME;
D O I
10.1016/j.eswa.2021.115290
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble learning have emerged as a useful machine learning technique, which is based on the idea that combining the output of multiple models instead of using a single model. This practice, known as "diversity", and it usually enhances the performance. On other hand, ensemble feature selection method is based on the same idea, where multiple feature subsets are combined to select an optimal subset of features. Learning methods have difficulties with the dimensionality curse that impact the performance and increase the time exponentially. To overcome this issue, we propose a parallel heterogeneous ensemble feature selection based on three wellregarded algorithms: genetic algorithm, particle swarm optimizer, and grey wolf optimizer. The proposed approach is based on four phases; namely, distribution phase, parallel ensemble feature selection phase, combining and aggregation phase, and testing phase. Three implementations of the proposed approach are presented: a sequential approach running on the central processing unit (CPU), a parallel approach running on multi-core CPU, and a parallel approach running on multi-core CPU with graphics processing units (GPU). To assess the performance of the proposed approach twenty-one large datasets were used. The results show that the proposed parallel approach improved the performance in terms of the prediction results and running time.
引用
收藏
页数:30
相关论文
共 89 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   A new fusion of grey wolf optimizer algorithm with a two-phase mutation for feature selection [J].
Abdel-Basset, Mohamed ;
El-Shahat, Doaa ;
El-henawy, Ibrahim ;
de Albuquerque, Victor Hugo C. ;
Mirjalili, Seyedali .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 139
[3]  
Abu Khurma R, 2020, ALGO INTELL SY, P131, DOI 10.1007/978-981-32-9990-0_8
[4]   Feature Selection Method Based on Grey Wolf Optimization for Coronary Artery Disease Classification [J].
Al-Tashi, Qasem ;
Rais, Helmi ;
Jadid, Said .
RECENT TRENDS IN DATA SCIENCE AND SOFT COMPUTING, IRICT 2018, 2019, 843 :257-266
[5]   Binary Optimization Using Hybrid Grey Wolf Optimization for Feature Selection [J].
Al-Tashi, Qasem ;
Kadir, Said Jadid Abdul ;
Rais, Helmi Md ;
Mirjalili, Seyedali ;
Alhussian, Hitham .
IEEE ACCESS, 2019, 7 :39496-39508
[6]   Parallel metaheuristics: recent advances and new trends [J].
Alba, Enrique ;
Luque, Gabriel ;
Nesmachnow, Sergio .
INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2013, 20 (01) :1-48
[7]   Software Defect Prediction Using Heterogeneous Ensemble Classification Based on Segmented Patterns [J].
Alsawalqah, Hamad ;
Hijazi, Neveen ;
Eshtay, Mohammed ;
Faris, Hossam ;
Al Radaideh, Ahmed ;
Aljarah, Ibrahim ;
Alshamaileh, Yazan .
APPLIED SCIENCES-BASEL, 2020, 10 (05)
[8]   mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling [J].
Alshamlan, Hala ;
Badr, Ghada ;
Alohali, Yousef .
BIOMED RESEARCH INTERNATIONAL, 2015, 2015
[9]  
[Anonymous], 2015, DECISION COMMITTEE E, DOI DOI 10.1016/S0967-2109(97)89838-9
[10]  
[Anonymous], 2015, arXiv