MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection

被引:13
作者
Wang, Xubin [1 ,2 ]
Shangguan, Haojiong [1 ,2 ]
Huang, Fengyi [1 ]
Wu, Shangrui [1 ,2 ]
Jia, Weijia [1 ,2 ]
机构
[1] Hong Kong Baptist Univ, Hong Kong, Peoples R China
[2] Beijing Normal Univ, Inst Artificial Intelligence & Future Networks, Zhuhai 519088, Guangdong, Peoples R China
关键词
Feature extraction; Task analysis; Multitasking; Statistics; Sociology; Vectors; Particle swarm optimization; Feature selection; high-dimensional classification; knowledge transfer; multi-task learning; particle swarm optimization; OPTIMIZATION ALGORITHM;
D O I
10.1109/TKDE.2024.3366333
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a crucial step in data mining to enhance model performance by reducing data dimensionality. However, the increasing dimensionality of collected data exacerbates the challenge known as the "curse of dimensionality", where computation grows exponentially with the number of dimensions. To tackle this issue, evolutionary computational (EC) approaches have gained popularity due to their simplicity and applicability. Unfortunately, the diverse designs of EC methods result in varying abilities to handle different data, often underutilizing and not sharing information effectively. In this article, we propose a novel approach called PSO-based Multi-task Evolutionary Learning (MEL) that leverages multi-task learning to address these challenges. By incorporating information sharing between different feature selection tasks, MEL achieves enhanced learning ability and efficiency. We evaluate the effectiveness of MEL through extensive experiments on 22 high-dimensional datasets. Comparing against 24 EC approaches, our method exhibits strong competitiveness. In addition, we have open-sourced our code on GitHub.
引用
收藏
页码:4020 / 4033
页数:14
相关论文
共 68 条
[1]   Text feature selection using ant colony optimization [J].
Aghdam, Mehdi Hosseinzadeh ;
Ghasem-Aghaee, Nasser ;
Basiri, Mohammad Ehsan .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :6843-6853
[2]  
Alenezi F. N., 2021, Sci. Rep, V11, P1
[3]  
Alqahtani S., 2020, P 58 ANN M ASS COMPU, P8238, DOI 10.18653/v1/2020.acl-main.732
[4]  
Bach F.R., 2008, Proceedings of the 25th International Conference on Machine learning, P33
[5]   Application of high-dimensional feature selection: evaluation for genomic prediction in man [J].
Bermingham, M. L. ;
Pong-Wong, R. ;
Spiliopoulou, A. ;
Hayward, C. ;
Rudan, I. ;
Campbell, H. ;
Wright, A. F. ;
Wilson, J. F. ;
Agakov, F. ;
Navarro, P. ;
Haley, C. S. .
SCIENTIFIC REPORTS, 2015, 5
[6]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[7]   Evolutionary Multitasking for Feature Selection in High-Dimensional Classification via Particle Swarm Optimization [J].
Chen, Ke ;
Xue, Bing ;
Zhang, Mengjie ;
Zhou, Fengyu .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (03) :446-460
[8]   An Evolutionary Multitasking-Based Feature Selection Method for High-Dimensional Classification [J].
Chen, Ke ;
Xue, Bing ;
Zhang, Mengjie ;
Zhou, Fengyu .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) :7172-7186
[9]   Multi-Label Nonlinear Matrix Completion With Transductive Multi-Task Feature Selection for Joint MGMT and IDH1 Status Prediction of Patient With High-Grade Gliomas [J].
Chen, Lei ;
Zhang, Han ;
Lu, Junfeng ;
Thung, Kimhan ;
Aibaidula, Abudumijiti ;
Liu, Luyan ;
Chen, Songcan ;
Jin, Lei ;
Wu, Jinsong ;
Wang, Qian ;
Zhou, Liangfu ;
Shen, Dinggang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (08) :1775-1787
[10]  
Chen Y, 2019, 24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), P420, DOI [10.1109/TCAD.2019.2912948, 10.1145/3287624.3287685]