FSPL: A Meta-Learning Approach for a Filter and Embedded Feature Selection Pipeline

被引:4
作者
Lazebnik, Teddy [1 ]
Rosenfeld, Avi [2 ]
机构
[1] UCL, Dept Canc Biol, Canc Inst, 72 Huntley St, London WC1E 6DD, England
[2] Jerusalem Coll Technol, Dept Comp Sci, 21 Ha Vaad Ha Leumi St, Jerusalem, Israel
关键词
feature selection pipeline; meta-learning; no free lunch; autoML; genetic algorithm; WRAPPER APPROACH; CLASSIFICATION; OPTIMIZATION;
D O I
10.34768/amcs-2023-0009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are two main approaches to tackle the challenge of finding the best filter or embedded feature selection (FS) algorithm: searching for the one best FS algorithm and creating an ensemble of all available FS algorithms. However, in practice, these two processes usually occur as part of a larger machine learning pipeline and not separately. We posit that, due to the influence of the filter FS on the embedded FS, one should aim to optimize both of them as a single FS pipeline rather than separately. We propose a meta-learning approach that automatically finds the best filter and embedded FS pipeline for a given dataset called FSPL. We demonstrate the performance of FSPL on n = 90 datasets, obtaining 0.496 accuracy for the optimal FS pipeline, revealing an improvement of up to 5.98 percent in the model's accuracy compared to the second-best meta-learning method.
引用
收藏
页码:103 / 115
页数:13
相关论文
共 82 条
[1]  
Aarts E.H.L., 1987, Pattern recognition theory and applications, V30, P179, DOI DOI 10.1007/978-3-642-83069-315
[2]  
Abdullah A.S., 2017, INDIAN J SCI TECHNOL, V10, P1, DOI DOI 10.17485/ijst/2017/v10i18/111768
[3]  
Aggarwal C.C., 2014, Data Classification: Algorithms and Applications, V1st, DOI DOI 10.1201/B17320
[4]  
Akshaikhdeeb B., 2017, 6 INT C ELECT ENG IN, P1
[5]  
[Anonymous], 2011, P 27 C UNC ART INT U
[6]  
Anthony T, 2017, ADV NEUR IN, V30
[7]  
Azhagusundari B., 2013, Int J Innovat Technol Explor Eng, V2, P18, DOI DOI 10.1371/JOURNAL.PONE.0166017
[8]   ON THE PREDICTIVE POWER OF META-FEATURES IN OPENML [J].
Bilalli, Besim ;
Abello, Alberto ;
Aluja-Banet, Tomas .
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 27 (04) :697-712
[9]   Optimization of process route by Genetic Algorithms [J].
Bo, ZW ;
Hua, LZ ;
Yu, ZG .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2006, 22 (02) :180-188
[10]   Ensembles for feature selection: A review and future trends [J].
Bolon-Canedo, Veronica ;
Alonso-Betanzos, Amparo .
INFORMATION FUSION, 2019, 52 :1-12