A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

被引:11
|
作者
Bouaguel, Waad [1 ]
机构
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
来源
INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2015 | 2016年 / 5卷
关键词
Wrapper; Feature selection; Big data; CLASSIFICATION; PREDICTION;
D O I
10.1007/978-3-319-27000-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes testing and training of general classification method difficult. Special data analysis is demanded in this case and one of the common ways to handle high dimensionality is identification of the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods have some limitations due to the fact that their result depends on the search strategy. In theory when a complex search is used, it may take much longer to choose the best subset of features and may be impractical in some cases. Hence we propose a new wrapper feature selection for big data based on a random search using genetic algorithm and prior information. The new approach was tested on 2 biological dataset and compared to two well known wrapper feature selection approaches and results illustrate that our approach gives the best performances.
引用
收藏
页码:75 / 83
页数:9
相关论文
共 50 条
  • [41] Filter-Wrapper Approach to Feature Selection Using RST-DPSO for Mining Protein Function
    Rahman, Shuzlina Abdul
    Abu Bakar, Azuraliza
    Hussein, Zeti Azura Mohamed
    2009 2ND CONFERENCE ON DATA MINING AND OPTIMIZATION, 2009, : 78 - +
  • [42] Quantum based Whale Optimization Algorithm for wrapper feature selection
    Agrawal, R. K.
    Kaur, Baljeet
    Sharma, Surbhi
    APPLIED SOFT COMPUTING, 2020, 89
  • [43] A novel hybrid wrapper–filter approach based on genetic algorithm, particle swarm optimization for feature subset selection
    Fateme Moslehi
    Abdorrahman Haeri
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 1105 - 1127
  • [44] A new and fast rival genetic algorithm for feature selection
    Too, Jingwei
    Abdullah, Abdul Rahim
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (03) : 2844 - 2874
  • [45] A new and fast rival genetic algorithm for feature selection
    Jingwei Too
    Abdul Rahim Abdullah
    The Journal of Supercomputing, 2021, 77 : 2844 - 2874
  • [46] Feature Selection on High Dimensional Data using Wrapper Based Subset Selection
    Manikandan, G.
    Susi, E.
    Abirami, S.
    2017 SECOND INTERNATIONAL CONFERENCE ON RECENT TRENDS AND CHALLENGES IN COMPUTATIONAL MODELS (ICRTCCM), 2017, : 320 - 325
  • [47] Wrapper feature selection with partially labeled data
    Vasilii Feofanov
    Emilie Devijver
    Massih-Reza Amini
    Applied Intelligence, 2022, 52 : 12316 - 12329
  • [48] A Hybrid Feature Selection Approach Based on Statistical and Wrapper Methods
    Kaya, Mahmut
    Bilge, Basalt Sakir
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 2101 - 2104
  • [49] Filter-Wrapper Approach to Feature Selection of GPCR Protein
    Kamal, Nor Ashikin Mohamad
    Abu Bakar, Azuraliza
    Zainudin, Suhaila
    5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 693 - 698
  • [50] Wrapper feature selection with partially labeled data
    Feofanov, Vasilii
    Devijver, Emilie
    Amini, Massih-Reza
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12316 - 12329