A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

被引:11
|
作者
Bouaguel, Waad [1 ]
机构
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
来源
INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2015 | 2016年 / 5卷
关键词
Wrapper; Feature selection; Big data; CLASSIFICATION; PREDICTION;
D O I
10.1007/978-3-319-27000-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes testing and training of general classification method difficult. Special data analysis is demanded in this case and one of the common ways to handle high dimensionality is identification of the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods have some limitations due to the fact that their result depends on the search strategy. In theory when a complex search is used, it may take much longer to choose the best subset of features and may be impractical in some cases. Hence we propose a new wrapper feature selection for big data based on a random search using genetic algorithm and prior information. The new approach was tested on 2 biological dataset and compared to two well known wrapper feature selection approaches and results illustrate that our approach gives the best performances.
引用
收藏
页码:75 / 83
页数:9
相关论文
共 50 条
  • [1] Feature Selection Using Genetic Algorithm for Big Data
    Saidi, Rania
    Ncir, Waad Bouaguel
    Essoussi, Nadia
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 352 - 361
  • [2] Surrogate-Assisted Genetic Algorithm for Wrapper Feature Selection
    Altarabichi, Mohammed Ghaith
    Nowaczyk, Slawomir
    Pashami, Sepideh
    Mashhadi, Peyman Sheikholharam
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 776 - 785
  • [3] A Wrapper Feature Selection Approach to Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2016, PT I, 2016, 9597 : 685 - 700
  • [4] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Mohammed, Tareq Abed
    Bayat, Oguz
    Ucan, Osman N.
    Alhayali, Shaymaa
    FOUNDATIONS OF SCIENCE, 2020, 25 (04) : 1009 - 1025
  • [5] A wrapper feature selection approach using Markov blankets
    Hassan, Atif
    Paik, Jiaul Hoque
    Khare, Swanand Ravindra
    Hassan, Syed Asif
    PATTERN RECOGNITION, 2025, 158
  • [6] A new wrapper feature selection approach using neural network
    Kabir, Md Monirul
    Islam, Md Monirul
    Murase, Kazuyuki
    NEUROCOMPUTING, 2010, 73 (16-18) : 3273 - 3283
  • [7] A Hybrid Genetic Algorithm With Wrapper-Embedded Approaches for Feature Selection
    Liu, Xiao-Ying
    Liang, Yong
    Wang, Sai
    Yang, Zi-Yi
    Ye, Han-Shuo
    IEEE ACCESS, 2018, 6 : 22863 - 22874
  • [8] A Genetic Based Wrapper Feature Selection Approach Using Nearest Neighbour Distance Matrix
    Sainin, Mohd Shamrie
    Alfred, Rayner
    2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 237 - 242
  • [9] A Weighted Wrapper Approach to Feature Selection
    Kusy, Maciej
    Zajdel, Roman
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2021, 31 (04) : 685 - 696
  • [10] Scalable feature subset selection for big data using parallel hybrid evolutionary algorithm based wrapper under apache spark environment
    Vivek, Yelleti
    Ravi, Vadlamani
    Krishna, P. Radha
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (03): : 1949 - 1983