A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

被引:11
|
作者
Bouaguel, Waad [1 ]
机构
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
来源
INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2015 | 2016年 / 5卷
关键词
Wrapper; Feature selection; Big data; CLASSIFICATION; PREDICTION;
D O I
10.1007/978-3-319-27000-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes testing and training of general classification method difficult. Special data analysis is demanded in this case and one of the common ways to handle high dimensionality is identification of the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods have some limitations due to the fact that their result depends on the search strategy. In theory when a complex search is used, it may take much longer to choose the best subset of features and may be impractical in some cases. Hence we propose a new wrapper feature selection for big data based on a random search using genetic algorithm and prior information. The new approach was tested on 2 biological dataset and compared to two well known wrapper feature selection approaches and results illustrate that our approach gives the best performances.
引用
收藏
页码:75 / 83
页数:9
相关论文
共 50 条
  • [21] A hybrid genetic algorithm for feature selection wrapper based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    PATTERN RECOGNITION LETTERS, 2007, 28 (13) : 1825 - 1844
  • [22] Enhancing Big Data Feature Selection Using a Hybrid Correlation-Based Feature Selection
    Mohamad, Masurah
    Selamat, Ali
    Krejcar, Ondrej
    Crespo, Ruben Gonzalez
    Herrera-Viedma, Enrique
    Fujita, Hamido
    ELECTRONICS, 2021, 10 (23)
  • [23] A novel ensemble-based wrapper method for feature selection using extreme learning machine and genetic algorithm
    Xue, Xiaowei
    Yao, Min
    Wu, Zhaohui
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 57 (02) : 389 - 412
  • [24] A new hybrid filter/wrapper algorithm for feature selection in classification
    Zhang, Jixiong
    Xiong, Yanmei
    Min, Shungeng
    ANALYTICA CHIMICA ACTA, 2019, 1080 : 43 - 54
  • [25] Feature Selection in Big Data using Filter Based Techniques
    Srinivas, Sumitra K.
    Kancharla, Gangadhara Rao
    2019 4TH MEC INTERNATIONAL CONFERENCE ON BIG DATA AND SMART CITY (ICBDSC), 2019, : 139 - 145
  • [26] Analysis of Feature Selection and Extraction Algorithm for Loan Data: A Big Data Approach
    Attigeri, Girija
    Pai, Manohara M. M.
    Pai, Radhika M.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 2147 - 2151
  • [27] Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm
    Mohamed A. Tawhid
    Abdelmonem M. Ibrahim
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 573 - 602
  • [28] A novel hybrid wrapper-filter approach based on genetic algorithm, particle swarm optimization for feature subset selection
    Moslehi, Fateme
    Haeri, Abdorrahman
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (03) : 1105 - 1127
  • [29] Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm
    Tawhid, Mohamed A.
    Ibrahim, Abdelmonem M.
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 573 - 602
  • [30] Feature Selection Using Submodular Approach for Financial Big Data
    Attigeri, Girija
    Pai, Manohara M. M.
    Pai, Radhika M.
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2019, 15 (06): : 1306 - 1325