A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

被引:11
|
作者
Bouaguel, Waad [1 ]
机构
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
来源
INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES 2015 | 2016年 / 5卷
关键词
Wrapper; Feature selection; Big data; CLASSIFICATION; PREDICTION;
D O I
10.1007/978-3-319-27000-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes testing and training of general classification method difficult. Special data analysis is demanded in this case and one of the common ways to handle high dimensionality is identification of the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods have some limitations due to the fact that their result depends on the search strategy. In theory when a complex search is used, it may take much longer to choose the best subset of features and may be impractical in some cases. Hence we propose a new wrapper feature selection for big data based on a random search using genetic algorithm and prior information. The new approach was tested on 2 biological dataset and compared to two well known wrapper feature selection approaches and results illustrate that our approach gives the best performances.
引用
收藏
页码:75 / 83
页数:9
相关论文
共 50 条
  • [31] Binary Sand Cat Swarm Optimization Algorithm for Wrapper Feature Selection on Biological Data
    Seyyedabbasi, Amir
    BIOMIMETICS, 2023, 8 (03)
  • [32] Hybrid filter-wrapper feature selection using whale optimization algorithm: A multi-objective approach
    Got, Adel
    Moussaoui, Abdelouahab
    Zouache, Djaafar
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [33] A greedy feature selection algorithm for Big Data of high dimensionality
    Ioannis Tsamardinos
    Giorgos Borboudakis
    Pavlos Katsogridakis
    Polyvios Pratikakis
    Vassilis Christophides
    Machine Learning, 2019, 108 : 149 - 202
  • [34] Using a genetic algorithm and a perceptron for feature selection and supervised class learning in DNA microarray data
    Karzynski, M
    Mateos, A
    Herrero, J
    Dopazo, J
    ARTIFICIAL INTELLIGENCE REVIEW, 2003, 20 (1-2) : 39 - 51
  • [35] A wrapper-based feature selection approach using osprey optimisation for software fault detection
    Rath, Pradeep Kumar
    Ghosh, Soumili
    Gourisaria, Mahendra Kumar
    Mahato, Susmita
    Das, Himansu
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2025, 18 (01) : 1 - 19
  • [36] Feature Selection Using Diploid Genetic Algorithm
    Jasuja A.
    Annals of Data Science, 2020, 7 (01) : 33 - 43
  • [37] A greedy feature selection algorithm for Big Data of high dimensionality
    Tsamardinos, Ioannis
    Borboudakis, Giorgos
    Katsogridakis, Pavlos
    Pratikakis, Polyvios
    Christophides, Vassilis
    MACHINE LEARNING, 2019, 108 (02) : 149 - 202
  • [38] Wrapper-based Feature Selection for Imbalanced Data using Binary Queuing Search Algorithm
    Thaher, Thaer
    Mafarja, Majdi
    Abdalhaq, Baker
    Chantar, Hamouda
    2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 318 - 323
  • [39] Efficient Genetic-Wrapper Algorithm Based Data Mining for Feature Subset Selection in a Power Quality Pattern Recognition Application
    Krishna, Brahmadesam
    Kaliaperumal, Baskaran
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2011, 8 (04) : 397 - 405
  • [40] A Novel Genetic Algorithm Approach to Simultaneous Feature Selection and Instance Selection
    Albuquerque, Inti Mateus Resende
    Bach Hoai Nguyen
    Xue, Bing
    Zhang, Mengjie
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 616 - 623