A feature selection approach combining neural networks with genetic algorithms

被引:1
作者
Huang, Zhi [1 ]
机构
[1] Mianyang Teachers Coll, Sch Informat Engn, Mianyang, Sichuan, Peoples R China
关键词
Feature selection; classification accuracy; Genetic Algorithms; neural networks; population size; computational time; UNSUPERVISED FEATURE-SELECTION; PARTICLE SWARM OPTIMIZATION; CLASSIFICATION; INFORMATION; POPULATION;
D O I
10.3233/AIC-190626
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Value Feature selection is an effective method to solve the curse of dimensionality, which widely employs Evolutionary Computation (EC), such as Genetic Algorithms (GA), by regarding feature subsets as individuals. However, it is impossible for EC based feature selection approaches to possess big population sizes because of very long and infeasible computational time. We have proposed a method screening individuals by estimating their classification performances rapidly instead of deriving theirs with a certain classifier dilatorily. Consequently, aiming at improving classification accuracies, we propose an approach named as FS-NN-GA (Feature Selection approach based on Neural Networks and Genetic Algorithms) in this work. The proposed approach employs the neural networks trained with some randomly generated individuals, and their actual classification accuracies to estimate individuals' classification accuracies and screens them in each round of GA. The individuals with low estimated accuracies are directly eliminated. Only a small number of individuals with high estimated accuracies are reserved, evaluated by deriving their accuracies with a certain classifier, and participate GA operations to be explored emphatically. As a result, big population sizes become feasible, and a huge number of individuals can be considered by GA in acceptable and feasible time, which improves performances of GA and derives high accuracies. We perform the experiments with 10 data sets in comparison with 11 available approaches. The experimental results show that FS-NN-GA outperforms other approaches on most data sets.
引用
收藏
页码:361 / 372
页数:12
相关论文
共 50 条
[1]   Improving feature ranking for biomarker discovery in proteomics mass spectrometry data using genetic programming [J].
Ahmed, Soha ;
Zhang, Mengjie ;
Peng, Lifeng .
CONNECTION SCIENCE, 2014, 26 (03) :215-243
[2]   Determining appropriate approaches for using data in feature selection [J].
Aldehim, Ghadah ;
Wang, Wenjia .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (03) :915-928
[3]   LEARNING BOOLEAN CONCEPTS IN THE PRESENCE OF MANY IRRELEVANT FEATURES [J].
ALMUALLIM, H ;
DIETTERICH, TG .
ARTIFICIAL INTELLIGENCE, 1994, 69 (1-2) :279-305
[4]  
[Anonymous], P 9 INT WORKSH MACH
[5]   Genetic algorithm with healthy population and multiple streams sharing information for clustering [J].
Beg, A. H. ;
Islam, Md Zahidul ;
Estivill-Castro, Vladimir .
KNOWLEDGE-BASED SYSTEMS, 2016, 114 :61-78
[6]   Information-theoretic selection of high-dimensional spectral features for structural recognition [J].
Bonev, Boyan ;
Escolano, Francisco ;
Giorgi, Daniela ;
Biasotti, Silvia .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (03) :214-228
[7]   A review on neural networks with random weights [J].
Cao, Weipeng ;
Wang, Xizhao ;
Ming, Zhong ;
Gao, Jinzhu .
NEUROCOMPUTING, 2018, 275 :278-287
[8]   An Integrated Mechanism for Feature Selection and Fuzzy Rule Extraction for Classification [J].
Chen, Yi-Cheng ;
Pal, Nikhil R. ;
Chung, I-Fang .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (04) :683-698
[9]   An architecture for emergency event prediction using LSTM recurrent neural networks [J].
Cortez, Bitzel ;
Carrera, Berny ;
Kim, Young-Jin ;
Jung, Jae-Yoon .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 97 :315-324
[10]  
Dash M., 1997, Intelligent Data Analysis, V1