Hybrid feature selection using micro genetic algorithm on microarray gene expression data

Cited by: 12
Authors
Pragadeesh, C. [1 ]
Jeyaraj, Rohana [1 ]
Siranjeevi, K. [1 ]
Abishek, R. [1 ]
Jeyakumar, G. [1 ]
Affiliations
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
Keywords
Genetic algorithm; feature selection; microarray; hybrid methods; classification;
DOI
10.3233/JIFS-169935
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Research has shown that DNA microarray data containing gene expression profiles are potentially excellent diagnostic tools in the medical industry. A persistent problem with accessible microarray datasets is that the number of samples is much smaller than the number of features. Thus, to extract accurate information from such a dataset, one must use a robust technique. Feature selection (FS) has proved to be an effective way to discard irrelevant and noisy data. In FS, only the relevant features are picked, which results in commendable classification accuracy. This paper proposes a model that employs a hybrid feature selection technique (Filter + Wrapper) to classify microarray cancer data. Initially, a filter method called Information Gain (IG) is used to eliminate redundant features that will not contribute significantly to the final classification. Following that, an evolutionary computing technique, the micro Genetic Algorithm (mGA), is employed to find the best minimal subset of required features. The selected features are then classified using a traditional Support Vector Classifier and cross-validated to obtain high classification accuracy with a minimal number of features. The complexity of the model is reduced significantly by adding mGA, as opposed to existing models that use various other feature selection algorithms.
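The filter-then-wrapper pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the median-split binarisation used for Information Gain, the population size of 5, the restart scheme, the 0.01 size penalty, and every function name are hypothetical, and a leave-one-out nearest-centroid classifier stands in for the Support Vector Classifier so that the sketch needs only the standard library.

```python
import math
import random
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())

def information_gain(column, labels):
    """IG of one feature after a median split (the binarisation is an assumption)."""
    median = sorted(column)[len(column) // 2]
    low = [y for x, y in zip(column, labels) if x < median]
    high = [y for x, y in zip(column, labels) if x >= median]
    parts = [p for p in (low, high) if p]
    return entropy(labels) - sum(len(p) / len(labels) * entropy(p) for p in parts)

def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a nearest-centroid classifier (stand-in for the SVC)."""
    if not features:
        return 0.0
    correct = 0
    for i in range(len(X)):
        centroids = {}
        for label in set(y):
            rows = [X[j] for j in range(len(X)) if j != i and y[j] == label]
            if rows:
                centroids[label] = [sum(r[f] for r in rows) / len(rows) for f in features]
        def dist(label):
            return sum((X[i][f] - m) ** 2 for f, m in zip(features, centroids[label]))
        correct += min(centroids, key=dist) == y[i]
    return correct / len(X)

def micro_ga(X, y, candidates, pop_size=5, generations=20, restarts=3, seed=0):
    """Tiny-population GA: elitism and crossover only, with periodic restarts."""
    rng = random.Random(seed)

    def fitness(mask):
        feats = [f for f, bit in zip(candidates, mask) if bit]
        # Reward accuracy, lightly penalise subset size (penalty weight is assumed).
        return loo_accuracy(X, y, feats) - 0.01 * len(feats)

    def random_mask():
        return [rng.randint(0, 1) for _ in candidates]

    best = random_mask()
    for _ in range(restarts):
        # Restart: keep the best mask found so far, re-randomise the rest.
        pop = [best] + [random_mask() for _ in range(pop_size - 1)]
        for _ in range(generations):
            pop.sort(key=fitness, reverse=True)
            children = [pop[0]]  # elitism
            while len(children) < pop_size:
                a, b = rng.sample(pop[:3], 2)
                cut = rng.randrange(1, len(candidates))
                children.append(a[:cut] + b[cut:])  # one-point crossover
            pop = children
        best = max(pop + [best], key=fitness)
    return [f for f, bit in zip(candidates, best) if bit]

# Toy data: 20 samples, 30 features, only feature 3 separates the two classes.
data_rng = random.Random(1)
y = [0] * 10 + [1] * 10
X = [[data_rng.random() for _ in range(30)] for _ in y]
for row, label in zip(X, y):
    row[3] += 2 * label

# Filter step: keep the top-k features by Information Gain (k = 5 is an assumption).
gains = [information_gain([row[f] for row in X], y) for f in range(30)]
candidates = sorted(range(30), key=lambda f: gains[f], reverse=True)[:5]

# Wrapper step: search the reduced space for a small, accurate subset.
selected = micro_ga(X, y, candidates)
print(candidates, selected, loo_accuracy(X, y, selected))
```

The filter step shrinks the search space before the wrapper runs, so the micro GA only ever evaluates subsets of the few candidates that survive it. With scikit-learn available, the stand-ins could plausibly be replaced by `mutual_info_classif`, `SVC`, and `cross_val_score`, though that substitution is not shown here.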
Pages: 2241-2246
Number of pages: 6
Related papers
50 in total
[21]   A feature selection method using fixed-point algorithm for DNA microarray gene expression data [J].
Sharma, Alok ;
Paliwal, Kuldip K. ;
Imoto, Seiya ;
Miyano, Satoru ;
Sharma, Vandana ;
Ananthanarayanan, Rajeshkannan .
INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (01) :55-59
[22]   A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification [J].
Almugren, Nada ;
Alshamlan, Hala .
IEEE ACCESS, 2019, 7 :78533-78548
[23]   Hybrid Feature Selection Method using Gene Expression Data [J].
Chuang, Li-Yeh ;
Wu, Kuo-Chuan ;
Yang, Cheng-Hong .
2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08, 2009, :199-+
[24]   A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data [J].
Wang, Hong ;
Jing, Xingjian ;
Niu, Ben .
KNOWLEDGE-BASED SYSTEMS, 2017, 126 :8-19
[25]   Development of a library with feature selection algorithm based on microarray gene expression dataset for biomarker identification [J].
Lee, Sangbum ;
Oh, Sejong .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 16 (02) :93-110
[26]   A Hybrid Discrete Imperialist Competition Algorithm for Gene Selection for Microarray Data [J].
Aorigele ;
Tang, Zheng ;
Todo, Yuki ;
Gao, Shangce .
CURRENT PROTEOMICS, 2018, 15 (02) :99-110
[28]   Gene Selection for Microarray Data by a LDA-Based Genetic Algorithm [J].
Huerta, Edmundo Bonilla ;
Duval, Beatrice ;
Hao, Jin-Kao .
PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 :250-261
[29]   HYBRIDIZATION OF GENETIC AND QUANTUM ALGORITHM FOR GENE SELECTION AND CLASSIFICATION OF MICROARRAY DATA [J].
Abderrahim, Allani ;
Talbi, El-Ghazali ;
Khaled, Mellouli .
INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (02) :431-444
[30]   Hybrid feature selection based on SLI and genetic algorithm for microarray datasets [J].
Abasabadi, Sedighe ;
Nematzadeh, Hossein ;
Motameni, Homayun ;
Akbari, Ebrahim .
JOURNAL OF SUPERCOMPUTING, 2022, 78 (18) :19725-19753