Hybrid feature selection using micro genetic algorithm on microarray gene expression data

被引:13
作者
Pragadeesh, C. [1 ]
Jeyaraj, Rohana [1 ]
Siranjeevi, K. [1 ]
Abishek, R. [1 ]
Jeyakumar, G. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
关键词
Genetic algorithm; feature selection; microarray; hybrid methods; classification;
D O I
10.3233/JIFS-169935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research has proved that DNA Microarray data containing gene expression profiles are potentially excellent diagnostic tools in the medical industry. A persistent problem with regard to accessible microarray datasets is that the number of samples are much lesser than the number of features that are present. Thus, in order to extract accurate information from the dataset, one must use a robust technique. Feature selection (FS) has proved to be an effective way by which irrelevant and noisy data can be discarded. In FS, relevant features are picked, and result in commendable classification accuracy. This paper proposes a model that employs a compounded / hybrid feature selection technique (Filter + Wrapper) to classify microarray cancer data. Initially, a filter method called Information Gain (IG) to eliminate redundant features that will not contribute significantly to the final classification is used. Following to that, an evolutionary computing technique (micro Genetic Algorithm (mGA)) to find the best minimal subset of required features is employed. Then the features are classified using a traditional Support Vector Classifier and also cross validated to obtain high classification accuracy, using a minimal number of features. The complexity of the model is reduced significantly by adding mGA, as opposed to already existing models that use various other feature selection algorithms.
引用
收藏
页码:2241 / 2246
页数:6
相关论文
共 50 条
[41]   Simulated annealing aided genetic algorithm for gene selection from microarray data [J].
Marjit, Shyam ;
Bhattacharyya, Trinav ;
Chatterjee, Bitanu ;
Sarkar, Ram .
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 158
[42]   Hybrid genetic algorithm-neural network: Feature extraction for unpreprocessed microarray data [J].
Tong, Dong Ling ;
Schierz, Amanda C. .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2011, 53 (01) :47-56
[43]   A Filter Based Feature Selection Algorithm Using Null Space of Covariance Matrix for DNA Microarray Gene Expression Data [J].
Sharma, Alok ;
Imoto, Seiya ;
Miyano, Satoru .
CURRENT BIOINFORMATICS, 2012, 7 (03) :289-294
[44]   A novel feature selection method for data mining tasks using hybrid Sine Cosine Algorithm and Genetic Algorithm [J].
Laith Abualigah ;
Akram Jamal Dulaimi .
Cluster Computing, 2021, 24 :2161-2176
[45]   Feature Selection for Cancer Classification on Microarray Expression Data [J].
Hsu, Hui-Huang ;
Lu, Ming-Da .
ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, :153-158
[46]   A novel feature selection method for data mining tasks using hybrid Sine Cosine Algorithm and Genetic Algorithm [J].
Abualigah, Laith ;
Dulaimi, Akram Jamal .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03) :2161-2176
[47]   A Hybrid Filter/Wrapper Approach of Feature Selection for Gene Expression Data [J].
Ke, Chao-Hsuan ;
Yang, Cheng-Hong ;
Chuang, Li-Yeh ;
Yang, Cheng-San .
2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, :2663-+
[48]   A hybrid of clustering and quantum genetic algorithm for relevant genes selection for cancer microarray data [J].
Sardana, Manju ;
Agrawal, R. K. ;
Kaur, Baljeet .
INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2016, 20 (03) :161-173
[49]   A novel hybrid feature selection method for microarray data analysis [J].
Lee, Chien-Pang ;
Leu, Yungho .
APPLIED SOFT COMPUTING, 2011, 11 (01) :208-213
[50]   Hybrid GA-IBPSO for feature selection using microarray data [J].
Yang, Cheng-San ;
Chuang, Li-Yeh ;
Ho, Chang-Hsuan ;
Yang, Cheng-Hong .
IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, :284-+