Hybrid feature selection using micro genetic algorithm on microarray gene expression data

Cited by: 13
Authors
Pragadeesh, C. [1]
Jeyaraj, Rohana [1]
Siranjeevi, K. [1]
Abishek, R. [1]
Jeyakumar, G. [1]
Affiliation
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
Keywords
Genetic algorithm; feature selection; microarray; hybrid methods; classification
DOI
10.3233/JIFS-169935
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Research has shown that DNA microarray data containing gene expression profiles are potentially excellent diagnostic tools in medicine. A persistent problem with available microarray datasets is that the number of samples is far smaller than the number of features. Extracting accurate information from such datasets therefore requires a robust technique. Feature selection (FS) has proved to be an effective way to discard irrelevant and noisy data: only the relevant features are picked, which results in good classification accuracy. This paper proposes a model that employs a hybrid feature selection technique (filter + wrapper) to classify microarray cancer data. First, a filter method, Information Gain (IG), is used to eliminate redundant features that will not contribute significantly to the final classification. Next, an evolutionary computing technique, the micro Genetic Algorithm (mGA), is employed to find the best minimal subset of the required features. The selected features are then classified using a traditional Support Vector Classifier with cross-validation, to obtain high classification accuracy with a minimal number of features. Using mGA significantly reduces the complexity of the model compared with existing models that use other feature selection algorithms.
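The abstract gives no implementation details, so the following is only a minimal sketch of the filter + wrapper idea under stated assumptions, not the authors' method. It assumes discretized feature values, a tiny population of five bit masks, uniform crossover without mutation, and a restart whenever all individuals become identical. The names `information_gain`, `micro_ga`, and `fitness` are hypothetical and introduced for illustration only.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(feature, labels):
    """IG of a discrete feature: H(y) - sum_v p(v) * H(y | feature == v)."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def micro_ga(n_bits, fitness, pop_size=5, generations=40, seed=0):
    """Search for a bit mask (1 = keep feature) that maximizes `fitness`.

    Assumed micro-GA variant: very small population, elitism, binary
    tournament selection, uniform crossover, no mutation, and a restart
    around the elite once the population has collapsed to one mask.
    """
    rng = random.Random(seed)

    def random_mask():
        return tuple(rng.randint(0, 1) for _ in range(n_bits))

    pop = [random_mask() for _ in range(pop_size)]
    for _ in range(generations):
        elite = max(pop, key=fitness)
        children = [elite]  # elitism: the best mask always survives
        while len(children) < pop_size:
            p1 = max(rng.sample(pop, 2), key=fitness)
            p2 = max(rng.sample(pop, 2), key=fitness)
            children.append(tuple(rng.choice(pair) for pair in zip(p1, p2)))
        pop = children
        if len(set(pop)) == 1:
            # Converged: keep the elite and re-randomize everyone else.
            pop = [elite] + [random_mask() for _ in range(pop_size - 1)]
    return max(pop, key=fitness)


# Toy data: feature `a` determines the label, feature `b` is uninformative.
labels = [0, 0, 1, 1]
a = [0, 0, 1, 1]
b = [0, 1, 0, 1]
ig_a = information_gain(a, labels)
ig_b = information_gain(b, labels)


def toy_fitness(mask):
    """Hypothetical stand-in for classifier accuracy: rewards bits 0 and 2,
    with a small penalty for every selected feature."""
    return mask[0] + mask[2] - 0.1 * sum(mask)


best_mask = micro_ga(6, toy_fitness)
```

In the setting the abstract describes, the IG scores would first prune the gene list, and `fitness` would presumably be the cross-validated accuracy of a Support Vector Classifier trained on the columns selected by the mask, for instance computed with a library such as scikit-learn. That part is left to the caller here so the sketch stays dependency-free.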
Pages: 2241-2246
Page count: 6