CANCER MICROARRAY DATA FEATURE SELECTION USING MULTI-OBJECTIVE BINARY PARTICLE SWARM OPTIMIZATION ALGORITHM

被引:32
作者
Annavarapu, Chandra Sekhara Rao [1 ]
Dara, Suresh [1 ]
Banka, Haider [1 ]
机构
[1] Indian Sch Mines, Dept Comp Sci & Engn, Dhanbad 826004, Jharkhand, India
来源
EXCLI JOURNAL | 2016年 / 15卷
关键词
Cancer micro array; gene expressions; feature selection; binary PSO; classification; CLASSIFICATION; TECHNOLOGY; ENSEMBLE;
D O I
10.17179/excli2016-481
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cancer investigations in microarray data play a major role in cancer analysis and the treatment. Cancer microarray data consists of complex gene expressed patterns of cancer. In this article, a Multi-Objective Binary Particle Swarm Optimization (MOBPSO) algorithm is proposed for analyzing cancer gene expression data. Due to its high dimensionality, a fast heuristic based pre-processing technique is employed to reduce some of the crude domain features from the initial feature set. Since these pre-processed and reduced features are still high dimensional, the proposed MOBPSO algorithm is used for finding further feature subsets. The objective functions are suitably modeled by optimizing two conflicting objectives i.e., cardinality of feature subsets and distinctive capability of those selected subsets. As these two objective functions are conflicting in nature, they are more suitable for multi-objective modeling. The experiments are carried out on benchmark gene expression datasets, i.e., Colon, Lymphoma and Leukaemia available in literature. The performance of the selected feature subsets with their classification accuracy and validated using 10 fold cross validation techniques. A detailed comparative study is also made to show the betterment or competitiveness of the proposed algorithm.
引用
收藏
页码:460 / 473
页数:14
相关论文
共 34 条
[1]  
[Anonymous], P IEEE C SYST MAN CY
[2]  
[Anonymous], 1995, 1995 IEEE INT C
[3]  
[Anonymous], 2008, P 14 ACM SIGKDD INT
[4]   Evolutionary rough feature selection in gene expression data [J].
Banerjee, Mohua ;
Mitra, Sushmita ;
Banka, Haider .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2007, 37 (04) :622-632
[5]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[6]   Analysis of microarray data using Z score transformation [J].
Cheadle, C ;
Vawter, MP ;
Freed, WJ ;
Becker, KG .
JOURNAL OF MOLECULAR DIAGNOSTICS, 2003, 5 (02) :73-81
[7]  
Coello CAC, 2002, IEEE C EVOL COMPUTAT, P1051, DOI 10.1109/CEC.2002.1004388
[8]  
Deb K., 2001, MULTIOBJECTIVE OPTIM, V16
[9]   A filter model for feature subset selection based on genetic algorithm [J].
Elalami, M. E. .
KNOWLEDGE-BASED SYSTEMS, 2009, 22 (05) :356-362
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537