A Novel Hybrid Feature Selection Model for Classification of Neuromuscular Dystrophies Using Bhattacharyya Coefficient, Genetic Algorithm and Radial Basis Function Based Support Vector Machine

被引:2
作者
Anand, Divya [1 ]
Pandey, Babita [2 ]
Pandey, Devendra K. [3 ]
机构
[1] Lovely Profess Univ, Sch Comp Sci & Engn, Chaheru, Punjab, India
[2] Lovely Profess Univ, Sch Comp Applicat, Chaheru, Punjab, India
[3] Lovely Profess Univ, Sch Biosci, Chaheru, Punjab, India
关键词
Bhattacharyya coefficient; Genetic algorithm; Support vector machine; Neuromuscular disorders; Microarray data; Radial basis function; CANCER CLASSIFICATION; DIAGNOSIS; SVM; SYSTEM;
D O I
10.1007/s12539-016-0183-6
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
An accurate classification of neuromuscular disorders is important in providing proper treatment facilities to the patients. Recently, the microarray technology is employed to monitor the level of activity or expression of large number of genes simultaneously. The gene expression data derived from the microarray experiment usually involve a large number of genes but a very few number of samples. There is a need to reduce the dimension of gene expression data which intends to find a small set of discriminative genes that accurately classifies the samples of various kinds of diseases. So, our goal is to find a small subset of genes which ensures the accurate classification of neuromuscular disorders. In the present paper, we propose a novel hybrid feature selection model for classification of neuromuscular disorders. The process of feature selection is done in two phases by integrating Bhattacharyya coefficient and genetic algorithm (GA). In the first phase, we find Bhattacharyya coefficient to choose a candidate gene subset by removing the most redundant genes. In the second phase, the target gene subset is created by selecting the most discriminative gene subset by applying GA wherein the fitness function is calculated using radial basis function support vector machine (RBF SVM). The proposed hybrid algorithm is applied on two publicly available microarray neuromuscular disorders datasets. The results are compared with two individual techniques of feature selection, namely Bhattacharyya coefficient and GA, and one integrated technique, i.e., Bhattacharyya-GA wherein the fitness function of GA is calculated using four other classifiers, which shows that the proposed integrated method is capable of giving the better classification accuracy.
引用
收藏
页码:244 / 250
页数:7
相关论文
共 35 条
[1]  
Aherne FJ, 1998, KYBERNETIKA, V34, P363
[2]   Artificial neural networks for diagnosis and survival prediction in colon cancer [J].
Ahmed, Farid E. .
MOLECULAR CANCER, 2005, 4 (1)
[3]  
Azuaje F, 2000, ENG MED BIOL SOC ANN, P308, DOI 10.1109/ITAB.2000.892406
[4]  
Babu KG, 2013, INT J APPL INNOV ENG, V2, P215
[5]   Nuclear envelope dystrophies show a transcriptional fingerprint suggesting disruption of Rb-MyoD pathways in muscle regeneration [J].
Bakay, M ;
Wang, ZY ;
Melcon, G ;
Schiltz, L ;
Xuan, JH ;
Zhao, P ;
Sartorelli, V ;
Seo, J ;
Pegoraro, E ;
Angelini, C ;
Shneiderman, B ;
Escolar, D ;
Chen, YW ;
Winokur, ST ;
Pachman, LM ;
Fan, CG ;
Mandler, R ;
Nevo, Y ;
Gordon, E ;
Zhu, YT ;
Dong, YB ;
Wang, Y ;
Hoffman, EP .
BRAIN, 2006, 129 :996-1013
[6]  
Berrar Daniel P, 2003, Pac Symp Biocomput, P5
[7]  
Chen Austin H., 2010, Proceedings of the 2nd International Conference on Software Engineering and Data Mining (SEDM 2010), P378
[8]   The classification of cancer stage microarray data [J].
Chen, Chi-Kan .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 108 (03) :1070-1077
[10]   Feature selection using binary particle swarm optimization and support vector machines for medical diagnosis [J].
Daliri, Mohammad Reza .
BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2012, 57 (05) :395-402