Filter-Wrapper Approach to Feature Selection of GPCR Protein

被引:0
作者
Kamal, Nor Ashikin Mohamad [1 ]
Abu Bakar, Azuraliza [1 ]
Zainudin, Suhaila [1 ]
机构
[1] Fac Informat Sci & Technol, CAIT, Ukm Bangi 43600, Selangor, Malaysia
来源
5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015 | 2015年
关键词
Feature selection; Hierarchical classification; GPCR Proteins; CFS; GA; COUPLED RECEPTORS; HIERARCHICAL-CLASSIFICATION; PREDICTION; ATTRIBUTES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Protein dataset contains high dimensional feature space. These features may encompass of noise and not relatively to protein function. Therefore, we need to select the appropriate features to improve the efficiency and performance of the classifier. Feature selection is an important step in any classification tasks. Filter methods are important in order to obtain only the relevant features to the class and to avoid redundancy. While wrapper methods are applied to get optimized features and better classification accuracy. This paper proposed a feature selection strategy for hierarchical classification of G-Protein-Coupled Receptors (GPCR) based on hybridization of correlation feature selection (CFS) filter and genetic algorithm (GA) wrapper methods. The optimum features were then classified using K-nearest neighbor algorithm. These methods are capable to reduce the features and achieved comparable classification accuracy at every hierarchy level. The results also shown that the integration between CFS and GA is capable of searching the optimum features for hierarchical protein classification.
引用
收藏
页码:693 / 698
页数:6
相关论文
共 33 条
[1]  
[Anonymous], 1998, CORRELATION BASED FE
[2]  
[Anonymous], J ARTIFICIAL EVOLUTI
[3]  
Barros R.C., 2013, J COMPUTER IN PRESS
[4]   GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W383-W389
[5]  
Bhasin M., NUCL ACIDS RES, V32, pW383
[6]   Prediction of protein cellular attributes using pseudo-amino acid composition [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :246-255
[7]   On the hierarchical classification of G protein-coupled receptors [J].
Davies, Matthew N. ;
Secker, Andrew ;
Freitas, Alex A. ;
Mendao, Miguel ;
Timmis, Jon ;
Flower, Darren R. .
BIOINFORMATICS, 2007, 23 (23) :3113-3118
[8]   PseAAC-Builder: A cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions [J].
Du, Pufeng ;
Wang, Xin ;
Xu, Chao ;
Gao, Yang .
ANALYTICAL BIOCHEMISTRY, 2012, 425 (02) :117-119
[9]  
Dumais S., 2000, SIGIR Forum, V34, P256
[10]  
Hayat M., 2010, 6 INT C EM TECHN ICE, P1