Mem-PHybrid: Hybrid features-based prediction system for classifying membrane protein types

被引:39
作者
Hayat, Maqsood [1 ]
Khan, Asifullah [1 ]
机构
[1] PIEAS, Dept Comp & Informat Sci, Islamabad, Pakistan
关键词
Membrane proteins; Split amino acid composition; Physicochemical properties; Multi-profiles Bayes; Random forest; Support vector machine; ET-KNN; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; SUBCELLULAR LOCATION; TOPOLOGY PREDICTION; WEB SERVER; CLASSIFICATION; NETWORKS; LOCALIZATION; WATERMARK; ENSEMBLE;
D O I
10.1016/j.ab.2012.02.007
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Membrane proteins are a major class of proteins and encoded by approximately 20% to 30% of genes in most organisms. In this work, a two-layer novel membrane protein prediction system, called Mem-PHybrid, is proposed. It is able to first identify the protein query as a membrane or nonmembrane protein. In the second level, it further identifies the type of membrane protein. The proposed Mem-PHybrid prediction system is based on hybrid features, whereby a fusion of both the physicochemical and split amino acid composition-based features is performed. This enables the proposed Mem-PHybrid to exploit the discrimination capabilities of both types of feature extraction strategy. In addition, minimum redundancy and maximum relevance has also been applied to reduce the dimensionality of a feature vector. We employ random forest, evidence-theoretic K-nearest neighbor, and support vector machine (SVM) as classifiers and analyze their performance on two datasets. SVM using hybrid features yields the highest accuracy of 89.6% and 97.3% on dataset1 and 91.5% and 95.5% on dataset2 for jackknife and independent dataset tests, respectively. The enhanced prediction performance of Mem-PHybrid is largely attributed to the exploitation of the discrimination power of the hybrid features and of the learning capability of SVM. Mem-PHybrid is accessible at http://111.68.99.218/Mem-PHybrid. (c) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:35 / 44
页数:10
相关论文
共 66 条
[1]   Bayesian Models and Algorithms for Protein β-Sheet Prediction [J].
Aydin, Zafer ;
Altunbasak, Yucel ;
Erdogan, Hakan .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (02) :395-409
[2]   PRED-TMBB:: a web server for predicting the topology of β-barrel outer membrane proteins [J].
Bagos, PG ;
Liakopoulos, TD ;
Spyropoulos, IC ;
Hamodrakas, SJ .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W400-W404
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, X ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3692-3697
[5]   Application of SVM to predict membrane protein types [J].
Cai, YD ;
Ricardo, PW ;
Jen, CH ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2004, 226 (04) :373-376
[6]   The structural basis for intramembrane assembly of an activating immunoreceptor complex [J].
Call, Matthew E. ;
Wucherpfennig, Kai W. ;
Chou, James J. .
NATURE IMMUNOLOGY, 2010, 11 (11) :1023-U73
[7]   Prediction of Integral Membrane Protein Type by Collocated Hydrophobic Amino Acid Pair [J].
Chen, Ke ;
Jiang, Yingfu ;
Du, Li ;
Kurgan, Lukasz .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (01) :163-172
[8]   Analysis of Protein Pathway Networks Using Hybrid Properties [J].
Chen, Lei ;
Huang, Tao ;
Shi, Xiao-He ;
Cai, Yu-Dong ;
Chou, Kuo-Chen .
MOLECULES, 2010, 15 (11) :8177-8192
[9]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[10]  
Chou K., 2010, Nat. Sci, V2, P1090, DOI DOI 10.4236/NS.2010.210136