Functional discrimination of membrane proteins using machine learning techniques

被引:57
作者
Gromiha, M. Michael [1 ]
Yabuki, Yukimitsu [1 ]
机构
[1] AIST Tokyo, CBRC, Koto Ku, Tokyo 1350064, Japan
关键词
D O I
10.1186/1471-2105-9-135
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions, such as channels/pores, electrochemical potential-driven transporters and primary active transporters. Results: We observed that the residues Asp, Asn and Tyr are dominant in channels/pores whereas the composition of hydrophobic residues, Phe, Gly, Ile, Leu and Val is high in electrochemical potential-driven transporters. The composition of all the amino acids in primary active transporters lies in between other two classes of proteins. We have utilized different machine learning algorithms, such as, Bayes rule, Logistic function, Neural network, Support vector machine, Decision tree etc. for discriminating these classes of proteins. We observed that most of the algorithms have discriminated them with similar accuracy. The neural network method discriminated the channels/pores, electrochemical potential-driven transporters and active transporters with the 5-fold cross validation accuracy of 64% in a data set of 1718 membrane proteins. The application of amino acid occurrence improved the overall accuracy to 68%. In addition, we have discriminated transporters from other alpha-helical and beta-barrel membrane proteins with the accuracy of 85% using k-nearest neighbor method. The classification of transporters and all other proteins (globular and membrane) showed the accuracy of 82%. Conclusion: The performance of discrimination with amino acid occurrence is better than that with amino acid composition. We suggest that this method could be effectively used to discriminate transporters from all other globular and membrane proteins, and classify them into channels/pores, electrochemical and active transporters.
引用
收藏
页数:8
相关论文
共 28 条
[1]   Structure and mechanism of the lactose permease of Escherichia coli [J].
Abramson, J ;
Smirnova, I ;
Kasho, V ;
Verner, G ;
Kaback, HR ;
Iwata, S .
SCIENCE, 2003, 301 (5633) :610-615
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
[Anonymous], 2005, Data Mining Pratical Machine Learning Tools and Techniques
[4]   A Hidden Markov Model method, capable of predicting and discriminating β-barrel outer membrane proteins -: art. no. 29 [J].
Bagos, PG ;
Liakopoulos, TD ;
Spyropoulos, IC ;
Hamodrakas, SJ .
BMC BIOINFORMATICS, 2004, 5 (1)
[5]   The structure of Escherichia coli BtuF and binding to its cognate ATP binding cassette transporter [J].
Borths, EL ;
Locher, KP ;
Lee, AT ;
Rees, DC .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (26) :16642-16647
[6]   Predicting membrane protein type by functional domain composition and pseudo-amino acid composition [J].
Cai, YD ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 238 (02) :395-400
[7]   The Escherichia coli outer membrane cobalamin transporter BtuB:: Structural analysis of calcium and substrate binding, and identification of orthologous transporters by sequence/structure conservation [J].
Chimento, DP ;
Kadner, RJ ;
Wiener, MC .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (05) :999-1014
[8]   Substrate-induced transmembrane signaling in the cobalamin transporter BtuB [J].
Chimento, DP ;
Mohanty, AK ;
Kadner, RJ ;
Wiener, MC .
NATURE STRUCTURAL BIOLOGY, 2003, 10 (05) :394-401
[9]   Gating the selectivity filter in ClC chloride channels [J].
Dutzler, R ;
Campbell, EB ;
MacKinnon, R .
SCIENCE, 2003, 300 (5616) :108-112
[10]   Structure of a glycerol-conducting channel and the basis for its selectivity [J].
Fu, DX ;
Libson, A ;
Miercke, LJW ;
Weitzman, C ;
Nollert, P ;
Krucinski, J ;
Stroud, RM .
SCIENCE, 2000, 290 (5491) :481-486