Prediction of Integral Membrane Protein Type by Collocated Hydrophobic Amino Acid Pair

被引:71
|
作者
Chen, Ke [1 ]
Jiang, Yingfu [1 ]
Du, Li [1 ]
Kurgan, Lukasz [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB, Canada
关键词
type of integral membrane protein; transmembrane protein; hydrophobic AA pairs; PSI-BLAST profile; support vector machine; SUPPORT VECTOR MACHINES; GXXXG MOTIF; WEB SERVER; CLASSIFIER; INFORMATION; SEGMENTS; REVEALS; CHANNEL; SVM;
D O I
10.1002/jcc.21053
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A computational model, IMP-TYPE, is proposed for the classification of five types of integral membrane proteins from protein sequence. The proposed model aims not only at providing accurate predictions but most importantly it incorporates interesting and transparent biological patterns. When contrasted with the best-performing existing models, IMP-TYPE reduces the error rates of these methods by 19 and 34% for two out-of-sample tests performed on benchmark datasets. Our empirical evaluations also show that the proposed method provides even bigger improvements, i.e., 29 and 45% error rate reductions, when predictions are performed for sequences that share low (40%) identity with sequences from the training dataset. We also show that IMP-TYPE can be used in a standalone mode, i.e., it duplicates significant majority of correct predictions provided by other leading methods, while providing additional correct predictions which are incorrectly classified by the other methods. Our method computes predictions using a Support Vector Machine classifier that takes feature-based encoded sequence as its input. The input feature set includes hydrophobic AA pairs, which were selected by utilizing a consensus of three feature selection algorithms. The hydrophobic residues that build tip the AA pairs used by our method are shown to be associated with the formation of transmembrane helices in a few recent studies concerning integral membrane proteins. Our study also indicates that Met and Phe display a certain degree of hydrophobicity, which may be snore crucial than their polarity or aromaticity when they occur in the transmembrane segments. This conclusion is supported by a recent study on potential of mean force for membrane protein folding and a study of scales for membrane propensity of amino acids. (C) 2008 Wiley Periodicals, Inc. J Comput Chem 30: 163-172, 2009
引用
收藏
页码:163 / 172
页数:10
相关论文
共 50 条
  • [41] An Ensemble Classifier for Eukaryotic Protein Subcellular Location Prediction Using Gene Ontology Categories and Amino Acid Hydrophobicity
    Li, Liqi
    Zhang, Yuan
    Zou, Lingyun
    Li, Changqing
    Yu, Bo
    Zheng, Xiaoqi
    Zhou, Yue
    PLOS ONE, 2012, 7 (01):
  • [42] Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition
    Ahmad, Khurshid
    Waris, Muhammad
    Hayat, Maqsood
    JOURNAL OF MEMBRANE BIOLOGY, 2016, 249 (03) : 293 - 304
  • [43] Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis
    Liu, Bin
    Chen, Junjie
    Wang, Xiaolong
    MOLECULAR GENETICS AND GENOMICS, 2015, 290 (05) : 1919 - 1931
  • [44] Classification of membrane protein types using Voting Feature Interval in combination with Chou's Pseudo Amino Acid Composition
    Ali, Farman
    Hayat, Maqsood
    JOURNAL OF THEORETICAL BIOLOGY, 2015, 384 : 78 - 83
  • [45] Improving Protein Localization Prediction Using Amino Acid Group Based Physichemical Encoding
    Hu, Jianjun
    Zhang, Fan
    BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 248 - 258
  • [46] Prediction of apoptosis protein subcellular location based on amphiphilic pseudo amino acid composition
    Su, Wenxia
    Deng, Shuyi
    Gu, Zhifeng
    Yang, Keli
    Ding, Hui
    Chen, Hui
    Zhang, Zhaoyue
    FRONTIERS IN GENETICS, 2023, 14
  • [47] A Homology and Pseudo Amino Acid Composition-based Multi-label Model for Predicting Human Membrane Protein Types
    Huang, Yanjun
    Huang, Guohua
    CURRENT PROTEOMICS, 2018, 15 (02) : 135 - 141
  • [48] Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine
    Chen, Chao
    Chen, Lixuan
    Zou, Xiaoyong
    Cai, Peixiang
    PROTEIN AND PEPTIDE LETTERS, 2009, 16 (01) : 27 - 31
  • [49] ProClusEnsem: Predicting membrane protein types by fusing different modes of pseudo amino acid composition
    Wang, Jingyan
    Li, Yongping
    Wang, Quanquan
    You, Xinge
    Man, Jiaju
    Wang, Chao
    Gao, Xin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2012, 42 (05) : 564 - 574
  • [50] Amino acid composition analysis of human secondary transport proteins and implications for reliable membrane topology prediction
    Saidijam, Massoud
    Azizpour, Sonia
    Patching, Simon G.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2017, 35 (05) : 929 - 949