Sequence-based prediction of protein interaction sites with an integrative method

被引:117
|
作者
Chen, Xue-Wen [1 ,2 ]
Jeong, Jong Cheol [1 ]
机构
[1] Univ Kansas, Informat & Telecommun Technol Ctr, Bioinformat & Computat Life Sci Lab, Lawrence, KS 66045 USA
[2] Univ Kansas, Dept Comp Sci & Elect Engn, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
MOLECULAR CHAPERONE; SURFACE COMPLEMENTARITY; HYDROPHOBIC MOMENT; SUBSTRATE-BINDING; CRYSTAL-STRUCTURE; SOFT DOCKING; J-DOMAIN; RECOGNITION; CONSERVATION; MUTATIONS;
D O I
10.1093/bioinformatics/btp039
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Identification of protein interaction sites has significant impact on understanding protein function, elucidating signal transduction networks and drug design studies. With the exponentially growing protein sequence data, predictive methods using sequence information only for protein interaction site prediction have drawn increasing interest. In this article, we propose a predictive model for identifying protein interaction sites. Without using any structure data, the proposed method extracts a wide range of features from protein sequences. A random forest-based integrative model is developed to effectively utilize these features and to deal with the imbalanced data classification problem commonly encountered in binding site predictions. Results: We evaluate the predictive method using 2829 interface residues and 24 616 non-interface residues extracted from 99 polypeptide chains in the Protein Data Bank. The experimental results show that the proposed method performs significantly better than two other sequence-based predictive methods and can reliably predict residues involved in protein interaction sites. Furthermore, we apply the method to predict interaction sites and to construct three protein complexes: the DnaK molecular chaperone system, 1YUW and 1DKG, which provide new insight into the sequence function relationship. We show that the predicted interaction sites can be valuable as a first approach for guiding experimental methods investigating protein-protein interactions and localizing the specific interface residues.
引用
收藏
页码:585 / 591
页数:7
相关论文
共 50 条
  • [31] Prediction of protein-protein interaction types using association rule based classification
    Park, Sung Hee
    Reyes, Jose A.
    Gilbert, David R.
    Kim, Ji Woong
    Kim, Sangsoo
    BMC BIOINFORMATICS, 2009, 10
  • [32] SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues
    Yang, Xiaoxia
    Wang, Jia
    Sun, Jun
    Liu, Rong
    PLOS ONE, 2015, 10 (07):
  • [33] A sequence-based method for predicting extant fold switchers that undergo α-helix ⇆ β-strand transitions
    Mishra, Soumya
    Looger, Loren L.
    Porter, Lauren L.
    BIOPOLYMERS, 2021, 112 (10)
  • [34] Prediction of protein-protein interaction sites by means of ensemble learning and weighted feature descriptor
    Du, Xiuquan
    Sun, Shiwei
    Hu, Changlin
    Li, Xinrui
    Xia, Junfeng
    JOURNAL OF BIOLOGICAL RESEARCH-THESSALONIKI, 2016, 23
  • [35] Protein complex prediction based on simultaneous protein interaction network
    Jung, Suk Hoon
    Hyun, Bora
    Jang, Woo-Hyuk
    Hur, Hee-Young
    Han, Dong-Soo
    BIOINFORMATICS, 2010, 26 (03) : 385 - 391
  • [36] Highly accurate sequence-based prediction of half-sphere exposures of amino acid residues in proteins
    Heffernan, Rhys
    Dehzangi, Abdollah
    Lyons, James
    Paliwal, Kuldip
    Sharma, Alok
    Wang, Jihua
    Sattar, Abdul
    Zhou, Yaoqi
    Yang, Yuedong
    BIOINFORMATICS, 2016, 32 (06) : 843 - 849
  • [37] A Sequence-Based Computational Model for the Prediction of the Solvent Accessible Surface Area for α-Helix and β-Barrel Transmembrane Residues
    Wang, Chengqi
    Xi, Lili
    Li, Shuyan
    Liu, Huanxiang
    Yao, Xiaojun
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2012, 33 (01) : 11 - 17
  • [38] IntPred: a structure-based predictor of protein-protein interaction sites
    Northey, Thomas C.
    Baresic, Anja
    Martin, Andrew C. R.
    BIOINFORMATICS, 2018, 34 (02) : 223 - 229
  • [39] Prediction of protein binding sites using physical and chemical descriptors and the support vector machine regression method
    Sun Zhong-Hua
    Jiang Fan
    CHINESE PHYSICS B, 2010, 19 (11)
  • [40] iFrag: A Protein-Protein Interface Prediction Server Based on Sequence Fragments
    Garcia-Garcia, Javier
    Valls-Comamala, Victoria
    Guney, Emre
    Andreu, David
    Munoz, Francisco J.
    Fernandez-Fuentes, Narcis
    Oliva, Baldo
    JOURNAL OF MOLECULAR BIOLOGY, 2017, 429 (03) : 382 - 389