Sequence-based prediction of protein interaction sites with an integrative method

被引:117
|
作者
Chen, Xue-Wen [1 ,2 ]
Jeong, Jong Cheol [1 ]
机构
[1] Univ Kansas, Informat & Telecommun Technol Ctr, Bioinformat & Computat Life Sci Lab, Lawrence, KS 66045 USA
[2] Univ Kansas, Dept Comp Sci & Elect Engn, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
MOLECULAR CHAPERONE; SURFACE COMPLEMENTARITY; HYDROPHOBIC MOMENT; SUBSTRATE-BINDING; CRYSTAL-STRUCTURE; SOFT DOCKING; J-DOMAIN; RECOGNITION; CONSERVATION; MUTATIONS;
D O I
10.1093/bioinformatics/btp039
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Identification of protein interaction sites has significant impact on understanding protein function, elucidating signal transduction networks and drug design studies. With the exponentially growing protein sequence data, predictive methods using sequence information only for protein interaction site prediction have drawn increasing interest. In this article, we propose a predictive model for identifying protein interaction sites. Without using any structure data, the proposed method extracts a wide range of features from protein sequences. A random forest-based integrative model is developed to effectively utilize these features and to deal with the imbalanced data classification problem commonly encountered in binding site predictions. Results: We evaluate the predictive method using 2829 interface residues and 24 616 non-interface residues extracted from 99 polypeptide chains in the Protein Data Bank. The experimental results show that the proposed method performs significantly better than two other sequence-based predictive methods and can reliably predict residues involved in protein interaction sites. Furthermore, we apply the method to predict interaction sites and to construct three protein complexes: the DnaK molecular chaperone system, 1YUW and 1DKG, which provide new insight into the sequence function relationship. We show that the predicted interaction sites can be valuable as a first approach for guiding experimental methods investigating protein-protein interactions and localizing the specific interface residues.
引用
收藏
页码:585 / 591
页数:7
相关论文
共 50 条
  • [31] Sequence-Based Prediction of Transmembrane Protein Crystallization Propensity
    Qizhi Zhu
    Lihua Wang
    Ruyu Dai
    Wei Zhang
    Wending Tang
    Yannan Bin
    Zeliang Wang
    Junfeng Xia
    Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 693 - 702
  • [32] Sequence-Based Prediction of Transmembrane Protein Crystallization Propensity
    Zhu, Qizhi
    Wang, Lihua
    Dai, Ruyu
    Zhang, Wei
    Tang, Wending
    Bin, Yannan
    Wang, Zeliang
    Xia, Junfeng
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (04) : 693 - 702
  • [33] SOLpro: accurate sequence-based prediction of protein solubility
    Magnan, Christophe N.
    Randall, Arlo
    Baldi, Pierre
    BIOINFORMATICS, 2009, 25 (17) : 2200 - 2207
  • [34] Sequence-based prediction of protein binding mode landscapes
    Horvath, Attila
    Miskei, Marton
    Ambrusl, Viktor
    Vendruscolo, Michele
    Fuxreiter, Monika
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (05)
  • [35] TUnA: an uncertainty-aware transformer model for sequence-based protein-protein interaction prediction
    Ko, Young Su
    Parkinson, Jonathan
    Liu, Cong
    Wang, Wei
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (05)
  • [36] Human disease-gene classification with integrative sequence-based and topological features of protein-protein interaction networks
    Smalter, Aaron
    Lei, Seak Fei
    Chen, Xue-wen
    2007 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2007, : 209 - 214
  • [37] PCSPred&x005F;SC: Prediction of Protein Citrullination Sites Using an Effective Sequence-Based Combined Method
    Zhang, Lina
    Chen, Jingui
    Zhang, Chengjin
    Gao, Rui
    Yang, Runtao
    IEEE ACCESS, 2020, 8 : 88453 - 88463
  • [38] Human protein-protein interaction prediction by a novel sequence-based co-evolution method: co-evolutionary divergence
    Liu, Chia Hsin
    Li, Ker-Chau
    Yuan, Shinsheng
    BIOINFORMATICS, 2013, 29 (01) : 92 - 98
  • [39] Sequence-based prediction of protein–protein interaction using auto-feature engineering of RNN-based model
    Mewara B.
    Lalwani S.
    Research on Biomedical Engineering, 2023, 39 (01) : 259 - 272
  • [40] Recent developments of sequence-based prediction of protein-protein interactions
    Murakami, Yoichi
    Mizuguchi, Kenji
    BIOPHYSICAL REVIEWS, 2022, 14 (06) : 1393 - 1411