Sequence-based prediction of protein interaction sites with an integrative method

被引:117
|
作者
Chen, Xue-Wen [1 ,2 ]
Jeong, Jong Cheol [1 ]
机构
[1] Univ Kansas, Informat & Telecommun Technol Ctr, Bioinformat & Computat Life Sci Lab, Lawrence, KS 66045 USA
[2] Univ Kansas, Dept Comp Sci & Elect Engn, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
MOLECULAR CHAPERONE; SURFACE COMPLEMENTARITY; HYDROPHOBIC MOMENT; SUBSTRATE-BINDING; CRYSTAL-STRUCTURE; SOFT DOCKING; J-DOMAIN; RECOGNITION; CONSERVATION; MUTATIONS;
D O I
10.1093/bioinformatics/btp039
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Identification of protein interaction sites has significant impact on understanding protein function, elucidating signal transduction networks and drug design studies. With the exponentially growing protein sequence data, predictive methods using sequence information only for protein interaction site prediction have drawn increasing interest. In this article, we propose a predictive model for identifying protein interaction sites. Without using any structure data, the proposed method extracts a wide range of features from protein sequences. A random forest-based integrative model is developed to effectively utilize these features and to deal with the imbalanced data classification problem commonly encountered in binding site predictions. Results: We evaluate the predictive method using 2829 interface residues and 24 616 non-interface residues extracted from 99 polypeptide chains in the Protein Data Bank. The experimental results show that the proposed method performs significantly better than two other sequence-based predictive methods and can reliably predict residues involved in protein interaction sites. Furthermore, we apply the method to predict interaction sites and to construct three protein complexes: the DnaK molecular chaperone system, 1YUW and 1DKG, which provide new insight into the sequence function relationship. We show that the predicted interaction sites can be valuable as a first approach for guiding experimental methods investigating protein-protein interactions and localizing the specific interface residues.
引用
收藏
页码:585 / 591
页数:7
相关论文
共 50 条
  • [1] SeqTMPPI: Sequence-Based Transmembrane Protein Interaction Prediction
    Wang, Han
    Jiang, Jiuhong
    Chen, Qiufen
    Zhang, Chunhua
    Lu, Chang
    Ma, Zhiqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 96 - 99
  • [2] Sequence-based prediction of protein-protein interaction sites with L1-logreg classifier
    Dhole, Kaustubh
    Singh, Gurdeep
    Pai, Priyadarshini P.
    Mondal, Sukanta
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 348 : 47 - 54
  • [3] Sequence-based prediction of protein-protein interaction sites by simplified long short-term memory network
    Zhang, Buzhong
    Li, Jinyan
    Quan, Lijun
    Chen, Yu
    Lu, Qiang
    NEUROCOMPUTING, 2019, 357 : 86 - 100
  • [4] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    CURRENT GENOMICS, 2020, 21 (06) : 454 - 463
  • [5] Sequence-based prediction of protein domains
    Liu, JF
    Rost, B
    NUCLEIC ACIDS RESEARCH, 2004, 32 (12) : 3522 - 3530
  • [6] Sequence-Based Prediction of Protein Solubility
    Agostini, Federico
    Vendruscolo, Michele
    Tartaglia, Gian Gaetano
    JOURNAL OF MOLECULAR BIOLOGY, 2012, 421 (2-3) : 237 - 241
  • [7] Cracking the black box of deep sequence-based protein-protein interaction prediction
    Bernett, Judith
    Blumenthal, David B.
    List, Markus
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [8] Sequence-based protein-protein interaction prediction via support vector machine
    Yongcui Wang
    Jiguang Wang
    Zhixia Yang
    Naiyang Deng
    Journal of Systems Science and Complexity, 2010, 23 : 1012 - 1023
  • [9] DeNovo: virus-host sequence-based protein-protein interaction prediction
    Eid, Fatma-Elzahraa
    ElHefnawi, Mahmoud
    Heath, Lenwood S.
    BIOINFORMATICS, 2016, 32 (08) : 1144 - 1150
  • [10] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 18