Sequence-based prediction of protein interaction sites with an integrative method

被引:117
|
作者
Chen, Xue-Wen [1 ,2 ]
Jeong, Jong Cheol [1 ]
机构
[1] Univ Kansas, Informat & Telecommun Technol Ctr, Bioinformat & Computat Life Sci Lab, Lawrence, KS 66045 USA
[2] Univ Kansas, Dept Comp Sci & Elect Engn, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
MOLECULAR CHAPERONE; SURFACE COMPLEMENTARITY; HYDROPHOBIC MOMENT; SUBSTRATE-BINDING; CRYSTAL-STRUCTURE; SOFT DOCKING; J-DOMAIN; RECOGNITION; CONSERVATION; MUTATIONS;
D O I
10.1093/bioinformatics/btp039
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Identification of protein interaction sites has significant impact on understanding protein function, elucidating signal transduction networks and drug design studies. With the exponentially growing protein sequence data, predictive methods using sequence information only for protein interaction site prediction have drawn increasing interest. In this article, we propose a predictive model for identifying protein interaction sites. Without using any structure data, the proposed method extracts a wide range of features from protein sequences. A random forest-based integrative model is developed to effectively utilize these features and to deal with the imbalanced data classification problem commonly encountered in binding site predictions. Results: We evaluate the predictive method using 2829 interface residues and 24 616 non-interface residues extracted from 99 polypeptide chains in the Protein Data Bank. The experimental results show that the proposed method performs significantly better than two other sequence-based predictive methods and can reliably predict residues involved in protein interaction sites. Furthermore, we apply the method to predict interaction sites and to construct three protein complexes: the DnaK molecular chaperone system, 1YUW and 1DKG, which provide new insight into the sequence function relationship. We show that the predicted interaction sites can be valuable as a first approach for guiding experimental methods investigating protein-protein interactions and localizing the specific interface residues.
引用
收藏
页码:585 / 591
页数:7
相关论文
共 50 条
  • [21] Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
    Daniele Raimondi
    Gabriele Orlando
    Rita Pancsa
    Taushif Khan
    Wim F. Vranken
    Scientific Reports, 7
  • [22] Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
    Raimondi, Daniele
    Orlando, Gabriele
    Pancsa, Rita
    Khan, Taushif
    Vranken, Wim F.
    SCIENTIFIC REPORTS, 2017, 7
  • [23] Sequence-based prediction of physicochemical interactions at protein functional sites using a function-and-interaction-annotated domain profile database
    Min Han
    Yifan Song
    Jiaqiang Qian
    Dengming Ming
    BMC Bioinformatics, 19
  • [24] Sequence-based prediction of physicochemical interactions at protein functional sites using a function-and-interaction-annotated domain profile database
    Han, Min
    Song, Yifan
    Qian, Jiaqiang
    Ming, Dengming
    BMC BIOINFORMATICS, 2018, 19
  • [25] Recent developments of sequence-based prediction of protein–protein interactions
    Yoichi Murakami
    Kenji Mizuguchi
    Biophysical Reviews, 2022, 14 : 1393 - 1411
  • [26] Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest
    Hou, Qingzhen
    De Geest, Paul F. G.
    Vranken, Wim F.
    Heringa, Jaap
    Feenstra, K. Anton
    BIOINFORMATICS, 2017, 33 (10) : 1479 - 1487
  • [27] Sequence-Based Prediction of Protein-Carbohydrate Binding Sites Using Support Vector Machines
    Taherzadeh, Ghazaleh
    Zhou, Yaoqi
    Liew, Alan Wee-Chung
    Yang, Yuedong
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (10) : 2115 - 2122
  • [28] A Comprehensive Comparative Review of Protein Sequence-Based Computational Prediction Models of Lysine Succinylation Sites
    Tasmia, Samme Amena
    Kibria, Md. Kaderi
    Islam, Md. Ariful
    Khatun, Mst Shamima
    Mollah, Md. Nurul Haque
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2022, 23 (11) : 744 - 756
  • [29] Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine
    Taherzadeh, Ghazaleh
    Yang, Yuedong
    Zhang, Tuo
    Liew, Alan Wee-Chung
    Zhou, Yaoqi
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (13) : 1223 - 1229
  • [30] Recent advances in sequence-based protein structure prediction
    Dukka, B. K. C.
    BRIEFINGS IN BIOINFORMATICS, 2017, 18 (06) : 1021 - 1032