Prediction of protein-binding residues: dichotomy of sequence-based methods developed using structured complexes versus disordered proteins

被引:18
|
作者
Zhang, Jian [1 ]
Ghadermarzi, Sina [2 ]
Kurgan, Lukasz [2 ]
机构
[1] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Peoples R China
[2] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
MOLECULAR RECOGNITION FEATURES; INTRINSIC DISORDER; INTERACTION SITES; COMPUTATIONAL PREDICTION; MORFS; IDENTIFICATION; REGIONS; RNA; DNA; SERVER;
D O I
10.1093/bioinformatics/btaa573
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: There are over 30 sequence-based predictors of the protein-binding residues (PBRs). They use either structure-annotated or disorder-annotated training datasets, potentially creating a dichotomy where the structure-/disorder-specific models may not be able to cross-over to accurately predict the other type. Moreover, the structure-trained predictors were shown to substantially cross-predict PBRs among residues that interact with non-protein partners (nucleic acids and small ligands). We address these issues by performing first-of-its-kind comparative study of a representative collection of disorder- and structure-trained predictors using a comprehensive benchmark set with the structure- and disorder-derived annotations of PBRs (to analyze the cross-over) and the protein-, nucleic acid- and small ligand-binding proteins (to study the cross-predictions). Results: Three predictors provide accurate results: SCRIBER, ANCHOR and disoRDPbind. Some of the structure-trained methods make accurate predictions on the structure-annotated proteins. Similarly, the disorder-trained predictors predict well on the disorder-annotated proteins. However, the considered predictors generally fail to crossover, with the exception of SCRIBER. Our study also reveals that virtually all methods substantially cross-predict PBRs, except for SCRIBER for the structure-annotated proteins and disoRDPbind for the disorder-annotated proteins. We formulate a novel hybrid predictor, hybridPBRpred, that combines results produced by disoRDPbind and SCRIBER to accurately predict disorder- and structure-annotated PBRs. HybridPBRpred generates accurate results that cross-over structure- and disorder-annotated proteins and produces relatively low amount of cross-predictions, offering an accurate alternative to predict PBRs.
引用
收藏
页码:4729 / 4738
页数:10
相关论文
共 49 条
  • [1] HybridDBRpred: improved sequence-based prediction of DNA-binding amino acids using annotations from structured complexes and disordered proteins
    Zhang, Jian
    Basu, Sushmita
    Kurgan, Lukasz
    NUCLEIC ACIDS RESEARCH, 2023,
  • [2] Review and comparative assessment of sequence-based predictors of protein-binding residues
    Zhang, Jian
    Kurgan, Lukasz
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (05) : 821 - 837
  • [3] SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
    Zhang, Jian
    Kurgan, Lukasz
    BIOINFORMATICS, 2019, 35 (14) : I343 - I353
  • [4] DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning
    Zhang, Fuhao
    Zhao, Bi
    Shi, Wenbo
    Li, Min
    Kurgan, Lukasz
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [5] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775
  • [6] CoMemMoRFPred: Sequence-based Prediction of MemMoRFs by Combining Predictors of Intrinsic Disorder, MoRFs and Disordered Lipid-binding Regions
    Basu, Sushmita
    Hegedus, Tamas
    Kurgan, Lukasz
    JOURNAL OF MOLECULAR BIOLOGY, 2023, 435 (21)
  • [7] PreDisorder: ab initio sequence-based prediction of protein disordered regions
    Deng, Xin
    Eickholt, Jesse
    Cheng, Jianlin
    BMC BIOINFORMATICS, 2009, 10
  • [8] Sequence-Based Prediction of Protein-Carbohydrate Binding Sites Using Support Vector Machines
    Taherzadeh, Ghazaleh
    Zhou, Yaoqi
    Liew, Alan Wee-Chung
    Yang, Yuedong
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (10) : 2115 - 2122
  • [9] Sequence-Based Prediction of microRNA-Binding Residues in Proteins Using Cost-Sensitive Laplacian Support Vector Machines
    Wu, Jian-Sheng
    Zhou, Zhi-Hua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (03) : 752 - 759
  • [10] Improving Sequence-Based Prediction of Protein Peptide Binding Residues by Introducing Intrinsic Disorder and a Consensus Method
    Zhao, Zijuan
    Peng, Zhenling
    Yang, Jianyi
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (07) : 1459 - 1468