Prediction of protein-binding residues: dichotomy of sequence-based methods developed using structured complexes versus disordered proteins

被引：20

作者：

Zhang, Jian ^{[1
]}

Ghadermarzi, Sina ^{[2
]}

Kurgan, Lukasz ^{[2
]}

机构：

[1] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Peoples R China

[2] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA

来源：

BIOINFORMATICS | 2020年 / 36卷 / 18期

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

MOLECULAR RECOGNITION FEATURES; INTRINSIC DISORDER; INTERACTION SITES; COMPUTATIONAL PREDICTION; MORFS; IDENTIFICATION; REGIONS; RNA; DNA; SERVER;

D O I：

10.1093/bioinformatics/btaa573

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Motivation: There are over 30 sequence-based predictors of the protein-binding residues (PBRs). They use either structure-annotated or disorder-annotated training datasets, potentially creating a dichotomy where the structure-/disorder-specific models may not be able to cross-over to accurately predict the other type. Moreover, the structure-trained predictors were shown to substantially cross-predict PBRs among residues that interact with non-protein partners (nucleic acids and small ligands). We address these issues by performing first-of-its-kind comparative study of a representative collection of disorder- and structure-trained predictors using a comprehensive benchmark set with the structure- and disorder-derived annotations of PBRs (to analyze the cross-over) and the protein-, nucleic acid- and small ligand-binding proteins (to study the cross-predictions). Results: Three predictors provide accurate results: SCRIBER, ANCHOR and disoRDPbind. Some of the structure-trained methods make accurate predictions on the structure-annotated proteins. Similarly, the disorder-trained predictors predict well on the disorder-annotated proteins. However, the considered predictors generally fail to crossover, with the exception of SCRIBER. Our study also reveals that virtually all methods substantially cross-predict PBRs, except for SCRIBER for the structure-annotated proteins and disoRDPbind for the disorder-annotated proteins. We formulate a novel hybrid predictor, hybridPBRpred, that combines results produced by disoRDPbind and SCRIBER to accurately predict disorder- and structure-annotated PBRs. HybridPBRpred generates accurate results that cross-over structure- and disorder-annotated proteins and produces relatively low amount of cross-predictions, offering an accurate alternative to predict PBRs.

引用

页码：4729 / 4738

页数：10

共 95 条

[1] Improving the prediction of yeast protein function using weighted protein-protein interactions [J].

Ahmed, Khaled S. ;

Saloma, Nahed H. ;

Kadah, Yasser M. .

THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2011, 8

[2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].

Altschul, SF ;

Madden, TL ;

Schaffer, AA ;

Zhang, JH ;

Zhang, Z ;

Miller, W ;

Lipman, DJ .

NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402

[3] Binding Site Prediction for Protein-Protein Interactions and Novel Motif Discovery using Re-occurring Polypeptide Sequences [J].

Amos-Binks, Adam ;

Patulea, Catalin ;

Pitre, Sylvain ;

Schoenrock, Andrew ;

Gui, Yuan ;

Green, James R. ;

Golshani, Ashkan ;

Dehne, Frank .

BMC BIOINFORMATICS, 2011, 12

[4]

Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]

[5]

Athanasios A, 2017, CURR DRUG METAB, V18, P5, DOI [10.2174/1389200217666161102150602, 10.2174/138920021801170119204832]

[6] Algorithmic approaches to protein-protein interaction site prediction [J].

Aumentado-Armstrong, Tristan T. ;

Istrate, Bogdan ;

Murgita, Robert A. .

ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10

[7] The Protein Data Bank [J].

Berman, HM ;

Westbrook, J ;

Feng, Z ;

Gilliland, G ;

Bhat, TN ;

Weissig, H ;

Shindyalov, IN ;

Bourne, PE .

NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242

[8]

Burley SK, 2017, METHODS MOL BIOL, V1606, P627, DOI 10.1007/978-1-4939-7000-1_26

[9] mentha: a resource for browsing integrated protein-interaction networks [J].

Calderone, Alberto ;

Castagnoli, Luisa ;

Cesareni, Gianni .

NATURE METHODS, 2013, 10 (08) :690-691

[10] Sequence-based identification of interface residues by an integrative profile combining hydrophobic and evolutionary information [J].

Chen, Peng ;

Li, Jinyan .

BMC BIOINFORMATICS, 2010, 11

← 1 2 3 4 5 6 7 8 9 10 →