Motivation: There are over 30 sequence-based predictors of the protein-binding residues (PBRs). They use either structure-annotated or disorder-annotated training datasets, potentially creating a dichotomy where the structure-/disorder-specific models may not be able to cross-over to accurately predict the other type. Moreover, the structure-trained predictors were shown to substantially cross-predict PBRs among residues that interact with non-protein partners (nucleic acids and small ligands). We address these issues by performing first-of-its-kind comparative study of a representative collection of disorder- and structure-trained predictors using a comprehensive benchmark set with the structure- and disorder-derived annotations of PBRs (to analyze the cross-over) and the protein-, nucleic acid- and small ligand-binding proteins (to study the cross-predictions). Results: Three predictors provide accurate results: SCRIBER, ANCHOR and disoRDPbind. Some of the structure-trained methods make accurate predictions on the structure-annotated proteins. Similarly, the disorder-trained predictors predict well on the disorder-annotated proteins. However, the considered predictors generally fail to crossover, with the exception of SCRIBER. Our study also reveals that virtually all methods substantially cross-predict PBRs, except for SCRIBER for the structure-annotated proteins and disoRDPbind for the disorder-annotated proteins. We formulate a novel hybrid predictor, hybridPBRpred, that combines results produced by disoRDPbind and SCRIBER to accurately predict disorder- and structure-annotated PBRs. HybridPBRpred generates accurate results that cross-over structure- and disorder-annotated proteins and produces relatively low amount of cross-predictions, offering an accurate alternative to predict PBRs.
机构:
Nanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R ChinaNanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R China
Ma, Xin
Sun, Xiao
论文数: 0引用数: 0
h-index: 0
机构:
Southeast Univ, State Key Lab Bioelect, Nanjing 210096, Jiangsu, Peoples R ChinaNanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R China
机构:
Nanjing Audit Univ, Dept Elementary Courses, Golden Audit Coll, Nanjing 210029, Peoples R ChinaSoutheast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
Ma, Xin
Guo, Jing
论文数: 0引用数: 0
h-index: 0
机构:Southeast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
Guo, Jing
Wu, Jiansheng
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Univ Posts & Telecommun, Dept Bioinformat, Sch Geog & Biol Informat, Nanjing 210046, Peoples R ChinaSoutheast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
Wu, Jiansheng
Liu, Hongde
论文数: 0引用数: 0
h-index: 0
机构:Southeast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
Liu, Hongde
Yu, Jiafeng
论文数: 0引用数: 0
h-index: 0
机构:Southeast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
Yu, Jiafeng
Xie, Jianming
论文数: 0引用数: 0
h-index: 0
机构:Southeast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
Xie, Jianming
Sun, Xiao
论文数: 0引用数: 0
h-index: 0
机构:
Southeast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R ChinaSoutheast Univ, State Key Lab Bioelect, Sch Biol Sci & Med Engn, Nanjing 210096, Peoples R China
机构:
Iowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
Iowa State Univ, Dept Comp Sci, Ames, IA USAIowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
Walia, Rasna R.
Xue, Li C.
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USAIowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
Xue, Li C.
Wilkins, Katherine
论文数: 0引用数: 0
h-index: 0
机构:
Cornell Univ, Dept Plant Pathol & Plant Microbe Biol, Ithaca, NY USA
Cornell Univ, Grad Field Computat Biol, Ithaca, NY USAIowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
Wilkins, Katherine
El-Manzalawy, Yasser
论文数: 0引用数: 0
h-index: 0
机构:
Al Azhar Univ, Dept Syst & Comp Engn, Cairo, EgyptIowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
El-Manzalawy, Yasser
Dobbs, Drena
论文数: 0引用数: 0
h-index: 0
机构:
Iowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
Iowa State Univ, Dept Genet Dev & Cell Biol, Ames, IA USAIowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA
Dobbs, Drena
Honavar, Vasant
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
Penn State Univ, Bioinformat & Genom Grad Program, University Pk, PA 16802 USA
Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USAIowa State Univ, Bioinformat & Computat Biol Program, Ames, IA 50011 USA