A novel prediction method for protein DNA-binding residues based on neighboring residue correlations

被引:2
作者
Song, Jiazhi [1 ,2 ,3 ]
Liu, Guixia [1 ,3 ]
Jiang, Jingqing [2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Inner Mongolia Minzu Univ, Coll Comp Sci & Technol, Tongliao, Inner Mongolia, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Dept Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
关键词
Bioinformatics; protein; machine learning; binding sites; sequence information; INTEGRATING SEQUENCE; DOMAIN; SITES;
D O I
10.1080/13102818.2022.2122871
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Accurately identifying the protein DNA-binding residues is important for understanding the protein-DNA recognition mechanism and protein function annotation. Many computational methods have been proposed to predict protein-DNA binding residues using protein sequence information; however, for severe imbalanced data like the protein-DNA binding dataset, the under-sampling technique which is applied by most previous methods cannot achieve satisfactory performance. In this study, an adjustment algorithm is proposed to offset the biased prediction results from the classifier. The proposed adjustment algorithm uses the binding probability between the target residue and its neighboring residues to identify more true binding residues which are wrongly predicted as non-binding. The proposed prediction method with adjustment algorithm achieves an area under the ROC curve (AUC) of 0.926 and 0.866 on two benchmark datasets and 0.882 on the independent testing set, which demonstrates that the proposed method can efficiently predict specific residues for protein-DNA interactions.
引用
收藏
页码:865 / 877
页数:13
相关论文
共 50 条
[31]   Improving a Designed Photocontrolled DNA-Binding Protein [J].
Fan, Helen Y. ;
Morgan, Stacy-Anne ;
Brechun, Katherine E. ;
Chen, Yih-Yang ;
Jaikaran, Anna S. I. ;
Woolley, G. Andrew .
BIOCHEMISTRY, 2011, 50 (07) :1226-1237
[32]   DNA-binding mechanism and evolution of replication protein A [J].
Madru, Clement ;
Martinez-Carranza, Markel ;
Laurent, Sebastien ;
Alberti, Alessandra C. ;
Chevreuil, Maelenn ;
Raynal, Bertrand ;
Haouz, Ahmed ;
Le Meur, Remy A. ;
Delarue, Marc ;
Henneke, Ghislaine ;
Flament, Didier ;
Krupovic, Mart ;
Legrand, Pierre ;
Sauguet, Ludovic .
NATURE COMMUNICATIONS, 2023, 14 (01)
[33]   THE ROLE OF TYROSINE RESIDUES IN THE DNA-BINDING SITE OF THE PF1 GENE-5 PROTEIN [J].
PLYTE, SE ;
KNEALE, GG .
PROTEIN ENGINEERING, 1991, 4 (05) :553-560
[34]   DNAgenie: accurate prediction of DNA-type-specific binding residues in protein sequences [J].
Zhang, Jian ;
Ghadermarzi, Sina ;
Katuwawala, Akila ;
Kurgan, Lukasz .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
[35]   DNA-Binding Property of the Novel DNA-Binding Domain STPR in FMBP-1 of the Silkworm Bombyx mori [J].
Takiya, Shigeharu ;
Saito, Shin ;
Yokoyama, Takuya ;
Matsumoto, Daisuke ;
Aizawa, Tomoyasu ;
Kamiya, Masakatsu ;
Demura, Makoto ;
Kawano, Keiichi .
JOURNAL OF BIOCHEMISTRY, 2009, 146 (01) :103-111
[36]   Improving Sequence-Based Prediction of Protein Peptide Binding Residues by Introducing Intrinsic Disorder and a Consensus Method [J].
Zhao, Zijuan ;
Peng, Zhenling ;
Yang, Jianyi .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (07) :1459-1468
[37]   Recognition of different DNA sequences by a DNA-binding protein alters protein dynamics differentially [J].
Mondol, Tanumoy ;
Batabyal, Subrata ;
Mazumder, Abhishek ;
Roy, Siddhartha ;
Pal, Samir Kumar .
FEBS LETTERS, 2012, 586 (03) :258-262
[38]   A SVM-based Approach for Predicting DNA-binding Residues in Proteins from Amino Acid Sequences [J].
Ma, Xin ;
Wu, Jian-Sheng ;
Liu, Hong-De ;
Yang, Xi-Nan ;
Xie, Jian-Ming ;
Sun, Xiao .
2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, :225-229
[39]   Prediction of DNA-binding residues in proteins from amino acid sequences using a random forest model with a hybrid feature [J].
Wu, Jiansheng ;
Liu, Hongde ;
Duan, Xueye ;
Ding, Yan ;
Wu, Hongtao ;
Bai, Yunfei ;
Sun, Xiao .
BIOINFORMATICS, 2009, 25 (01) :30-35
[40]   ProteDNA: a sequence-based predictor of sequence-specific DNA-binding residues in transcription factors [J].
Chu, Wen-Yi ;
Huang, Yu-Feng ;
Huang, Chun-Chin ;
Cheng, Yi-Sheng ;
Huang, Chien-Kang ;
Oyang, Yen-Jen .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W396-W401