A novel prediction method for protein DNA-binding residues based on neighboring residue correlations

Cited by: 1
Authors
Song, Jiazhi [1, 2, 3]
Liu, Guixia [1, 3]
Jiang, Jingqing [2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Inner Mongolia Minzu Univ, Coll Comp Sci & Technol, Tongliao, Inner Mongolia, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Dept Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
Keywords
Bioinformatics; protein; machine learning; binding sites; sequence information; INTEGRATING SEQUENCE; DOMAIN; SITES
DOI
10.1080/13102818.2022.2122871
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Accurately identifying protein DNA-binding residues is important for understanding the protein-DNA recognition mechanism and for protein function annotation. Many computational methods have been proposed to predict protein-DNA binding residues from protein sequence information; however, for severely imbalanced data such as protein-DNA binding datasets, the under-sampling technique applied by most previous methods cannot achieve satisfactory performance. In this study, an adjustment algorithm is proposed to offset the biased prediction results of the classifier. The algorithm uses the binding probability between the target residue and its neighboring residues to recover true binding residues that were wrongly predicted as non-binding. The proposed prediction method with the adjustment algorithm achieves an area under the ROC curve (AUC) of 0.926 and 0.866 on two benchmark datasets and 0.882 on the independent testing set, which demonstrates that the method can effectively predict the specific residues involved in protein-DNA interactions.
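The abstract does not give the details of the adjustment algorithm, so the following is only an illustrative sketch of the general idea: a residue that the classifier scores just below the decision threshold may be re-labelled as binding when its sequence neighbors score highly. The function name `adjust_scores`, the parameters `window`, `neighbor_weight` and `threshold`, and the blending rule itself are all assumptions made for this example and are not taken from the paper.

```python
from typing import List


def adjust_scores(scores: List[float],
                  threshold: float = 0.5,
                  window: int = 1,
                  neighbor_weight: float = 0.3) -> List[float]:
    """Hypothetical neighbor-based adjustment (NOT the authors' algorithm).

    For each residue scored below `threshold`, blend its score with the mean
    score of its sequence neighbors within `window` positions. Scores are
    only ever raised, never lowered, because the goal is to recover binding
    residues missed by a classifier biased toward the majority class.
    """
    n = len(scores)
    adjusted = []
    for i, s in enumerate(scores):
        if s < threshold:
            lo = max(0, i - window)
            hi = min(n, i + window + 1)
            # Neighbors are read from the ORIGINAL scores, so the result
            # does not depend on the order in which residues are processed.
            neighbors = [scores[j] for j in range(lo, hi) if j != i]
            if neighbors:
                mean_nb = sum(neighbors) / len(neighbors)
                blended = (1 - neighbor_weight) * s + neighbor_weight * mean_nb
                s = max(s, blended)
        adjusted.append(s)
    return adjusted


def to_labels(scores: List[float], threshold: float = 0.5) -> List[bool]:
    """Convert per-residue scores to binding / non-binding labels."""
    return [s >= threshold for s in scores]


# Made-up per-residue scores for a five-residue fragment.
raw = [0.9, 0.45, 0.9, 0.1, 0.1]
adjusted = adjust_scores(raw)
labels_before = to_labels(raw)
labels_after = to_labels(adjusted)
```

In this toy input the second residue sits just under the threshold between two confidently predicted binding residues, so the blend lifts it over 0.5, while the two low-scoring residues at the end stay non-binding. A real implementation would estimate the neighbor binding probabilities from training data rather than use a fixed weight.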
Pages: 865-877
Page count: 13
Related papers
50 in total
  • [1] A Novel Sequence-Based Method of Predicting Protein DNA-Binding Residues, Using a Machine Learning Approach
    Cai, Yudong
    He, ZhiSong
    Shi, Xiaohe
    Kong, Xiangying
    Gu, Lei
    Xie, Lu
    MOLECULES AND CELLS, 2010, 30 (02) : 99 - 105
  • [2] An accurate feature-based method for identifying DNA-binding residues on protein surfaces
    Xiong, Yi
    Liu, Juan
    Wei, Dong-Qing
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (02) : 509 - 517
  • [3] Prediction of DNA-Binding Residues in Local Segments of Protein Sequences with Fuzzy Cognitive Maps
    Amirkhani, Abdollah
    Kolahdoozi, Mojtaba
    Wang, Chen
    Kurgan, Lukasz A.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1372 - 1382
  • [4] Prediction of DNA-binding residues from protein sequence information using random forests
    Wang, Liangjiang
    Yang, Mary Qu
    Yang, Jack Y.
    BMC GENOMICS, 2009, 10
  • [5] Shape string: A new feature for prediction of DNA-binding residues
    Wang, Duo-Duo
    Li, Tong-Hua
    Sun, Jiang-Ming
    Li, Da-Peng
    Xiong, Wen-Wei
    Wang, Wen-Yan
    Tang, Sheng-Nan
    BIOCHIMIE, 2013, 95 (02) : 354 - 358
  • [6] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, I. B.
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [7] StackDPPred: a stacking based prediction of DNA-binding protein from sequence
    Mishra, Avdesh
    Pokhrel, Pujan
    Hoque, Md Tamjidul
    BIOINFORMATICS, 2019, 35 (03) : 433 - 441
  • [8] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775
  • [9] A deep learning-based method for the prediction of DNA interacting residues in a protein
    Patiyal, Sumeet
    Dhall, Anjali
    Raghava, Gajendra P. S.
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [10] Prediction of DNA-binding residues from sequence information using convolutional neural network
    Zhou, Jiyun
    Lu, Qin
    Xu, Ruifeng
    Gui, Lin
    Wang, Hongpeng
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (02) : 132 - 152