CNNsite: Prediction of DNA-binding Residues in Proteins Using Convolutional Neural Network with Sequence Features

被引:0
|
作者
Zhou, Jiyun [1 ,2 ]
Lu, Qin [2 ]
Xu, Ruifeng [1 ]
Gui, Lin [1 ]
Wang, Hongpeng [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
EFFICIENT PREDICTION; ACCURATE PREDICTION; SITES;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Protein-DNA complexes play crucial roles in gene regulation. The prediction of the residues involved in protein-DNA interactions is critical for understanding gene regulation. Although many methods have been proposed, most of them overlooked motif features. Motif features are sub sequences and are important for the recognition between a protein and DNA. In order to efficiently use motif features for the prediction of DNA-binding residues, we first apply the Convolutional Neural Network (CNN) method to capture the motif features from the sequences around the target residues. CNN modeling consists of a set of learnable motif detectors that can capture the important motif features by scanning the sequences around the target residues. Then we use a neural network classifier, referred to as CNNsite, by combining the captured motif features, sequence features and evolutionary features to predict binding residues from sequences. The datasets PDNA-62 and PDNA-224 are used to evaluate the performance of CNNsite by five-fold cross-validation. Performance evaluation shows that the motif features performs better than sequence features and evolutionary features with at least 6.73% on ST, 0.097 on MCC and 0.069 on AUC. When comparing with previously published methods, CNNsite performs better with at least 0.019 on MCC, 4.37% on ST and 0.040 on AUC. CNNsite is also evaluated on an independent dataset TS-72 and CNNsite outperforms the previous methods by at least 0.012 on AUC. The discriminant powers of the motif features of size from 2 to 6 residues show that many motif features with large discriminant power are composed by the residues that play important roles in the DNA-protein interactions. The standalone version of the CNNsite is available at http://hlt.hitsz.edu.cn:8080/CNNsite/.
引用
收藏
页码:78 / 85
页数:8
相关论文
共 50 条
  • [1] Prediction of DNA-binding residues from sequence information using convolutional neural network
    Zhou, Jiyun
    Lu, Qin
    Xu, Ruifeng
    Gui, Lin
    Wang, Hongpeng
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (02) : 132 - 152
  • [2] Hybrid_DBP: Prediction of DNA-binding proteins using hybrid features and convolutional neural networks
    Yu, Shaoyou
    Peng, Dejun
    Zhu, Wen
    Liao, Bo
    Wang, Peng
    Yang, Dongxuan
    Wu, Fangxiang
    FRONTIERS IN PHARMACOLOGY, 2022, 13
  • [3] Prediction of DNA-binding residues from sequence
    Ofran, Yanay
    Mysore, Venkatesh
    Rost, Burkhard
    BIOINFORMATICS, 2007, 23 (13) : I347 - I353
  • [4] DP-Bind: a Web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins
    Hwang, Seungwoo
    Gou, Zhenkun
    Kuznetsov, Igor B.
    BIOINFORMATICS, 2007, 23 (05) : 634 - 636
  • [5] Structure based prediction of binding residues on DNA-binding proteins
    Bhardwaj, Nitin
    Langlois, Robert E.
    Hui, Guijun Zhao
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2611 - 2614
  • [6] Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
    Ahmad, S
    Gromiha, MM
    Sarai, A
    BIOINFORMATICS, 2004, 20 (04) : 477 - 486
  • [7] INTERACT-O-FINDER: A Tool for Prediction of DNA-Binding Proteins Using Sequence Features
    Samant, Monika
    Jethva, Minesh
    Hasija, Yasha
    INTERNATIONAL JOURNAL OF PEPTIDE RESEARCH AND THERAPEUTICS, 2015, 21 (02) : 189 - 193
  • [8] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [9] INTERACT-O-FINDER: A Tool for Prediction of DNA-Binding Proteins Using Sequence Features
    Monika Samant
    Minesh Jethva
    Yasha Hasija
    International Journal of Peptide Research and Therapeutics, 2015, 21 : 189 - 193
  • [10] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775