DeepDISE: DNA Binding Site Prediction Using a Deep Learning Method

被引:10
作者
Hendrix, Samuel Godfrey [1 ]
Chang, Kuan Y. [2 ]
Ryu, Zeezoo [1 ,3 ]
Xie, Zhong-Ru [1 ]
机构
[1] Univ Georgia, Coll Engn, Sch Elect & Comp Engn, Computat Drug Discovery Lab, Athens, GA 30602 USA
[2] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung 202, Taiwan
[3] Univ Georgia, Dept Comp Sci, Franklin Coll Arts & Sci, Athens, GA 30602 USA
关键词
deep learning; protein-DNA interaction; binding site prediction; drug design; convolutional neural network; proteome; systems biology; PROTEINS; RESIDUES; CAVITIES; SEQUENCE; IDENTIFICATION; ALGORITHM; SURFACE; SERVER; MODEL; TOOL;
D O I
10.3390/ijms22115510
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It is essential for future research to develop a new, reliable prediction method of DNA binding sites because DNA binding sites on DNA-binding proteins provide critical clues about protein function and drug discovery. However, the current prediction methods of DNA binding sites have relatively poor accuracy. Using 3D coordinates and the atom-type of surface protein atom as the input, we trained and tested a deep learning model to predict how likely a voxel on the protein surface is to be a DNA-binding site. Based on three different evaluation datasets, the results show that our model not only outperforms several previous methods on two commonly used datasets, but also demonstrates its robust performance to be consistent among the three datasets. The visualized prediction outcomes show that the binding sites are also mostly located in correct regions. We successfully built a deep learning model to predict the DNA binding sites on target proteins. It demonstrates that 3D protein structures plus atom-type information on protein surfaces can be used to predict the potential binding sites on a protein. This approach should be further extended to develop the binding sites of other important biological molecules.
引用
收藏
页数:13
相关论文
共 52 条
[21]   Predicting DNA-binding sites of proteins based on sequential and 3D structural information [J].
Li, Bi-Qing ;
Feng, Kai-Yan ;
Ding, Juan ;
Cai, Yu-Dong .
MOLECULAR GENETICS AND GENOMICS, 2014, 289 (03) :489-499
[22]   PreDNA: accurate prediction of DNA-binding sites in proteins by integrating sequence and geometric structure information [J].
Li, Tao ;
Li, Qian-Zhong ;
Liu, Shuai ;
Fan, Guo-Liang ;
Zuo, Yong-Chun ;
Peng, Yong .
BIOINFORMATICS, 2013, 29 (06) :678-685
[23]   Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand design [J].
Liang, J ;
Edelsbrunner, H ;
Woodward, C .
PROTEIN SCIENCE, 1998, 7 (09) :1884-1897
[24]   Protein binding site prediction using an empirical scoring function [J].
Liang, Shide ;
Zhang, Chi ;
Liu, Song ;
Zhou, Yaoqi .
NUCLEIC ACIDS RESEARCH, 2006, 34 (13) :3698-3707
[25]   DNABind: A hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning- and template-based approaches [J].
Liu, Rong ;
Hu, Jianjun .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (11) :1885-1899
[26]   A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs [J].
Miao, Zhichao ;
Westhof, Eric .
PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (12)
[27]   On the nature of cavities on protein surfaces: Application to the identification of drug-binding sites [J].
Nayal, Murad ;
Honig, Barry .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 63 (04) :892-906
[28]   Prediction of DNA-binding residues from sequence [J].
Ofran, Yanay ;
Mysore, Venkatesh ;
Rost, Burkhard .
BIOINFORMATICS, 2007, 23 (13) :I347-I353
[29]   DNABINDPROT: fluctuation-based predictor of DNA-binding residues within a network of interacting residues [J].
Ozbek, Pemra ;
Soner, Seren ;
Erman, Burak ;
Haliloglu, Turkan .
NUCLEIC ACIDS RESEARCH, 2010, 38 :W417-W423
[30]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241