DeepDISE: DNA Binding Site Prediction Using a Deep Learning Method

Cited by: 9
Authors
Hendrix, Samuel Godfrey [1]
Chang, Kuan Y. [2]
Ryu, Zeezoo [1, 3]
Xie, Zhong-Ru [1]
Affiliations
[1] Univ Georgia, Coll Engn, Sch Elect & Comp Engn, Computat Drug Discovery Lab, Athens, GA 30602 USA
[2] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung 202, Taiwan
[3] Univ Georgia, Dept Comp Sci, Franklin Coll Arts & Sci, Athens, GA 30602 USA
Keywords
deep learning; protein-DNA interaction; binding site prediction; drug design; convolutional neural network; proteome; systems biology; PROTEINS; RESIDUES; CAVITIES; SEQUENCE; IDENTIFICATION; ALGORITHM; SURFACE; SERVER; MODEL; TOOL;
DOI
10.3390/ijms22115510
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
A new, reliable method for predicting DNA binding sites is essential for future research, because the binding sites on DNA-binding proteins provide critical clues about protein function and drug discovery. However, current methods for predicting DNA binding sites have relatively poor accuracy. Using the 3D coordinates and atom types of protein surface atoms as input, we trained and tested a deep learning model that predicts how likely a voxel on the protein surface is to be a DNA-binding site. Across three evaluation datasets, the model not only outperforms several previous methods on two commonly used datasets but also performs consistently on all three. Visualization of the predictions shows that the predicted binding sites are mostly located in the correct regions. We thus built a deep learning model that predicts the DNA binding sites on target proteins, demonstrating that 3D protein structures plus atom-type information on protein surfaces can be used to predict potential binding sites on a protein. This approach should be further extended to predict the binding sites of other important biological molecules.
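As an illustration of the input described in the abstract (3D coordinates plus atom types mapped onto voxels), the following is a minimal sketch of how atoms might be placed on a voxel grid with one channel per atom type. It is not the authors' pipeline: the function name `voxelize`, the atom-type vocabulary, the grid size, and the resolution are all assumptions chosen for the example, since the abstract does not specify them.

```python
import numpy as np

# Hypothetical atom-type vocabulary; the abstract does not list the types used.
ATOM_TYPES = ("C", "N", "O", "S")


def voxelize(coords, atom_types, grid_size=16, resolution=1.0):
    """Map atoms onto an occupancy grid with one channel per atom type.

    coords      -- sequence of (x, y, z) positions, in the same unit as `resolution`
    atom_types  -- sequence of type labels, one per atom
    Returns an array of shape (len(ATOM_TYPES), grid_size, grid_size, grid_size).
    """
    coords = np.asarray(coords, dtype=float)
    grid = np.zeros((len(ATOM_TYPES),) + (grid_size,) * 3, dtype=np.float32)
    if coords.size == 0:
        return grid

    # Centre the grid on the centroid of the atoms (an assumed convention).
    origin = coords.mean(axis=0) - grid_size * resolution / 2.0
    indices = np.floor((coords - origin) / resolution).astype(int)

    for (i, j, k), label in zip(indices, atom_types):
        inside = all(0 <= v < grid_size for v in (i, j, k))
        # Atoms of unknown type or outside the grid are skipped.
        if label in ATOM_TYPES and inside:
            grid[ATOM_TYPES.index(label), i, j, k] = 1.0
    return grid


# Toy example: three atoms of different types.
example_grid = voxelize(
    [[0.0, 0.0, 0.0], [1.5, 0.0, 0.0], [0.0, 2.5, 0.0]],
    ["C", "N", "O"],
)
```

A grid of this kind is the sort of tensor a 3D convolutional network could take as input to score voxels; how the published model actually encodes atoms and labels voxels would need to be taken from the paper itself.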
Pages: 13