DEEPCON: protein contact prediction using dilated convolutional neural networks with dropout

被引:31
作者
Adhikari, Badri [1 ]
机构
[1] Univ Missouri, Dept Math & Comp Sci, St Louis, MO 63121 USA
关键词
RESIDUE-RESIDUE CONTACTS;
D O I
10.1093/bioinformatics/btz593
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Exciting new opportunities have arisen to solve the protein contact prediction problem from the progress in neural networks and the availability of a large number of homologous sequences through high-throughput sequencing. In this work, we study how deep convolutional neural networks (ConvNets) may be best designed and developed to solve this long-standing problem. Results: With publicly available datasets, we designed and trained various ConvNet architectures. We tested several recent deep learning techniques including wide residual networks, dropouts and dilated convolutions. We studied the improvements in the precision of medium-range and long-range contacts, and compared the performance of our best architectures with the ones used in existing state-of-the-art methods. The proposed ConvNet architectures predict contacts with significantly more precision than the architectures used in several state-of-the-art methods. When trained using the DeepCov dataset consisting of 3456 proteins and tested on PSICOV dataset of 150 proteins, our architectures achieve up to 15% higher precision when L/2 long-range contacts are evaluated. Similarly, when trained using the DNCON2 dataset consisting of 1426 proteins and tested on 84 protein domains in the CASP12 dataset, our single network achieves 4.8% higher precision than the ensembled DNCON2 method when top L long-range contacts are evaluated.
引用
收藏
页码:470 / 477
页数:8
相关论文
共 29 条
  • [1] Abadi M., 2016, TENSORFLOW LARGE SCA
  • [2] DNCON2: improved protein contact prediction using two-level deep convolutional neural networks
    Adhikari, Badri
    Hou, Jie
    Cheng, Jianlin
    [J]. BIOINFORMATICS, 2018, 34 (09) : 1466 - 1472
  • [3] CONFOLD2: improved contact-driven ab initio protein structure modeling
    Adhikari, Badri
    Cheng, Jianlin
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [4] End-to-End Differentiable Learning of Protein Structure
    AlQuraishi, Mohammed
    [J]. CELL SYSTEMS, 2019, 8 (04) : 292 - +
  • [5] [Anonymous], 2016, BMVC
  • [6] [Anonymous], BIORXIV
  • [7] [Anonymous], NIPS
  • [8] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [9] Predicting protein residue-residue contacts using deep networks and boosting
    Eickholt, Jesse
    Cheng, Jianlin
    [J]. BIOINFORMATICS, 2012, 28 (23) : 3066 - 3072
  • [10] HMMER web server: interactive sequence similarity searching
    Finn, Robert D.
    Clements, Jody
    Eddy, Sean R.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : W29 - W37