CNNcon: Improved Protein Contact Maps Prediction Using Cascaded Neural Networks

被引:15
|
作者
Ding, Wang [1 ]
Xie, Jiang [1 ,2 ,3 ]
Dai, Dongbo [1 ]
Zhang, Huiran [1 ]
Xie, Hao [4 ]
Zhang, Wu [1 ,2 ]
机构
[1] Shanghai Univ, Sch Engn & Comp Sci, Shanghai, Peoples R China
[2] Shanghai Univ, Inst Syst Biol, Shanghai, Peoples R China
[3] Univ Calif Irvine, Dept Math, Irvine, CA 92717 USA
[4] Wuhan Univ, Coll Stomatol, Wuhan 430072, Peoples R China
来源
PLOS ONE | 2013年 / 8卷 / 04期
关键词
RESIDUE CONTACTS; CORRELATED MUTATIONS;
D O I
10.1371/journal.pone.0061533
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Backgrounds: Despite continuing progress in X-ray crystallography and high-field NMR spectroscopy for determination of three-dimensional protein structures, the number of unsolved and newly discovered sequences grows much faster than that of determined structures. Protein modeling methods can possibly bridge this huge sequence-structure gap with the development of computational science. A grand challenging problem is to predict three-dimensional protein structure from its primary structure (residues sequence) alone. However, predicting residue contact maps is a crucial and promising intermediate step towards final three-dimensional structure prediction. Better predictions of local and non-local contacts between residues can transform protein sequence alignment to structure alignment, which can finally improve template based three-dimensional protein structure predictors greatly. Methods: CNNcon, an improved multiple neural networks based contact map predictor using six sub-networks and one final cascade-network, was developed in this paper. Both the sub-networks and the final cascade-network were trained and tested with their corresponding data sets. While for testing, the target protein was first coded and then input to its corresponding sub-networks for prediction. After that, the intermediate results were input to the cascade-network to finish the final prediction. Results: The CNNcon can accurately predict 58.86% in average of contacts at a distance cutoff of 8 angstrom for proteins with lengths ranging from 51 to 450. The comparison results show that the present method performs better than the compared state-of-the-art predictors. Particularly, the prediction accuracy keeps steady with the increase of protein sequence length. It indicates that the CNNcon overcomes the thin density problem, with which other current predictors have trouble. This advantage makes the method valuable to the prediction of long length proteins. As a result, the effective prediction of long length proteins could be possible by the CNNcon.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Prediction of contact maps with neural networks and correlated mutations
    Fariselli, P
    Olmea, O
    Valencia, A
    Casadio, R
    PROTEIN ENGINEERING, 2001, 14 (11): : 835 - 843
  • [2] NNcon: improved protein contact map prediction using 2D-recursive neural networks
    Tegge, Allison N.
    Wang, Zheng
    Eickholt, Jesse
    Cheng, Jianlin
    NUCLEIC ACIDS RESEARCH, 2009, 37 : W515 - W518
  • [3] DEEPCON: protein contact prediction using dilated convolutional neural networks with dropout
    Adhikari, Badri
    BIOINFORMATICS, 2020, 36 (02) : 470 - 477
  • [4] Protein contact prediction using metagenome sequence data and residual neural networks
    Wu, Qi
    Peng, Zhenling
    Anishchenko, Ivan
    Cong, Qian
    Baker, David
    Yang, Jianyi
    BIOINFORMATICS, 2020, 36 (01) : 41 - 48
  • [5] Cascaded bidirectional recurrent neural networks for protein secondary structure prediction
    Chen, Jinmiao
    Chaudhari, Narendra S.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (04) : 572 - 582
  • [6] DNCON2: improved protein contact prediction using two-level deep convolutional neural networks
    Adhikari, Badri
    Hou, Jie
    Cheng, Jianlin
    BIOINFORMATICS, 2018, 34 (09) : 1466 - 1472
  • [7] Prediction of membrane protein types by means of wavelet analysis and cascaded neural networks
    Rezaei, Mohammad Ali
    Abdolmaleki, Parviz
    Karami, Zahra
    Asadabadi, Ebrahim Barzegari
    Sherafat, Mohammad Amin
    Abrishami-Moghaddam, Hamid
    Fadaie, Marziyeh
    Forouzanfar, Mohammad
    JOURNAL OF THEORETICAL BIOLOGY, 2008, 254 (04) : 817 - 820
  • [8] Prediction of contact maps using modified transiently chaotic neural network
    Liu, Guixia
    Zhu, Yuanxian
    Zhou, Wengang
    Zhou, Chunguang
    Wang, Rongxing
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 696 - 701
  • [9] Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks
    Hanson, Jack
    Paliwal, Kuldip
    Litfin, Thomas
    Yang, Yuedong
    Zhou, Yaoqi
    BIOINFORMATICS, 2019, 35 (14) : 2403 - 2410
  • [10] Improved Prediction Method of Protein Contact Based on RBF Neural Network
    Sun Pengfei
    Zhang Jianpei
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 816 - 819