A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction

被引:21
|
作者
Spencer, Matt [1 ]
Eickholt, Jesse [2 ]
Cheng, Jianlin [3 ]
机构
[1] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
[2] Cent Michigan Univ, Dept Comp Sci, Mt Pleasant, MI 48859 USA
[3] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
基金
美国国家卫生研究院;
关键词
Machine learning; neural nets; protein structure prediction; deep learning; NEURAL-NETWORKS; GENERATION; ACCURATE;
D O I
10.1109/TCBB.2014.2343960
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Ab initio protein secondary structure (SS) predictions are utilized to generate tertiary structure predictions, which are increasingly demanded due to the rapid discovery of proteins. Although recent developments have slightly exceeded previous methods of SS prediction, accuracy has stagnated around 80 percent and many wonder if prediction cannot be advanced beyond this ceiling. Disciplines that have traditionally employed neural networks are experimenting with novel deep learning techniques in attempts to stimulate progress. Since neural networks have historically played an important role in SS prediction, we wanted to determine whether deep learning could contribute to the advancement of this field as well. We developed an SS predictor that makes use of the position-specific scoring matrix generated by PSI-BLAST and deep learning network architectures, which we call DNSS. Graphical processing units and CUDA software optimize the deep network architecture and efficiently train the deep networks. Optimal parameters for the training process were determined, and a workflow comprising three separately trained deep networks was constructed in order to make refined predictions. This deep learning network approach was used to predict SS for a fully independent test dataset of 198 proteins, achieving a Q(3) accuracy of 80.7 percent and a Sov accuracy of 74.2 percent.
引用
收藏
页码:103 / 112
页数:10
相关论文
共 50 条
  • [1] A Deep Learning Approach for Prediction of Protein Secondary Structure
    Zubair, Muhammad
    Hanif, Muhammad Kashif
    Alabdulkreem, Eatedal
    Ghadi, Yazeed
    Khan, Muhammad Irfan
    Sarwar, Muhammad Umer
    Hanif, Ayesha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3705 - 3718
  • [2] DNSS2: Improved ab initio protein secondary structure prediction using advanced deep learning architectures
    Guo, Zhiye
    Hou, Jie
    Cheng, Jianlin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (02) : 207 - 217
  • [3] Fast and accurate Ab Initio Protein structure prediction using deep learning potentials
    Pearce, Robin
    Li, Yang
    Omenn, Gilbert S.
    Zhang, Yang
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (09)
  • [4] Deep learning geometrical potential for high-accuracy ab initio protein structure prediction
    Li, Yang
    Zhang, Chengxin
    Yu, Dong-Jun
    Zhang, Yang
    ISCIENCE, 2022, 25 (06)
  • [5] Ab initio protein structure prediction
    Hardin, C
    Pogorelov, TV
    Luthey-Schulten, Z
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (02) : 176 - 181
  • [6] TOUCHSTONE II: A new approach to ab initio protein structure prediction
    Zhang, Y
    Kolinski, A
    Skolnick, J
    BIOPHYSICAL JOURNAL, 2003, 85 (02) : 1145 - 1164
  • [7] Ab initio protein structure prediction using a combined hierarchical approach
    Samudrala, R
    Xia, Y
    Huang, E
    Levitt, M
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1999, : 194 - 198
  • [8] Protein Secondary Structure Prediction Based on Deep Learning
    Zheng, Lin
    Li, Hong-ling
    Wu, Nan
    Ao, Li
    3RD INTERNATIONAL SYMPOSIUM ON MECHATRONICS AND INDUSTRIAL INFORMATICS, (ISMII 2017), 2017, : 171 - 177
  • [9] Lattices for ab initio protein structure prediction
    Pierri, Ciro Leonardo
    De Grassi, Anna
    Turi, Antonio
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 73 (02) : 351 - 361
  • [10] A Parallel Multi-objective Ab initio Approach for Protein Structure Prediction
    Becerra, David
    Sandoval, Angelica
    Restrepo-Montoya, Daniel
    Nino, Luis F.
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 137 - 141