Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks

被引:3
作者
Saqib, Mohammad N. [1 ]
Kry, Justyna D. [1 ]
Gront, Dominik [1 ]
机构
[1] Univ Warsaw, Fac Chem, Biol & Chem Res Ctr, Pasteura 1, PL-02093 Warsaw, Poland
关键词
deep learning; machine learning; multi-class classifier; neural networks; protein secondary structure; protein structure prediction; protein secondary structure assignment; PREDICTION; ALGORITHM; MODELS;
D O I
10.3390/biom12060841
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When particular atoms are absent from an input protein structure, the procedure becomes more complicated, especially when only the alpha carbon locations are known. Various techniques have been tested and applied to this problem during the last forty years. The application of machine learning techniques is the most recent trend. This contribution presents the HECA classifier, which uses neural networks to assign protein secondary structure types. The technique exclusively employs C alpha coordinates. The Keras (TensorFlow) library was used to implement and train the neural network model. The BioShell toolkit was used to calculate the neural network input features from raw coordinates. The study's findings show that neural network-based methods may be successfully used to take on structure assignment challenges when only C alpha trace is available. Thanks to the careful selection of input features, our approach's accuracy (above 97%) exceeded that of the existing methods.
引用
收藏
页数:12
相关论文
共 48 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Assigning secondary structure in proteins using AI
    Antony, Jisna Vellara
    Madhu, Prayagh
    Balakrishnan, Jayaraj Pottekkattuvalappil
    Yadav, Hemant
    [J]. JOURNAL OF MOLECULAR MODELING, 2021, 27 (09)
  • [3] Accurate prediction of protein structures and interactions using a three-track neural network
    Baek, Minkyung
    DiMaio, Frank
    Anishchenko, Ivan
    Dauparas, Justas
    Ovchinnikov, Sergey
    Lee, Gyu Rie
    Wang, Jue
    Cong, Qian
    Kinch, Lisa N.
    Schaeffer, R. Dustin
    Millan, Claudia
    Park, Hahnbeom
    Adams, Carson
    Glassman, Caleb R.
    DeGiovanni, Andy
    Pereira, Jose H.
    Rodrigues, Andria V.
    van Dijk, Alberdina A.
    Ebrecht, Ana C.
    Opperman, Diederik J.
    Sagmeister, Theo
    Buhlheller, Christoph
    Pavkov-Keller, Tea
    Rathinaswamy, Manoj K.
    Dalwadi, Udit
    Yip, Calvin K.
    Burke, John E.
    Garcia, K. Christopher
    Grishin, Nick V.
    Adams, Paul D.
    Read, Randy J.
    Baker, David
    [J]. SCIENCE, 2021, 373 (6557) : 871 - +
  • [4] A New Secondary Structure Assignment Algorithm Using Cα Backbone Fragments
    Cao, Chen
    Wang, Guishen
    Liu, An
    Xu, Shutan
    Wang, Lincong
    Zou, Shuxue
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (03):
  • [5] AWSEM-MD: Protein Structure Prediction Using Coarse-Grained Physical Potentials and Bioinformatically Based Local Structure Biasing
    Davtyan, Aram
    Schafer, Nicholas P.
    Zheng, Weihua
    Clementi, Cecilia
    Wolynes, Peter G.
    Papoian, Garegin A.
    [J]. JOURNAL OF PHYSICAL CHEMISTRY B, 2012, 116 (29) : 8494 - 8503
  • [6] Coarse-Grained Modeling of the Interplay between Secondary Structure Propensities and Protein Fold Assembly
    Dawid, Aleksandra E.
    Gront, Dominik
    Kolinski, Andrzej
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2018, 14 (04) : 2277 - 2287
  • [7] SURPASS Low-Resolution Coarse-Grained Protein Modeling
    Dawid, Aleksandra E.
    Gront, Dominik
    Kolinski, Andrzej
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (11) : 5766 - 5779
  • [8] Protein secondary structure assignment through Voronoi tessellation
    Dupuis, F
    Sadoc, JF
    Mornon, JP
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (03) : 519 - 528
  • [9] Knowledge-based protein secondary structure assignment
    Frishman, D
    Argos, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04): : 566 - 579
  • [10] BioShell - a package of tools for structural biology computations
    Gront, D
    Kolinski, A
    [J]. BIOINFORMATICS, 2006, 22 (05) : 621 - 622