Using Deep Learning (CNN, RNN, LSTM, GRU) methods for the prediction of Protein Secondary Structure

被引:0
作者
Cakmak, Ezgi [1 ]
Selvi, Ihsan Hakan [1 ]
机构
[1] Sakarya Univ, Bilgisayar & Bilisim Bilimleri Fak, Bilisim Sistemleri Muhendisligi, Sakarya, Turkiye
来源
ACTA INFOLOGICA | 2022年 / 6卷 / 01期
关键词
Protein Secondary Structure Prediction; CNN; RNN; GRU; NEURAL-NETWORKS; ACCURACY; PORTER;
D O I
10.26650/acin.1008075
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Proteins play a crucial function in the biological processes of living organisms. Knowing the function of the protein offers significant insight into future biological and medical research. Since a protein's shape determines its function, it is important to understand the protein's 3D structure. Although experimental methods such as X-ray crystallography and nuclear magnetic resonance (NMR) have been used to examine the shape of proteins, so far the results have been insufficient. As a result, predicting the 3D structure of proteins is crucial. Determining the 3D structure of a protein from its primary structure is challenging. Therefore, predicting the protein secondary structure becomes important for studying its structure and function. Many emerging methods, including machine learning, as well as deep learning, have been used to predict the secondary structure of proteins and comprise a crucial part of Structural Bioinformatics. The goal of this study is to compare the results generated by predictive models that were created using the four most frequently utilized deep learning methods: convolutional neural networks (CNN), recurrent neural networks (RNN), long short term memory networks (LSTM), and gated recurrent units (GRU). The CB513 dataset was used to train and test these models, and performance evaluation metrics viz. accuracy, f1 score, recall, and precision were applied. The CNN, RNN, LSTM, and GRU models had an accuracy of 82.54%, 82.06%, 81.1%, and 81.48%, respectively
引用
收藏
页码:43 / 52
页数:10
相关论文
共 26 条
  • [1] Allison L.A., 2007, Fundamental Molecular Biology
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] Aydin Zafer, 2018, SIGNAL PROCESSING CO
  • [4] Exploiting the past and the future in protein secondary structure prediction
    Baldi, P
    Brunak, S
    Frasconi, P
    Soda, G
    Pollastri, G
    [J]. BIOINFORMATICS, 1999, 15 (11) : 937 - 946
  • [5] Branden C.I., 2012, INTRO PROTEIN STRUCT, DOI DOI 10.1201/9781136969898
  • [6] Prediction of protein secondary structure content using support vector machine
    Chen, Chao
    Tian, Yuanxin
    Zou, Xiaoyong
    Cai, Peixiang
    Mo, Jinyuan
    [J]. TALANTA, 2007, 71 (05) : 2069 - 2073
  • [7] PREDICTION OF PROTEIN CONFORMATION
    CHOU, PY
    FASMAN, GD
    [J]. BIOCHEMISTRY, 1974, 13 (02) : 222 - 245
  • [8] Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
  • [9] 2-4
  • [10] The Protein-Folding Problem, 50 Years On
    Dill, Ken A.
    MacCallum, Justin L.
    [J]. SCIENCE, 2012, 338 (6110) : 1042 - 1046