Machine learning techniques for protein secondary structure prediction: An overview and evaluation

被引:26
|
作者
Yoo, Paul D. [1 ]
Zhou, Bing Bing [1 ]
Zomaya, Albert Y. [2 ,3 ]
机构
[1] Univ Sydney, Adv Networks Res Grp, Sch Informat Technol J12, Sydney, NSW 2006, Australia
[2] Univ Sydney, Sydney Bioinformat Ctr, Sydney, NSW 2006, Australia
[3] Univ Sydney, Ctr Math Biol, Sydney, NSW 2006, Australia
关键词
amino acids encoding; evolutionary information; long-range dependencies; machine learning; protein secondary structure;
D O I
10.2174/157489308784340676
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The prediction of protein secondary structures is not only of great importance for many biological applications but also regarded as an important stepping stone for solving the mystery of how amino acid sequences fold into tertiary structures. Recent research on secondary structure prediction is mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. The most significant breakthroughs were the incorporation of new biological information into an efficient prediction model and the development of new models which can efficiently exploit suitable information from its primary sequence. Hence this paper reviews the theoretical and experimental literature of these models with a focus on informational issues involving evolutionary and long-range information of protein sequences. Furthermore, we investigate several key issues in protein data processing which involve dimensionality reduction and encoding schemes.
引用
收藏
页码:74 / 86
页数:13
相关论文
共 50 条
  • [21] NMR signal processing, prediction, and structure verification with machine learning techniques
    Cobas, Carlos
    MAGNETIC RESONANCE IN CHEMISTRY, 2020, 58 (06) : 512 - 519
  • [22] Performance evaluation of different machine learning techniques for prediction of heart disease
    Ashok Kumar Dwivedi
    Neural Computing and Applications, 2018, 29 : 685 - 693
  • [23] Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction
    S. Saraswathi
    J. L. Fernández-Martínez
    A. Kolinski
    R. L. Jernigan
    A. Kloczkowski
    Journal of Molecular Modeling, 2012, 18 : 4275 - 4289
  • [24] Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction
    Saraswathi, S.
    Fernandez-Martinez, J. L.
    Kolinski, A.
    Jernigan, R. L.
    Kloczkowski, A.
    JOURNAL OF MOLECULAR MODELING, 2012, 18 (09) : 4275 - 4289
  • [26] An overview of machine learning techniques in constraint solving
    Andrei Popescu
    Seda Polat-Erdeniz
    Alexander Felfernig
    Mathias Uta
    Müslüm Atas
    Viet-Man Le
    Klaus Pilsl
    Martin Enzelsberger
    Thi Ngoc Trang Tran
    Journal of Intelligent Information Systems, 2022, 58 : 91 - 118
  • [27] An overview of machine learning techniques in constraint solving
    Popescu, Andrei
    Polat-Erdeniz, Seda
    Felfernig, Alexander
    Uta, Mathias
    Atas, Muslum
    Viet-Man Le
    Pilsl, Klaus
    Enzelsberger, Martin
    Thi Ngoc Trang Tran
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 58 (01) : 91 - 118
  • [28] Gap, techniques and evaluation: traffic flow prediction using machine learning and deep learning
    Noor Afiza Mat Razali
    Nuraini Shamsaimon
    Khairul Khalil Ishak
    Suzaimah Ramli
    Mohd Fahmi Mohamad Amran
    Sazali Sukardi
    Journal of Big Data, 8
  • [29] Gap, techniques and evaluation: traffic flow prediction using machine learning and deep learning
    Razali, Noor Afiza Mat
    Shamsaimon, Nuraini
    Ishak, Khairul Khalil
    Ramli, Suzaimah
    Amran, Mohd Fahmi Mohamad
    Sukardi, Sazali
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [30] Testing machine learning techniques for general application by using protein secondary structure prediction. A brief survey with studies of pitfalls and benefits using a simple progressive learning approach
    Robson, Barry
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 138