RNA structure prediction using deep learning — A comprehensive review

被引:3
作者
Chaturvedi, Mayank [1 ]
Rashid, Mahmood A. [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, 4111, QLD
基金
澳大利亚研究理事会;
关键词
Deep learning; Feature extraction; Machine learning; Neural networks; RNA secondary structure prediction; Transformers;
D O I
10.1016/j.compbiomed.2025.109845
中图分类号
学科分类号
摘要
In computational biology, accurate RNA structure prediction offers several benefits, including facilitating a better understanding of RNA functions and RNA-based drug design. Implementing deep learning techniques for RNA structure prediction has led tremendous progress in this field, resulting in significant improvements in prediction accuracy. This comprehensive review aims to provide an overview of the diverse strategies employed in predicting RNA secondary structures, emphasizing deep learning methods. The article categorizes the discussion into three main dimensions: feature extraction methods, existing state-of-the-art learning model architectures, and prediction approaches. We present a comparative analysis of various techniques and models highlighting their strengths and weaknesses. Finally, we identify gaps in the literature, discuss current challenges, and suggest future approaches to enhance model performance and applicability in RNA structure prediction tasks. This review provides a deeper insight into the subject and paves the way for further progress in this dynamic intersection of life sciences and artificial intelligence. © 2025 The Authors
引用
收藏
相关论文
共 189 条
[61]  
Penic R.J., Vlasic T., Huber R.G., Wan Y., Sikic M., RiNALMo: General-purpose RNA language models can generalize well on structure prediction tasks, (2024)
[62]  
A. Rashid M., K. Paliwal K., Single sequence based feature engineering for convolutional neural networks towards RNA contact map prediction, 2023 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE, pp. 1-6, (2023)
[63]  
Chen W., Zhang X., Brooker J., Lin H., Zhang L., Chou K.-C., PseKNC-General: a cross-platform package for generating various modes of pseudo nucleotide compositions, Bioinformatics, 31, 1, pp. 119-120, (2015)
[64]  
Liu B., BioSeq-Analysis: a platform for DNA, RNA and protein sequence analysis based on machine learning approaches, Brief. Bioinform., 20, 4, pp. 1280-1294, (2019)
[65]  
Chen Z., Zhao P., Li C., Li F., Xiang D., Chen Y.-Z., Akutsu T., Daly R.J., Webb G.I., Zhao Q., Et al., iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization, Nucleic Acids Res., 49, 10, (2021)
[66]  
Wang Y., Chen Z., Pan Z., Huang S., Liu J., Xia W., Zhang H., Zheng M., Li H., Hou T., Et al., RNAincoder: a deep learning-based encoder for RNA and RNA-associated interaction, Nucleic Acids Res., 51, W1, pp. W509-W519, (2023)
[67]  
Liu B., Liu F., Wang X., Chen J., Fang L., Chou K.-C., Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences, Nucleic Acids Res., 43, W1, pp. W65-W71, (2015)
[68]  
Liu B., Liu F., Fang L., Wang X., Chou K.-C., repRNA: a web server for generating various feature vectors of RNA sequences, Mol. Genet. Genom., 291, pp. 473-481, (2016)
[69]  
Muhammod R., Ahmed S., Md Farid D., Shatabda S., Sharma A., Dehzangi A., PyFeat: a Python-based effective feature generation tool for DNA, RNA and protein sequences, Bioinformatics, 35, 19, pp. 3831-3833, (2019)
[70]  
Chen Z., Zhao P., Li F., Marquez-Lago T.T., Leier A., Revote J., Zhu Y., Powell D.R., Akutsu T., Webb G.I., Et al., Ilearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data, Brief. Bioinform., 21, 3, pp. 1047-1057, (2020)