Predicting RNA sequence-structure likelihood via structure-aware deep learning

被引:0
|
作者
Zhou, You [1 ,2 ]
Pedrielli, Giulia [1 ,2 ]
Zhang, Fei [3 ]
Wu, Teresa [1 ,2 ]
机构
[1] Arizona State Univ, Sch Comp & Augmented Intelligence, 699 S Mill Ave, Tempe, AZ 85281 USA
[2] Arizona State Univ, ASU Mayo Ctr Innovat Imaging, 699 S Mill Ave, Tempe, AZ 85281 USA
[3] Rutgers State Univ, Dept Chem, 73 Warren St, Newark, NJ 07102 USA
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
基金
美国国家科学基金会;
关键词
Deep learning; RNA; Secondary structure prediction; SECONDARY STRUCTURE; THERMODYNAMIC PARAMETERS; DESIGN;
D O I
10.1186/s12859-024-05916-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundThe active functionalities of RNA are recognized to be heavily dependent on the structure and sequence. Therefore, a model that can accurately evaluate a design by giving RNA sequence-structure pairs would be a valuable tool for many researchers. Machine learning methods have been explored to develop such tools, showing promising results. However, two key issues remain. Firstly, the performance of machine learning models is affected by the features used to characterize RNA. Currently, there is no consensus on which features are the most effective for characterizing RNA sequence-structure pairs. Secondly, most existing machine learning methods extract features describing entire RNA molecule. We argue that it is essential to define additional features that characterize nucleotides and specific sections of RNA structure to enhance the overall efficacy of the RNA design process.ResultsWe develop two deep learning models for evaluating RNA sequence-secondary structure pairs. The first model, NU-ResNet, uses a convolutional neural network architecture that solves the aforementioned problems by explicitly encoding RNA sequence-structure information into a 3D matrix. Building upon NU-ResNet, our second model, NUMO-ResNet, incorporates additional information derived from the characterizations of RNA, specifically the 2D folding motifs. In this work, we introduce an automated method to extract these motifs based on fundamental secondary structure descriptions. We evaluate the performance of both models on an independent testing dataset. Our proposed models outperform the models from literatures in this independent testing dataset. To assess the robustness of our models, we conduct 10-fold cross validation. To evaluate the generalization ability of NU-ResNet and NUMO-ResNet across different RNA families, we train and test our proposed models in different RNA families. Our proposed models show superior performance compared to the models from literatures when being tested across different independent RNA families.ConclusionsIn this study, we propose two deep learning models, NU-ResNet and NUMO-ResNet, to evaluate RNA sequence-secondary structure pairs. These two models expand the field of data-driven approaches for learning RNA. Furthermore, these two models provide the new method to encode RNA sequence-secondary structure pairs.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Structure-Aware Deep Learning for Product Image Classification
    Chen, Zhineng
    Al, Shanshan
    Jia, Caiyan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [2] RNA design via structure-aware multifrontier ensemble optimization
    Zhou, Tianshuo
    Dai, Ning
    Li, Sizhen
    Ward, Max
    Mathews, David H.
    Huang, Liang
    BIOINFORMATICS, 2023, 39 : i563 - i571
  • [3] RNA design via structure-aware multifrontier ensemble optimization
    Zhou, Tianshuo
    Dai, Ning
    Li, Sizhen
    Ward, Max
    Mathews, David H.
    Huang, Liang
    BIOINFORMATICS, 2023, 39 : I563 - I571
  • [4] Structure-aware deep clustering network based on contrastive learning
    Chen, Bowei
    Xu, Sen
    Xu, Heyang
    Bian, Xuesheng
    Guo, Naixuan
    Xu, Xiufang
    Hua, Xiaopeng
    NEURAL NETWORKS, 2023, 167 : 118 - 128
  • [5] Sequence-structure relations of pseudoknot RNA
    Huang, Fenix W. D.
    Li, Linda Y. M.
    Reidys, Christian M.
    BMC BIOINFORMATICS, 2009, 10
  • [6] Structure-aware deep learning for chronic middle ear disease
    Wang, Zheng
    Song, Jian
    Su, Ri
    Hou, Muzhou
    Qi, Min
    Zhang, Jianglin
    Wu, Xuewen
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 194
  • [7] Paramecium: RNA sequence-structure phylogenetics
    Weimer, Marlyn
    Vd'acny, Peter
    Wolf, Matthias
    INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2023, 73 (04)
  • [8] Sequence-structure relations of pseudoknot RNA
    Fenix WD Huang
    Linda YM Li
    Christian M Reidys
    BMC Bioinformatics, 10
  • [9] Structure-aware deep learning model for peptide toxicity prediction
    Ebrahimikondori, Hossein
    Sutherland, Darcy
    Yanai, Anat
    Richter, Amelia
    Salehi, Ali
    Li, Chenkai
    Coombe, Lauren
    Kotkoff, Monica
    Warren, Rene L.
    Birol, Inanc
    PROTEIN SCIENCE, 2024, 33 (07)
  • [10] Position and structure-aware graph learning
    Ye, Guoqiang
    Song, Juan
    Feng, Mingtao
    Zhu, Guangming
    Shen, Peiyi
    Zhang, Liang
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    NEUROCOMPUTING, 2023, 556