UFold: fast and accurate RNA secondary structure prediction with deep learning

Cited by: 91
Authors
Fu, Laiyi [1, 2]
Cao, Yingxin [2, 5, 6]
Wu, Jie [3]
Peng, Qinke [1]
Nie, Qing [4, 5, 6]
Xie, Xiaohui [2]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Syst Engn Inst, Xian 710049, Shaanxi, Peoples R China
[2] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Biol Chem, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Dept Math, Irvine, CA 92697 USA
[5] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA 92697 USA
[6] Univ Calif Irvine, NSF Simons Ctr Multiscale Cell Fate Res, Irvine, CA 92697 USA
Keywords
WEB SERVER; PROTEIN; DESIGN;
DOI
10.1093/nar/gkab1074
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models and free energy minimization, an approach that imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold introduces a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving performance similar to that of traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast, with an inference time of about 160 ms per sequence for sequences up to 1500 bp in length. An online web server running UFold is available at . Code is available at .
Pages: 12