UFold: fast and accurate RNA secondary structure prediction with deep learning

被引:91
作者
Fu, Laiyi [1 ,2 ]
Cao, Yingxin [2 ,5 ,6 ]
Wu, Jie [3 ]
Peng, Qinke [1 ]
Nie, Qing [4 ,5 ,6 ]
Xie, Xiaohui [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Syst Engn Inst, Xian 710049, Shaanxi, Peoples R China
[2] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Biol Chem, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Dept Math, Irvine, CA 92697 USA
[5] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA 92697 USA
[6] Univ Calif Irvine, NSF Simons Ctr Multiscale Cell Fate Res, Irvine, CA 92697 USA
关键词
WEB SERVER; PROTEIN; DESIGN;
D O I
10.1093/nar/gkab1074
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but the prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models through free energy minimization, which imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold proposes a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving a similar performance as the traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast with an inference time of about 160 ms per sequence up to 1500 bp in length. An online web server running UFold is available at . Code is available at .
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Accurate prediction of CDR-H3 loop structures of antibodies with deep learning
    Chen, Hedi
    Fan, Xiaoyu
    Zhu, Shuqian
    Pei, Yuchan
    Zhang, Xiaochun
    Zhang, Xiaonan
    Liu, Lihang
    Qian, Feng
    Tian, Boxue
    ELIFE, 2024, 12
  • [42] Microstructure-informed deep learning model for accurate prediction of multiple concrete properties
    Li, Ye
    Ma, Yiming
    Tan, Kang Hai
    Qian, Hanjie
    Liu, Tiejun
    JOURNAL OF BUILDING ENGINEERING, 2024, 98
  • [43] Conserved secondary structure prediction for similar highly group of related RNA sequences
    Fu, Haoyue
    Xue, Dingyu
    Zhang, Xiangde
    Jia, Cangzhi
    CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 5158 - +
  • [44] New insights from cluster analysis methods for RNA secondary structure prediction
    Rogers, Emily
    Heitsch, Christine
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2016, 7 (03) : 278 - 294
  • [45] Geometric deep learning for the prediction of magnesium-binding sites in RNA structures
    Wang, Kang
    Yin, Zuode
    Sang, Chunjiang
    Xia, Wentao
    Wang, Yan
    Sun, Tingting
    Xu, Xiaojun
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2024, 262
  • [46] Recent methodology progress of deep learning for RNA-protein interaction prediction
    Pan, Xiaoyong
    Yang, Yang
    Xia, Chun-Qiu
    Mirza, Aashiq H.
    Shen, Hong-Bin
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2019, 10 (06)
  • [47] Advances in RNA-protein structure prediction
    Zeng ChengWei
    Zhao YunJie
    SCIENTIA SINICA-PHYSICA MECHANICA & ASTRONOMICA, 2023, 53 (09)
  • [48] Prediction of protein secondary structure based on deep residual convolutional neural network
    Cheng, Jinyong
    Xu, Ying
    Zhao, Yunxiang
    BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2021, 35 (01) : 1881 - 1890
  • [49] Operon Finder: A Deep Learning-based Web Server for Accurate Prediction of Prokaryotic Operons
    Tomar, Tejasvi Singh
    Dasgupta, Pratik
    Kanaujia, Shankar Prasad
    JOURNAL OF MOLECULAR BIOLOGY, 2023, 435 (14)
  • [50] An accurate and interpretable deep learning model for environmental properties prediction using hybrid molecular representations
    Zhang, Jun
    Wang, Qin
    Su, Yang
    Jin, Saimeng
    Ren, Jingzheng
    Eden, Mario
    Shen, Weifeng
    AICHE JOURNAL, 2022, 68 (06)