UFold: fast and accurate RNA secondary structure prediction with deep learning

被引:91
|
作者
Fu, Laiyi [1 ,2 ]
Cao, Yingxin [2 ,5 ,6 ]
Wu, Jie [3 ]
Peng, Qinke [1 ]
Nie, Qing [4 ,5 ,6 ]
Xie, Xiaohui [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Syst Engn Inst, Xian 710049, Shaanxi, Peoples R China
[2] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Biol Chem, Irvine, CA 92697 USA
[4] Univ Calif Irvine, Dept Math, Irvine, CA 92697 USA
[5] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA 92697 USA
[6] Univ Calif Irvine, NSF Simons Ctr Multiscale Cell Fate Res, Irvine, CA 92697 USA
关键词
WEB SERVER; PROTEIN; DESIGN;
D O I
10.1093/nar/gkab1074
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but the prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models through free energy minimization, which imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold proposes a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving a similar performance as the traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast with an inference time of about 160 ms per sequence up to 1500 bp in length. An online web server running UFold is available at . Code is available at .
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Crystal Structure Prediction via Deep Learning
    Ryan, Kevin
    Lengyel, Jeff
    Shatruk, Michael
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2018, 140 (32) : 10158 - 10168
  • [22] Fast prediction of propeller dynamic wake based on deep learning
    Li, Changming
    Liang, Bingchen
    Yuan, Peng
    Zhang, Qin
    Liu, Yongkai
    Liu, Bin
    Zhao, Ming
    PHYSICS OF FLUIDS, 2024, 36 (08)
  • [23] A n2 RNA SECONDARY STRUCTURE PREDICTION ALGORITHM
    Nebel, Markus E.
    Scheid, Anika
    BIOINFORMATICS: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, 2012, : 66 - 75
  • [24] Redundancy-weighting the PDB for detailed secondary structure prediction using deep-learning models
    Sidi, Tomer
    Keasar, Chen
    BIOINFORMATICS, 2020, 36 (12) : 3733 - 3738
  • [25] Machine learning in RNA structure prediction: Advances and challenges
    Zhang, Sicheng
    Li, Jun
    Chen, Shi-Jie
    BIOPHYSICAL JOURNAL, 2024, 123 (17) : 2647 - 2657
  • [26] Protein Structure Prediction: Conventional and Deep Learning Perspectives
    Jisna, V. A.
    Jayaraj, P. B.
    PROTEIN JOURNAL, 2021, 40 (04) : 522 - 544
  • [27] Interaction prediction in structure-based virtual screening using deep learning
    Gonczarek, Adam
    Tomczak, Jakub M.
    Zareba, Szymon
    Kaczmar, Joanna
    Dabrowski, Piotr
    Walczak, Michal J.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 100 : 253 - 258
  • [28] Deep learning generative model for crystal structure prediction
    Luo, Xiaoshan
    Wang, Zhenyu
    Gao, Pengyue
    Lv, Jian
    Wang, Yanchao
    Chen, Changfeng
    Ma, Yanming
    NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [29] TMPSS: A Deep Learning-Based Predictor for Secondary Structure and Topology Structure Prediction of Alpha-Helical Transmembrane Proteins
    Liu, Zhe
    Gong, Yingli
    Bao, Yihang
    Guo, Yuanzhao
    Wang, Han
    Lin, Guan Ning
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2021, 8
  • [30] Prediction of RNA secondary structure by maximizing pseudo-expected accuracy
    Hamada, Michiaki
    Sato, Kengo
    Asai, Kiyoshi
    BMC BIOINFORMATICS, 2010, 11