DeepRaccess: high-speed RNA accessibility prediction using deep learning

被引:1
作者
Hara, Kaisei [1 ,2 ]
Iwano, Natsuki [1 ]
Fukunaga, Tsukasa [3 ]
Hamada, Michiaki [1 ,2 ,4 ]
机构
[1] Waseda Univ, Grad Sch Adv Sci & Engn, Dept Elect Engn & Biosci, Tokyo, Japan
[2] Waseda Univ, Computat Bio Big Data Open Innovat Lab, AIST, Tokyo, Japan
[3] Waseda Univ, Waseda Inst Adv Study, Tokyo, Japan
[4] Nippon Med Sch, Grad Sch Med, Tokyo, Japan
来源
FRONTIERS IN BIOINFORMATICS | 2023年 / 3卷
关键词
RNA secondary structure; RNA accessibility; machine learning; acceleration; translation efficiency prediction; BASE-PAIRING PROBABILITIES; SECONDARY STRUCTURE; PARTITION-FUNCTION;
D O I
10.3389/fbinf.2023.1275787
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA accessibility is a useful RNA secondary structural feature for predicting RNA-RNA interactions and translation efficiency in prokaryotes. However, conventional accessibility calculation tools, such as Raccess, are computationally expensive and require considerable computational time to perform transcriptome-scale analysis. In this study, we developed DeepRaccess, which predicts RNA accessibility based on deep learning methods. DeepRaccess was trained to take artificial RNA sequences as input and to predict the accessibility of these sequences as calculated by Raccess. Simulation and empirical dataset analyses showed that the accessibility predicted by DeepRaccess was highly correlated with the accessibility calculated by Raccess. In addition, we confirmed that DeepRaccess could predict protein abundance in E.coli with moderate accuracy from the sequences around the start codon. We also demonstrated that DeepRaccess achieved tens to hundreds of times software speed-up in a GPU environment. The source codes and the trained models of DeepRaccess are freely available at https://github.com/hmdlab/DeepRaccess.
引用
收藏
页数:8
相关论文
共 50 条
[1]   Predicting effective microRNA target sites in mammalian mRNAs [J].
Agarwal, Vikram ;
Bell, George W. ;
Nam, Jin-Wu ;
Bartel, David P. .
ELIFE, 2015, 4
[2]   Informative RNA base embedding for RNA structural alignment and clustering by deep representation learning [J].
Akiyama, Manato ;
Sakakibara, Yasubumi .
NAR GENOMICS AND BIOINFORMATICS, 2022, 4 (01)
[3]   Harnessing machine learning to guide phylogenetic-tree search algorithms [J].
Azouri, Dana ;
Abadi, Shiran ;
Mansour, Yishay ;
Mayrose, Itay ;
Pupko, Tal .
NATURE COMMUNICATIONS, 2021, 12 (01)
[4]   Local RNA base pairing probabilities in large sequences [J].
Bernhart, SH ;
Hofacker, IL ;
Stadler, PF .
BIOINFORMATICS, 2006, 22 (05) :614-615
[5]   RNA Accessibility in cubic time [J].
Bernhart, Stephan H. ;
Mueckstein, Ullrike ;
Hofacker, Ivo L. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6
[6]   Cryo-EM reveals an entangled kinetic trap in the folding of a catalytic RNA [J].
Bonilla, Steve L. ;
Vicens, Quentin ;
Kieft, Jeffrey S. .
SCIENCE ADVANCES, 2022, 8 (34)
[7]   Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli [J].
Cambray, Guillaume ;
Guimaraes, Joao C. ;
Arkin, Adam Paul .
NATURE BIOTECHNOLOGY, 2018, 36 (10) :1005-+
[8]   Alignment-free comparison of metagenomics sequences via approximate string matching [J].
Chen, Jian ;
Yang, Le ;
Li, Lu ;
Goodison, Steve ;
Sun, Yijun .
BIOINFORMATICS ADVANCES, 2022, 2 (01)
[9]  
Corso G., 2021, ADV NEUR IN, V34
[10]   SECONDARY STRUCTURE OF THE RIBOSOME BINDING-SITE DETERMINES TRANSLATIONAL EFFICIENCY - A QUANTITATIVE-ANALYSIS [J].
DESMIT, MH ;
VANDUIN, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (19) :7668-7672