Assessing Global-Local Secondary Structure Fingerprints to Classify RNA Sequences With Deep Learning

被引:3
作者
Sutanto, Kevin [1 ]
Turcotte, Marcel [1 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
RNA classification; non-coding RNA; secondary structure; deep learning; k-mers; NONCODING RNAS; COMPUTATIONAL IDENTIFICATION; PREDICTION;
D O I
10.1109/TCBB.2021.3118358
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
RNA elements that are transcribed but not translated into proteins are called non-coding RNAs (ncRNAs). They play wide-ranging roles in biological processes and disorders. Just like proteins, their structure is often intimately linked to their function. Many examples have been documented where structure is conserved across taxa despite sequence divergence. Thus, structure is often used to identify function. Specifically, the secondary structure is predicted and ncRNAs with similar structures are assumed to have same or similar functions. However, a strand of RNA can fold into multiple possible structures, and some strands even fold differently in vivo and in vitro. Furthermore, ncRNAs often function as RNA-protein complexes, which can affect structure. Because of these, we hypothesized using one structure per sequence may discard information, possibly resulting in poorer classification accuracy. Therefore, we propose using secondary structure fingerprints, comprising two categories: a higher-level representation derived from RNA-As-Graphs (RAG), and free energy fingerprints based on a curated repertoire of small structural motifs. The fingerprints take into account the difference between global and local structural matches. We also evaluated our deep learning architecture with k-mers. By combining our global-local fingerprints with 6-mer, we achieved an accuracy, precision, and recall of 91.04%, 91.10%, and 91.00%.
引用
收藏
页码:2736 / 2747
页数:12
相关论文
共 58 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Evaluation of deep learning in non-coding RNA classification
    Amin, Noorul
    McGrath, Annette
    Chen, Yi-Ping Phoebe
    [J]. NATURE MACHINE INTELLIGENCE, 2019, 1 (05) : 246 - 256
  • [3] Deep learning for computational biology
    Angermueller, Christof
    Parnamaa, Tanel
    Parts, Leopold
    Stegle, Oliver
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
  • [4] LncRNAnet: long non-coding RNA identification using deep learning
    Baek, Junghwan
    Lee, Byunghan
    Kwon, Sunyoung
    Yoon, Sungroh
    [J]. BIOINFORMATICS, 2018, 34 (22) : 3889 - 3897
  • [5] NON-CODING RNAs IN DEVELOPMENT AND DISEASE: BACKGROUND, MECHANISMS, AND THERAPEUTIC APPROACHES
    Beermann, Julia
    Piccoli, Maria-Teresa
    Viereck, Janika
    Thum, Thomas
    [J]. PHYSIOLOGICAL REVIEWS, 2016, 96 (04) : 1297 - 1325
  • [6] Borgelt C., 2005, P 1 INT WORKSHOP OPE, P6, DOI DOI 10.1145/1133905.1133908
  • [7] A Novel Integrative Approach for Non-coding RNA Classification Based on Deep Learning
    Boukelia, Abdelbasset
    Boucheham, Anouar
    Belguidou, Meriem
    Batouche, Mohamed
    Zehraoui, Farida
    Tahi, Fariza
    [J]. CURRENT BIOINFORMATICS, 2020, 15 (04) : 338 - 348
  • [8] Non-coding RNA therapeutics for cardiac regeneration
    Braga, Luca
    Ali, Hashim
    Secco, Ilaria
    Giacca, Mauro
    [J]. CARDIOVASCULAR RESEARCH, 2021, 117 (03) : 674 - 693
  • [9] The fungal snoRNAome
    Canzler, Sebastian
    Stadler, Peter F.
    Schor, Jana
    [J]. RNA, 2018, 24 (03) : 342 - 360
  • [10] The Noncoding RNA Revolution-Trashing Old Rules to Forge New Ones
    Cech, Thomas R.
    Steitz, Joan A.
    [J]. CELL, 2014, 157 (01) : 77 - 94