LncRNAnet: long non-coding RNA identification using deep learning

被引:68
|
作者
Baek, Junghwan [1 ]
Lee, Byunghan [2 ]
Kwon, Sunyoung [2 ]
Yoon, Sungroh [1 ,2 ]
机构
[1] Seoul Natl Univ, Interdisciplinary Program Bioinformat, Seoul 08826, South Korea
[2] Seoul Natl Univ, Elect & Comp Engn, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
GENE; EVOLUTION; ANNOTATION; EXPRESSION; ALIGNMENT; FEATURES; GENCODE;
D O I
10.1093/bioinformatics/bty418
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Long non-coding RNAs (IncRNAs) are important regulatory elements in biological processes. LncRNAs share similar sequence characteristics with messenger RNAs, but they play completely different roles, thus providing novel insights for biological studies. The development of next-generation sequencing has helped in the discovery of IncRNA transcripts. However, the experimental verification of numerous transcriptomes is time consuming and costly. To alleviate these issues, a computational approach is needed to distinguish IncRNAs from the transcriptomes. Results: We present a deep learning-based approach, IncRNAnet, to identify IncRNAs that incorporates recurrent neural networks for RNA sequence modeling and convolutional neural networks for detecting stop codons to obtain an open reading frame indicator. IncRNAnet performed clearly better than the other tools for sequences of short lengths, on which most IncRNAs are distributed. In addition, IncRNAnet successfully learned features and showed 7.83%, 5.76%, 5.30% and 3.78% improvements over the alternatives on a human test set in terms of specificity, accuracy, Fl-score and area under the curve, respectively.
引用
收藏
页码:3889 / 3897
页数:9
相关论文
共 50 条
  • [1] Identification and function of long non-coding RNA
    Emst, Carl
    Morton, Cynthia C.
    FRONTIERS IN CELLULAR NEUROSCIENCE, 2013, 7
  • [2] lncDML: Identification of long non-coding RNAs by Deep Metric Learning
    Zhao, Pengfei
    Peng, Qinke
    Zhu, Zhibo
    Han, Tian
    Dong, Rida
    Huang, Huijun
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1808 - 1812
  • [3] Interpretable Deep Learning Model Reveals Subsequences of Various Functions for Long Non-Coding RNA Identification
    Lin, Rattaphon
    Wichadakul, Duangdao
    FRONTIERS IN GENETICS, 2022, 13
  • [4] Identification of Long Non-coding RNA from inherent features using Machine Learning Techniques
    Sreeshma, C. M.
    Manu, Madhavan
    Gopakumar, G.
    2018 INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND SYSTEMS BIOLOGY (BSB), 2018, : 97 - 102
  • [5] Deep learning tools are top performers in long non-coding RNA prediction
    Ammunet, Tea
    Wang, Ning
    Khan, Sofia
    Elo, Laura L.
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2022, 21 (03) : 230 - 241
  • [6] Evaluation of deep learning in non-coding RNA classification
    Amin, Noorul
    McGrath, Annette
    Chen, Yi-Ping Phoebe
    NATURE MACHINE INTELLIGENCE, 2019, 1 (05) : 246 - 256
  • [7] Evaluation of deep learning in non-coding RNA classification
    Noorul Amin
    Annette McGrath
    Yi-Ping Phoebe Chen
    Nature Machine Intelligence, 2019, 1 : 246 - 256
  • [8] Identification of long non-coding RNA in the horse transcriptome
    Scott, E. Y.
    Mansour, T.
    Bellone, R. R.
    Brown, C. T.
    Mienaltowski, M. J.
    Penedo, M. C.
    Ross, P. J.
    Valberg, S. J.
    Murray, J. D.
    Finno, C. J.
    BMC GENOMICS, 2017, 18
  • [9] Identification of long non-coding RNA in the horse transcriptome
    E. Y. Scott
    T. Mansour
    R. R. Bellone
    C. T. Brown
    M. J. Mienaltowski
    M. C. Penedo
    P. J. Ross
    S. J. Valberg
    J. D. Murray
    C. J. Finno
    BMC Genomics, 18
  • [10] Identification of non-coding RNA
    Gaspin, C
    Thébault, P
    BIOFUTUR, 2005, (251) : 33 - 36