A deep learning method for lincRNA detection using auto-encoder algorithm

被引:21
|
作者
Yu, Ning [1 ]
Yu, Zeng [2 ]
Pan, Yi [3 ]
机构
[1] SUNY Coll Brockport, Dept Comp Sci, 350 New Campus Dr, Brockport, NY 14420 USA
[2] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Sichuan, Peoples R China
[3] Georgia State Univ, Dept Comp Sci, 25 Pk Pl, Atlanta, GA 30303 USA
来源
BMC BIOINFORMATICS | 2017年 / 18卷
关键词
Deep learning; Long intergenic non-coding RNA (lincRNA); Auto-encoder; Transcription sites; RNA-seq; Knowledge-based discovery;
D O I
10.1186/s12859-017-1922-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. Results: The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. Conclusions: The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A deep learning method for lincRNA detection using auto-encoder algorithm
    Ning Yu
    Zeng Yu
    Yi Pan
    BMC Bioinformatics, 18
  • [2] A Deep Learning Method for lincRNA Identification Using Auto-encoder Algorithm
    Yu, Ning
    Yu, Zeng
    Pan, Yi
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2016,
  • [3] Deep Representation Learning for Code Smells Detection using Variational Auto-Encoder
    Hadj-Kacem, Mouna
    Bouassida, Nadia
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] A Deep Learning Method Based on Hybrid Auto-Encoder Model
    Yang, ZhenYu
    Jing, Hui
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 1100 - 1104
  • [5] Intrusion detection using deep sparse auto-encoder and self-taught learning
    Aqsa Saeed Qureshi
    Asifullah Khan
    Nauman Shamim
    Muhammad Hanif Durad
    Neural Computing and Applications, 2020, 32 : 3135 - 3147
  • [6] Intrusion detection using deep sparse auto-encoder and self-taught learning
    Qureshi, Aqsa Saeed
    Khan, Asifullah
    Shamim, Nauman
    Durad, Muhammad Hanif
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 3135 - 3147
  • [7] Online deep learning based on auto-encoder
    Zhang, Si-si
    Liu, Jian-wei
    Zuo, Xin
    Lu, Run-kun
    Lian, Si-ming
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5420 - 5439
  • [8] Online deep learning based on auto-encoder
    Si-si Zhang
    Jian-wei Liu
    Xin Zuo
    Run-kun Lu
    Si-ming Lian
    Applied Intelligence, 2021, 51 : 5420 - 5439
  • [9] A road segmentation method based on the deep auto-encoder with supervised learning
    Song, Xiaona
    Rui, Ting
    Zhang, Sai
    Fei, Jianchao
    Wang, Xinqing
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 68 : 381 - 388
  • [10] Designing online network intrusion detection using deep auto-encoder Q-learning
    Kim, Chayoung
    Park, JiSu
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 79