A Novel Integrative Approach for Non-coding RNA Classification Based on Deep Learning

Cited by: 8
Authors
Boukelia, Abdelbasset [1, 2, 3]
Boucheham, Anouar [4]
Belguidou, Meriem [1]
Batouche, Mohamed [5]
Zehraoui, Farida [6]
Tahi, Fariza [6]
Affiliations
[1] Univ Abdelhamid Mehri Constantine 2, Fac NTIC, Comp Sci Dept, Constantine 25000, Algeria
[2] Natl Ctr Biotechnol Res, Bioinformat Unit, Constantine, Algeria
[3] Res Ctr Sci & Tech Informat, Algiers, Algeria
[4] Univ Salah Boubnider Constantine 3, Constantine 25000, Algeria
[5] Princess Nourah Univ, CCIS RC, IT Dept, Riyadh, Saudi Arabia
[6] Univ Paris Saclay, Univ Evry, IBISC, Evry, France
Keywords
Multisource deep-learning; ncRNA classification; epigenetics; biomarkers; features pattern extraction; optimization; SECONDARY STRUCTURES; GENOME; TRANSCRIPTS; DISEASE;
DOI
10.2174/1574893614666191105160633
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Molecular biomarkers offer new ways to understand many disease processes. Non-coding RNAs (ncRNAs), as biomarkers, play a crucial role in several cellular activities that are highly correlated with many human diseases, especially cancer. The classification and identification of ncRNAs have become a critical issue because of their application as biomarkers in many human diseases. Objective: Most existing computational tools for ncRNA classification classify only one type of ncRNA. They are based on structural information or specific known features. Furthermore, these tools suffer from a lack of significant and validated features. Therefore, the performance of these methods is not always satisfactory. Methods: We propose a novel approach named imCnC for ncRNA classification based on multisource deep learning, which integrates several data sources, such as genomic and epigenomic data, to identify several ncRNA types. We also propose an optimization technique to visualize the feature patterns extracted from the multisource CNN model in order to measure the epigenomic features of each ncRNA type. Results: Computational results on a dataset of 16 human ncRNA classes downloaded from Rfam show that imCnC outperforms the existing tools, achieving an accuracy of 94.18%. In addition, our method enables the discovery of new ncRNA features by using an optimization technique to measure and visualize the feature patterns of the imCnC classifier.
Pages: 338-348
Number of pages: 11