ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction

被引:6
|
作者
Chen, Kai [1 ,2 ]
Zhu, Xiaodong [1 ,2 ,3 ]
Wang, Jiahao [1 ,2 ]
Hao, Lei [1 ,2 ]
Liu, Zhen [3 ,4 ]
Liu, Yuanning [1 ,2 ,3 ]
机构
[1] Jilin Univ, Coll Software, Changchun 130012, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[4] Nagasaki Inst Appl Sci, Grad Sch Engn, 536 Aba Machi, Nagasaki 8510193, Japan
关键词
ncRNAs family; Dynamic Bi-GRU; DenseNet; ncDENSE; NCRNAS;
D O I
10.1186/s12859-023-05191-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Although research on non-coding RNAs (ncRNAs) is a hot topic in life sciences, the functions of numerous ncRNAs remain unclear. In recent years, researchers have found that ncRNAs of the same family have similar functions, therefore, it is important to accurately predict ncRNAs families to identify their functions. There are several methods available to solve the prediction problem of ncRNAs family, whose main ideas can be divided into two categories, including prediction based on the secondary structure features of ncRNAs, and prediction according to sequence features of ncRNAs. The first type of prediction method requires a complicated process and has a low accuracy in obtaining the secondary structure of ncRNAs, while the second type of method has a simple prediction process and a high accuracy, but there is still room for improvement. The existing methods for ncRNAs family prediction are associated with problems such as complicated prediction processes and low accuracy, in this regard, it is necessary to propose a new method to predict the ncRNAs family more perfectly.Results: A deep learning model-based method, ncDENSE, was proposed in this study, which predicted ncRNAs families by extracting ncRNAs sequence features. The bases in ncRNAs sequences were encoded by one-hot coding and later fed into an ensemble deep learning model, which contained the dynamic bi-directional gated recurrent unit (Bi-GRU), the dense convolutional network (DenseNet), and the Attention Mechanism (AM). To be specific, dynamic Bi-GRU was used to extract contextual feature information and capture long-term dependencies of ncRNAs sequences. AM was employed to assign different weights to features extracted by Bi-GRU and focused the attention on information with greater weights. Whereas DenseNet was adopted to extract local feature information of ncRNAs sequences and classify them by the full connection layer. According to our results, the ncDENSE method improved the Accuracy, Sensitivity, Precision, F-score, and MCC by 2.08%, 2.33%, 2.14%, 2.16% , and 2.39%, respectively, compared with the suboptimal method.Conclusions: Overall, the ncDENSE method proposed in this paper extracts sequence features of ncRNAs by dynamic Bi-GRU and DenseNet and improves the accuracy in predicting ncRNAs family and other data.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Computational RNomics: Structure identification and functional prediction of non-coding RNAs in silico
    LingLing Zheng
    LiangHu Qu
    Science China Life Sciences, 2010, 53 : 548 - 562
  • [22] Computational RNomics:Structure identification and functional prediction of non-coding RNAs in silico
    ZHENG LingLing QU LiangHu Key Laboratory of Gene Engineering of the Ministry of EducationState Key Laboratory for BiocontrolSun Yatsen UniversityGuangzhou China
    Science China(Life Sciences), 2010, 53 (05) : 548 - 562
  • [23] Computational RNomics:Structure identification and functional prediction of non-coding RNAs in silico
    ZHENG LingLing & QU LiangHu Key Laboratory of Gene Engineering of the Ministry of Education
    Science China(Life Sciences) , 2010, (05) : 548 - 562
  • [24] Computational RNomics: Structure identification and functional prediction of non-coding RNAs in silico
    Zheng LingLing
    Qu LiangHu
    SCIENCE CHINA-LIFE SCIENCES, 2010, 53 (05) : 548 - 562
  • [25] lncDML: Identification of long non-coding RNAs by Deep Metric Learning
    Zhao, Pengfei
    Peng, Qinke
    Zhu, Zhibo
    Han, Tian
    Dong, Rida
    Huang, Huijun
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1808 - 1812
  • [26] From Structure Prediction to Genomic Screens for Novel Non-Coding RNAs
    Gorodkin, Jan
    Hofacker, Ivo L.
    PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (08)
  • [27] NON-CODING RNAS A new member of the family
    David, Rachel
    NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2012, 13 (11) : 686 - 686
  • [28] Prediction of Non-coding RNAs as Drug Targets
    Jiang, Wei
    Lv, Yingli
    Wang, Shuyuan
    NON-CODING RNAS IN COMPLEX DISEASES: A BIOINFORMATICS PERSPECTIVE, 2018, 1094 : 109 - 115
  • [29] DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding
    Zeng, Min
    Wu, Yifan
    Lu, Chengqian
    Zhang, Fuhao
    Wu, Fang-Xiang
    Li, Min
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [30] LncDLSM: Identification of Long Non-Coding RNAs With Deep Learning-Based Sequence Model
    Wang, Ying
    Zhao, Pengfei
    Du, Hongkai
    Cao, Yingxin
    Peng, Qinke
    Fu, Laiyi
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (04) : 2117 - 2127