ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction

Cited by: 6
Authors
Chen, Kai [1 ,2 ]
Zhu, Xiaodong [1 ,2 ,3 ]
Wang, Jiahao [1 ,2 ]
Hao, Lei [1 ,2 ]
Liu, Zhen [3 ,4 ]
Liu, Yuanning [1 ,2 ,3 ]
Affiliations
[1] Jilin Univ, Coll Software, Changchun 130012, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[4] Nagasaki Inst Appl Sci, Grad Sch Engn, 536 Aba Machi, Nagasaki 8510193, Japan
Keywords
ncRNAs family; Dynamic Bi-GRU; DenseNet; ncDENSE
DOI
10.1186/s12859-023-05191-6
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Background: Although non-coding RNAs (ncRNAs) are a hot topic in the life sciences, the functions of many ncRNAs remain unclear. In recent years, researchers have found that ncRNAs of the same family have similar functions, so accurately predicting ncRNA families is important for identifying their functions. Existing methods for ncRNA family prediction fall into two categories: prediction based on the secondary structure features of ncRNAs, and prediction based on their sequence features. Methods of the first type require a complicated process, and the secondary structures they rely on are obtained with low accuracy. Methods of the second type have a simple prediction process and high accuracy, but still leave room for improvement. Because existing methods suffer from complicated prediction processes and low accuracy, a new method is needed to predict ncRNA families more accurately.

Results: This study proposes ncDENSE, a deep learning method that predicts ncRNA families by extracting features from ncRNA sequences. The bases in each sequence are one-hot encoded and fed into an ensemble deep learning model comprising a dynamic bi-directional gated recurrent unit (Bi-GRU), a dense convolutional network (DenseNet), and an Attention Mechanism (AM). Specifically, the dynamic Bi-GRU extracts contextual feature information and captures long-term dependencies in the ncRNA sequences. The AM assigns different weights to the features extracted by the Bi-GRU and focuses attention on the information with greater weights. DenseNet extracts local feature information from the sequences, which are then classified by a fully connected layer. Compared with the second-best method, ncDENSE improved Accuracy, Sensitivity, Precision, F-score, and MCC by 2.08%, 2.33%, 2.14%, 2.16%, and 2.39%, respectively.

Conclusions: Overall, the ncDENSE method proposed in this paper extracts sequence features of ncRNAs with a dynamic Bi-GRU and DenseNet, and improves accuracy in predicting ncRNA families and on other data.
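The first two steps in the abstract, one-hot encoding of bases and attention weighting, can be illustrated with a minimal sketch. This is not the authors' implementation. The four-letter alphabet `ACGU`, the all-zero vector for unknown symbols, the function names `one_hot_encode` and `attention_pool`, and the hand-picked scores are all assumptions made for illustration. In the described model the attention weights are applied to Bi-GRU outputs and are learned; here the one-hot vectors simply stand in for those features.

```python
import math

# Assumed alphabet; the abstract does not specify how bases are ordered
# or how ambiguous symbols are handled.
BASES = "ACGU"


def one_hot_encode(seq):
    """Map each base to a 4-dimensional indicator vector.

    Symbols outside BASES (for example N) become all-zero vectors,
    which is an assumption of this sketch.
    """
    index = {base: i for i, base in enumerate(BASES)}
    encoded = []
    for base in seq.upper().replace("T", "U"):
        row = [0.0] * len(BASES)
        if base in index:
            row[index[base]] = 1.0
        encoded.append(row)
    return encoded


def attention_pool(features, scores):
    """Softmax the per-position scores, then return the weights and the
    weighted sum of the feature vectors."""
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(features[0])
    pooled = [
        sum(w * vec[d] for w, vec in zip(weights, features))
        for d in range(dim)
    ]
    return weights, pooled


# Hypothetical example: five bases, with the third position scored highest.
encoded = one_hot_encode("GGCUA")
weights, pooled = attention_pool(encoded, [0.0, 0.0, 1.0, 0.0, 0.0])
```

The softmax ensures the weights sum to one, so positions with higher scores contribute more to the pooled vector, which is the sense in which attention "focuses" on information with greater weights.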
Pages: 20