H2Opred: a robust and efficient hybrid deep learning model for predicting 2'-O-methylation sites in human RNA

Cited by: 17
Authors
Pham, Nhat Truong [1]
Rakkiyapan, Rajan [2]
Park, Jongsun [3]
Malik, Adeel [4, 6]
Manavalan, Balachandran [5]
Affiliations
[1] Sungkyunkwan Univ, Dept Integrat Biotechnol, Computat Biol & Bioinformat Lab, Suwon, South Korea
[2] Bharathiar Univ, Dept Math, Coimbatore, Tamil Nadu, India
[3] Infoboss Inc, Seoul, South Korea
[4] Sangmyung Univ, Inst Intelligence Informat Technol, Seoul, South Korea
[5] Sungkyunkwan Univ, Coll Biotechnol & Bioengn, Dept Integrat Biotechnol, Suwon 16419, Gyeonggi Do, South Korea
[6] Inst Intelligence Informat Technol, 20,Hongjimun 2 Gil, Seoul, South Korea
Funding
National Research Foundation, Singapore;
Keywords
2'-O-methylation sites; convolutional neural network; gated recurrent unit; hybrid deep learning; bioinformatics; natural language processing; MESSENGER-RNA; HIGH-THROUGHPUT; 2'-O METHYLATION; WEB SERVER; IDENTIFICATION; NUCLEOTIDE; RMBASE;
DOI
10.1093/bib/bbad476
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
2'-O-methylation (2OM) is the most common post-transcriptional modification of RNA. It plays a crucial role in RNA splicing, RNA stability and innate immunity. Despite advances in high-throughput detection, the chemical stability of 2OM makes it difficult to detect and map in messenger RNA. Therefore, bioinformatics tools have been developed using machine learning (ML) algorithms to identify 2OM sites. These tools have made significant progress, but their performance remains unsatisfactory and needs further improvement. In this study, we introduce H2Opred, a novel hybrid deep learning (HDL) model for accurately identifying 2OM sites in human RNA. Notably, this is the first application of HDL in developing four nucleotide-specific models [adenine (A2OM), cytosine (C2OM), guanine (G2OM) and uracil (U2OM)] as well as a generic model (N2OM). H2Opred incorporates both stacked 1D convolutional neural network (1D-CNN) blocks and stacked attention-based bidirectional gated recurrent unit (Bi-GRU-Att) blocks. The 1D-CNN blocks learn feature representations from 14 conventional descriptors, while the Bi-GRU-Att blocks learn feature representations from five natural language processing-based embeddings extracted from RNA sequences. H2Opred integrates these feature representations to make the final prediction. Rigorous cross-validation analysis demonstrated that H2Opred consistently outperforms conventional ML-based single-feature models on five different datasets. Moreover, the generic model of H2Opred performed well on both training and testing datasets, significantly outperforming the existing predictor and the other four nucleotide-specific H2Opred models. To enhance accessibility and usability, we have deployed a user-friendly web server for H2Opred, accessible at https://balalab-skku.org/H2Opred/. This platform will serve as a tool for accurately predicting 2OM sites within human RNA, thereby facilitating broader applications in relevant research endeavors.
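The abstract describes two kinds of input (conventional descriptors and NLP-style embeddings) and five models (A2OM, C2OM, G2OM, U2OM and the generic N2OM). As a rough illustration only, the following Python sketch shows how candidate sites might be prepared before reaching such models. It is not the authors' pipeline: the window size (`FLANK`), the padding character, the k-mer length and all function names are assumptions introduced here, and nucleotide composition merely stands in for the 14 descriptors, which the abstract does not list.

```python
from collections import Counter

# Model names come from the abstract; everything else below is hypothetical.
SPECIFIC_MODELS = {"A": "A2OM", "C": "C2OM", "G": "G2OM", "U": "U2OM"}
GENERIC_MODEL = "N2OM"
FLANK = 20  # assumed: 20 nt on each side of the candidate site (41-nt window)


def candidate_windows(rna, flank=FLANK, pad="N"):
    """Yield (position, center_base, window, specific_model) for every A/C/G/U.

    The sequence is padded so that sites near either end still get a
    full-length window centered on the candidate nucleotide.
    """
    rna = rna.upper().replace("T", "U")  # accept DNA-style input
    padded = pad * flank + rna + pad * flank
    for i, base in enumerate(rna):
        if base not in SPECIFIC_MODELS:
            continue  # skip ambiguous bases
        window = padded[i : i + 2 * flank + 1]
        yield i, base, window, SPECIFIC_MODELS[base]


def kmer_tokens(window, k=3):
    """Split a window into overlapping k-mers, the 'words' an embedding model sees."""
    return [window[j : j + k] for j in range(len(window) - k + 1)]


def nucleotide_composition(window):
    """One simple conventional descriptor: frequency of A, C, G, U in the window."""
    counts = Counter(window)
    return [counts[b] / len(window) for b in "ACGU"]
```

In this sketch, every window could be sent to the generic model (`GENERIC_MODEL`) as well as to the nucleotide-specific model named in the fourth tuple element. The k-mer tokens would feed an embedding step, and descriptors such as the composition vector would feed the convolutional branch.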
Pages: 13