m7GDisAI: N7-methylguanosine (m7G) sites and diseases associations inference based on heterogeneous network

Cited by: 15
Authors
Ma, Jiani [1, 2]
Zhang, Lin [1, 2]
Chen, Jin [1, 2]
Song, Bowen [3]
Zang, Chenxuan [3]
Liu, Hui [1, 2]
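Affiliations
[1] China University of Mining and Technology, Engineering Research Center of Intelligent Control for Underground Space, Ministry of Education, Xuzhou 221116, Jiangsu, People's Republic of China
[2] China University of Mining and Technology, School of Information and Control Engineering, Xuzhou 221116, Jiangsu, People's Republic of China
[3] Xi'an Jiaotong-Liverpool University, AI University Research Centre, Department of Biological Sciences, Suzhou 215123, People's Republic of China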
Funding
National Natural Science Foundation of China
Keywords
m(7)G site; Heterogeneous network; Matrix decomposition; MESSENGER-RNA; OVARIAN-CANCER; SEMANTIC SIMILARITY; CAP STRUCTURE; EXPRESSION; MUTATION; FAMILY; TRANSLATION; STABILITY; PHENOTYPE;
DOI
10.1186/s12859-021-04007-9
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: Recent studies have confirmed that N7-methylguanosine (m(7)G) modification plays an important role in regulating various biological processes and is associated with multiple diseases. Wet-lab experiments are costly and time-consuming for identifying disease-associated m(7)G sites. To date, tens of thousands of m(7)G sites have been identified by high-throughput sequencing approaches, and this information is publicly available in bioinformatics databases, where it can be leveraged to predict potential disease-associated m(7)G sites computationally. Computational methods for m(7)G-disease association prediction are therefore urgently needed, but none is currently available.
Results: To fill this gap, we collected association information between m(7)G sites and diseases, genomic information on m(7)G sites, and phenotypic information on diseases from different databases to build an m(7)G-disease association dataset. To infer potential disease-associated m(7)G sites, we then proposed a heterogeneous network-based model, the m(7)G Sites and Diseases Associations Inference (m(7)GDisAI) model. m(7)GDisAI predicts potential disease-associated m(7)G sites by applying a matrix decomposition method to heterogeneous networks that integrate comprehensive similarity information on m(7)G sites and diseases. To evaluate prediction performance, 10 runs of tenfold cross-validation were first conducted, in which m(7)GDisAI achieved the highest AUC of 0.740 (± 0.0024). Global and local leave-one-out cross-validation (LOOCV) experiments were then carried out to evaluate the model's accuracy in global and local settings, respectively. An AUC of 0.769 was achieved in global LOOCV and 0.635 in local LOOCV. Finally, a case study was conducted to identify the most promising ovarian cancer-related m(7)G sites for further functional analysis. Gene Ontology (GO) enrichment analysis was performed to explore the associations between the host genes of m(7)G sites and GO terms. The results showed that the disease-associated m(7)G sites identified by m(7)GDisAI and their host genes are consistently related to the pathogenesis of ovarian cancer, which may provide clues to disease pathogenesis.
Conclusion: The m(7)GDisAI web server is available online and provides a user-friendly interface for querying disease-associated m(7)G sites. The list of the top 20 m(7)G sites predicted to be associated with 177 diseases can be retrieved. Detailed information about specific m(7)G sites and diseases is also shown.
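The difference between the global and local LOOCV settings mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes the convention common in association-prediction work: a held-out known pair is ranked against all unknown pairs (global) or only against unknown pairs of the same disease (local). The abstract does not spell out the exact protocol, so this is an assumption, and the toy matrices `assoc` and `scores` and the helper names are hypothetical. A real evaluation would also retrain the model after hiding each pair, whereas the scores here are fixed for illustration.

```python
def auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def candidates(assoc, disease, scope):
    """Unknown (site, disease) pairs that a held-out known pair is ranked against."""
    n_sites, n_diseases = len(assoc), len(assoc[0])
    if scope == "global":      # every unknown pair in the whole matrix
        pairs = [(i, j) for i in range(n_sites) for j in range(n_diseases)]
    elif scope == "local":     # only unknown pairs for the same disease
        pairs = [(i, disease) for i in range(n_sites)]
    else:
        raise ValueError("scope must be 'global' or 'local'")
    return [(i, j) for (i, j) in pairs if assoc[i][j] == 0]


def heldout_auc(assoc, scores, site, disease, scope):
    """AUC of one held-out known pair against its candidate set."""
    negatives = [scores[i][j] for (i, j) in candidates(assoc, disease, scope)]
    return auc([scores[site][disease]], negatives)


# Hypothetical toy data: 4 sites (rows) x 3 diseases (columns).
assoc = [[1, 0, 0],
         [0, 1, 0],
         [0, 0, 1],
         [1, 0, 0]]
# Hypothetical model scores (not produced by m(7)GDisAI).
scores = [[0.9, 0.2, 0.1],
          [0.3, 0.8, 0.4],
          [0.2, 0.6, 0.7],
          [0.5, 0.1, 0.3]]

print(heldout_auc(assoc, scores, 3, 0, "global"))
print(heldout_auc(assoc, scores, 3, 0, "local"))
```

Because the two settings rank the same held-out pair against different candidate sets, global and local AUC values can differ, as the two figures reported in the abstract do.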
Pages: 16
Related papers
50 in total
  • [21] Integrative pan-cancer analysis and clinical characterization of the N7-methylguanosine (m7G) RNA modification regulators in human cancers
    He, Chun-Ming
    Zhang, Xin-Di
    Zhu, Song-Xin
    Zheng, Jia-Jie
    Wang, Yu-Ming
    Wang, Qing
    Yin, Hang
    Fu, Yu-Jie
    Xue, Song
    Tang, Jian
    Zhao, Xiao-Jing
    FRONTIERS IN GENETICS, 2022, 13
  • [22] BERT-m7G: A Transformer Architecture Based on BERT and Stacking Ensemble to Identify RNA N7-Methylguanosine Sites from Sequence Information
    Zhang, Lu
    Qin, Xinyi
    Liu, Min
    Liu, Guangzhong
    Ren, Yuxiao
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [23] Sia-m7G: Predicting m7G Sites through the Siamese Neural Network with an Attention Mechanism
    Zheng, Jia
    Zhou, Yetong
    CURRENT BIOINFORMATICS, 2024, 19 (10) : 953 - 962
  • [24] TMSC-m7G: A transformer architecture based on multi-sense-scaled embedding features and convolutional neural network to identify RNA N7-methylguanosine sites
    Zhang, Shengli
    Xu, Yujie
    Liang, Yunyun
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 129 - 139
  • [25] Identifying N7-methylguanosine sites by integrating multiple features
    Zou, Hongliang
    Yang, Fan
    Yin, Zhijian
    BIOPOLYMERS, 2022, 113 (02)
  • [26] CAP-m7G: A capsule network-based framework for specific RNA N7-methylguanosine site identification using image encoding and reconstruction layers
    Xie, Peilin
    Guan, Jiahui
    He, Xuxin
    Zhao, Zhihao
    Guo, Yilin
    Sun, Zhenglong
    Yao, Lantian
    Lee, Tzong-Yi
    Chiang, Ying-Chih
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2025, 27 : 804 - 812
  • [27] An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP
    Bi, Yue
    Xiang, Dongxu
    Ge, Zongyuan
    Li, Fuyi
    Jia, Cangzhi
    Song, Jiangning
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2020, 22 : 362 - 372
  • [28] Prediction of N7-methylguanosine sites in human RNA based on optimal sequence features
    Yang, Yu-He
    Ma, Chi
    Wang, Jia-Shu
    Yang, Hui
    Ding, Hui
    Han, Shu-Guang
    Li, Yan-Wen
    GENOMICS, 2020, 112 (06) : 4342 - 4347
  • [29] No Evidence for N7-Methylation of Guanosine (m7G) in Human let-7e
    Vinther, Jeppe
    MOLECULAR CELL, 2020, 79 (02) : 199 - 200
  • [30] Comprehensive analysis of m7G modification patterns based on potential m7G regulators and tumor microenvironment infiltration characterization in lung adenocarcinoma
    Ma, Shouzheng
    Zhu, Jun
    Wang, Mengmeng
    Zhu, Jianfei
    Wang, Wenchen
    Xiong, Yanlu
    Jiang, Runmin
    Liu, Lei
    Jiang, Tao
    FRONTIERS IN GENETICS, 2022, 13