M6AMRFS: Robust Prediction of N6-Methyladenosine Sites With Sequence-Based Features in Multiple Species

被引:92
|
作者
Qiang, Xiaoli [1 ]
Chen, Huangrong [2 ]
Ye, Xiucai [3 ]
Su, Ran [4 ]
Wei, Leyi [2 ]
机构
[1] Guangzhou Univ, Inst Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[3] Univ Tsukuba, Dept Comp Sci, Tsukuba, Ibaraki, Japan
[4] Tianjin Univ, Sch Software, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
N6-methyladenosine site; eXtreme Gradient Boosting; machine learning; feature representation; RNA methylation; feature selection; WEB SERVER; N-6-METHYLADENOSINE SITES; SACCHAROMYCES-CEREVISIAE; RNA METHYLATION; M(6)A SITES; TOOL; IDENTIFICATION; MICRORNAS; CASCLEAVE; CASPASE;
D O I
10.3389/fgene.2018.00495
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As one of the well-studied RNA methylation modifications, N6-methyladenosine (m(6)A) plays important roles in various biological progresses, such as RNA splicing and degradation, etc. Identification of m(6)A sites is fundamentally important for better understanding of their functional mechanisms. Recently, machine learning based prediction methods have emerged as an effective approach for fast and accurate identification of m(6)A sites. In this paper, we proposed "M6AMRFS", a new machine learning based predictor for the identification of m(6)A sites. In this predictor, we exploited a new feature representation algorithm to encode RNA sequences with two feature descriptors (dinucleotide binary encoding and Local position-specific dinucleotide frequency), and used the F-score algorithm combined with SFS (Sequential Forward Search) to enhance the feature representation ability. To predict m(6)A sites, we employed the eXtreme Gradient Boosting (XGBoost) algorithm to build a predictive model. Benchmarking results showed that the proposed predictor is competitive with the state-of-the art predictors. Importantly, robust predictions for multiple species by our predictor demonstrate that our predictive models have strong generalization ability. To the best of our knowledge, M6AMRFS is the first tool that can be used for the identification of m(6)A sites in multiple species. To facilitate the use of our predictor, we have established a user-friendly webserver with the implementation of M6AMRFS, which is currently available in http://server.malab.cn/M6AMRFS/. We anticipate that it will be a useful tool for the relevant research of m(6)A sites.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features
    Zhou, Yuan
    Zeng, Pan
    Li, Yan-Hui
    Zhang, Ziding
    Cui, Qinghua
    NUCLEIC ACIDS RESEARCH, 2016, 44 (10)
  • [2] M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
    Wei, Leyi
    Chen, Huangrong
    Su, Ran
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 12 : 635 - 644
  • [3] M6A-GSMS: Computational identification of N6-methyladenosine sites with GBDT and stacking learning in multiple species
    Zhang, Shengli
    Wang, Jinyue
    Li, Xinjie
    Liang, Yunyun
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2022, 40 (22): : 12380 - 12391
  • [4] Analysis approaches for the identification and prediction of N6-methyladenosine sites
    Yang, Yuwei
    Liu, Zhiyu
    Lu, Junru
    Sun, Yuqing
    Fu, Yue
    Pan, Min
    Xie, Xueying
    Ge, Qinyu
    EPIGENETICS, 2022,
  • [5] GR-m6A: Prediction of N6-methyladenosine sites in mammals with molecular graph and residual network✩
    Qiu, Shi
    Liu, Renxin
    Liang, Ying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [6] Integration of deep feature representations and handcrafted features to improve the prediction of N6-methyladenosine sites
    Wei, Leyi
    Su, Ran
    Wang, Bing
    Li, Xiuting
    Zou, Quan
    Gao, Xing
    NEUROCOMPUTING, 2019, 324 : 3 - 9
  • [7] Computational identification of N6-methyladenosine sites in multiple tissues of mammals
    Dao, Fu-Ying
    Lv, Hao
    Yang, Yu-He
    Zulfiqar, Hasan
    Gao, Hui
    Lin, Hao
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 1084 - 1091
  • [8] Role of N6-Methyladenosine (m6A) RNA Modification in Multiple Myeloma
    Bach, Christian
    Leffler, Magdalena
    Flamann, Cindy
    Kronke, Jan
    Mougiakakos, Dimitrios
    Mackensen, Andreas
    Bruns, Heiko
    BLOOD, 2018, 132
  • [9] Role of N6-methyladenosine (m6A) RNA modification in Multiple Myeloma
    Bach, C.
    Leffler, M.
    Kroenke, J.
    Saul, D.
    Mougiakakos, D.
    Mackensen, A.
    Bruns, H.
    ONCOLOGY RESEARCH AND TREATMENT, 2019, 42 : 123 - 123
  • [10] DeepM6ASeq-EL: prediction of human N6-methyladenosine (m6A) sites with LSTM and ensemble learning
    Chen, Juntao
    Zou, Quan
    Li, Jing
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (02)