Identifying N6-methyladenosine sites using extreme gradient boosting system optimized by particle swarm optimizer

被引:22
|
作者
Zhao, Xiaowei [1 ,2 ]
Zhang, Ye [1 ]
Ning, Qiao [1 ]
Zhang, Hongrui [1 ]
Ji, Jinchao [1 ]
Yin, Minghao [2 ]
机构
[1] Northeast Normal Univ, Sch Comp Sci & Informat Technol, Changchun 130117, Jilin, Peoples R China
[2] Northeast Normal Univ, Key Lab Intelligent Informat Proc Jilin Univ, Changchun 130117, Jilin, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
M(6)A sites; N-6-methyladenosine; Extreme gradient boosting; XGBoost; Particle swarm optimization; PREDICT SUBCELLULAR-LOCALIZATION; LYSINE SUCCINYLATION SITES; S-NITROSYLATION SITES; AMINO-ACID PAIRS; SACCHAROMYCES-CEREVISIAE; GENERAL-FORM; PROTEINS; PLOC; CLASSIFIER; ADENOSINE;
D O I
10.1016/j.jtbi.2019.01.035
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
N-6-methyladenosine (m(6)A) is the one of the most important RNA modifications, playing the role of splicing events, mRNA exporting and stability to cell differentiation. Because of wide distribution of m(6)A in genes, identification of m(6)A sites in RNA sequences has significant importance for basic biomedical research and drug development. High-throughput laboratory methods are time consuming and costly. Nowadays, effective computational methods are much desirable because of its convenience and fast speed. Thus, in this article, we proposed a new method to improve the performance of the m(6)A prediction by using the combined features of deep features and original features with extreme gradient boosting optimized by particle swarm optimization (PXGB). The proposed PXGB algorithm uses three kinds of features, i.e., position-specific nucleotide propensity (PSNP), position-specific dinucleotide propensity (PSDP), and the traditional nucleotide composition (NC). By 10-fold cross validation, the performance of PXGB was measured with an AUC of 0.8390 and an MCC of 0.5234. Additionally, PXGB was compared with the existing methods, and the higher MCC and AUC of PXGB demonstrated that PXGB was effective to predict m(6)A sites. The predictor proposed in this study might help to predict more m(6)A sites and guide related experimental validation. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:39 / 47
页数:9
相关论文
共 40 条
  • [1] Identifying N6-methyladenosine sites in the Arabidopsis thaliana transcriptome
    Wei Chen
    Pengmian Feng
    Hui Ding
    Hao Lin
    Molecular Genetics and Genomics, 2016, 291 : 2225 - 2229
  • [2] Identifying RNA N6-Methyladenosine Sites in Escherichia coli Genome
    Zhang, Jidong
    Feng, Pengmian
    Lin, Hao
    Chen, Wei
    FRONTIERS IN MICROBIOLOGY, 2018, 9
  • [3] iRNA-Methyl: Identifying N6-methyladenosine sites using pseudo nucleotide composition
    Chen, Wei
    Feng, Pengmian
    Ding, Hui
    Lin, Hao
    Chou, Kuo-Chen
    ANALYTICAL BIOCHEMISTRY, 2015, 490 : 26 - 33
  • [4] iRNA(m6A)-PseDNC: Identifying N6-methyladenosine sites using pseudo dinucleotide composition
    Chen, Wei
    Ding, Hui
    Zhou, Xu
    Lin, Hao
    Chou, Kuo-Chen
    ANALYTICAL BIOCHEMISTRY, 2018, 561 : 59 - 65
  • [5] Comparison and Analysis of Computational Methods for Identifying N6-Methyladenosine Sites in Saccharomyces cerevisiae
    Feng, Pengmian
    Feng, Lijing
    Tang, Chaohui
    CURRENT PHARMACEUTICAL DESIGN, 2021, 27 (09) : 1219 - 1229
  • [6] Identifying N6-Methyladenosine Sites in HepG2 Cell Lines Using Oxford Nanopore Technology
    Arzumanian, Viktoriia A.
    Kurbatov, Ilya Y.
    Ptitsyn, Konstantin G.
    Khmeleva, Svetlana A.
    Kurbatov, Leonid K.
    Radko, Sergey P.
    Poverennaya, Ekaterina V.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (22)
  • [7] A Linear Regression Predictor for Identifying N6-Methyladenosine Sites Using Frequent Gapped K-mer Pattern
    Zhuang, Y. Y.
    Liu, H. J.
    Song, X.
    Ju, Y.
    Peng, H.
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2019, 18 : 673 - 680
  • [8] M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
    Wei, Leyi
    Chen, Huangrong
    Su, Ran
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 12 : 635 - 644
  • [9] Detecting N6-methyladenosine sites from RNA transcriptomes using random forest
    Khan A.
    Rehman H.U.
    Habib U.
    Ijaz U.
    Rehman, Hafeez Ur (hafeez.urrehman@nu.edu.pk), 1600, Elsevier B.V. (47):
  • [10] Detecting N6-methyladenosine sites from RNA transcriptomes using random forest
    Khan, Asad
    Rehman, Hafeez Ur
    Habib, Usman
    Ijaz, Umer
    JOURNAL OF COMPUTATIONAL SCIENCE, 2020, 47