ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species

被引:13
作者
Chen, Ruyi [1 ,2 ]
Li, Fuyi [1 ]
Guo, Xudong [1 ]
Bi, Yue [3 ,4 ]
Li, Chen [3 ,4 ]
Pan, Shirui [5 ]
Coin, Lachlan J. M. [2 ]
Song, Jiangning [3 ,4 ,6 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, Xianyang 712100, Shaanxi, Peoples R China
[2] Univ Melbourne, Peter Doherty Inst Infect & Immun, Melbourne, Vic 3000, Australia
[3] Monash Univ, Biomed Discovery Inst, Melbourne, Vic 3800, Australia
[4] Monash Univ, Dept Biochem & Mol Biol, Melbourne, Vic 3800, Australia
[5] Griffith Univ, Sch Informat & Commun Technol, Nathan, Qld 4222, Australia
[6] Monash Univ, Monash Data Futures Inst, Melbourne, Australia
关键词
machine learning; ensemble learning; RNA modification; A-to-I editing; feature selection; ADENOSINE; DATABASE; RETENTION;
D O I
10.1093/bib/bbad170
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A-to-I editing is the most prevalent RNA editing event, which refers to the change of adenosine (A) bases to inosine (I) bases in double-stranded RNAs. Several studies have revealed that A-to-I editing can regulate cellular processes and is associated with various human diseases. Therefore, accurate identification of A-to-I editing sites is crucial for understanding RNA-level (i.e. transcriptional) modifications and their potential roles in molecular functions. To date, various computational approaches for A-to-I editing site identification have been developed; however, their performance is still unsatisfactory and needs further improvement. In this study, we developed a novel stacked-ensemble learning model, ATTIC (A-To-I ediTing predICtor), to accurately identify A-to-I editing sites across three species, including Homo sapiens, Mus musculus and Drosophila melanogaster. We first comprehensively evaluated 37 RNA sequence-derived features combined with 14 popular machine learning algorithms. Then, we selected the optimal base models to build a series of stacked ensemble models. The final ATTIC framework was developed based on the optimal models improved by the feature selection strategy for specific species. Extensive cross-validation and independent tests illustrate that ATTIC outperforms stateof-the-art tools for predicting A-to-I editing sites. We also developed a web server for ATTIC, which is publicly available at http://web. unimelb-bioinfortools.cloud.edu.au/ATTIC/. We anticipate that ATTIC can be utilized as a useful tool to accelerate the identification of A-to-I RNA editing events and help characterize their roles in post-transcriptional regulation.
引用
收藏
页数:15
相关论文
共 75 条
[1]  
Abhishek L, 2020, 2020 INT C EMERGING, P1, DOI [10.1109/INCET49848.2020.9154050, DOI 10.1109/INCET49848.2020.9154050]
[2]   EPAI-NC: Enhanced prediction of adenosine to inosine RNA editing sites using nucleotide compositions [J].
Ahmad, Ahsan ;
Shatabda, Swakkhar .
ANALYTICAL BIOCHEMISTRY, 2019, 569 :16-21
[3]   The majority of transcripts in the squid nervous system are extensively recoded by A-to-I RNA editing [J].
Alon, Shahar ;
Garrett, Sandra C. ;
Levanon, Erez Y. ;
Olson, Sara ;
Graveley, Brenton R. ;
Rosenthal, Joshua J. C. ;
Eisenberg, Eli .
ELIFE, 2015, 4
[4]   A-to-I RNA editing: A new regulatory mechanism of global gene expression [J].
Amariglio, Ninette ;
Rechavi, Gideon .
BLOOD CELLS MOLECULES AND DISEASES, 2007, 39 (02) :151-155
[5]   STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction [J].
Basith, Shaherin ;
Lee, Gwang ;
Manavalan, Balachandran .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
[6]   Altered Nuclear Retention of mRNAs Containing Inverted Repeats in Human Embryonic Stem Cells: Functional Role of a Nuclear Noncoding RNA [J].
Chen, Ling-Ling ;
Carmichael, Gordon G. .
MOLECULAR CELL, 2009, 35 (04) :467-478
[7]  
Chen TY, 2016, SCI REP-UK, V6, DOI [10.1038/srep33486, 10.1038/srep20001, 10.1038/srep24253, 10.1038/srep20335]
[8]   iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites [J].
Chen, Wei ;
Feng, Pengmian ;
Yang, Hui ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 11 :468-474
[9]   iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences [J].
Chen, Wei ;
Feng, Pengmian ;
Yang, Hui ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
ONCOTARGET, 2017, 8 (03) :4208-4217
[10]   Identification and analysis of the N6-methyladenosine in the Saccharomyces cerevisiae transcriptome [J].
Chen, Wei ;
Tran, Hong ;
Liang, Zhiyong ;
Lin, Hao ;
Zhang, Liqing .
SCIENTIFIC REPORTS, 2015, 5