A survey on identification and quantification of alternative polyadenylation sites from RNA-seq data

Cited by: 25
Authors
Chen, Moliang [1]
Ji, Guoli [1,2]
Fu, Hongjuan [1]
Lin, Qianmin [3]
Ye, Congting [4]
Ye, Wenbin [1]
Su, Yaru [5]
Wu, Xiaohui [1,6]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China
[2] Xiamen Res Inst, Xiamen, Peoples R China
[3] Xiamen Univ, Xiangan Hosp, Xiamen, Peoples R China
[4] Xiamen Univ, Coll Environm & Ecol, Xiamen, Peoples R China
[5] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China
[6] Xiamen Res Inst, Natl Ctr Healthcare Big Data, Xiamen, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
alternative polyadenylation; RNA-seq; 3' untranslated region; benchmark; predictive modeling; 3' UNTRANSLATED REGIONS; CHANGE-POINT MODEL; GENE-EXPRESSION; MESSENGER-RNAS; POLY(A) SITES; CLEAVAGE; REVEALS; WIDESPREAD; MECHANISMS; DYNAMICS;
DOI
10.1093/bib/bbz068
Chinese Library Classification
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Alternative polyadenylation (APA) has been implicated in post-transcriptional regulation: by modulating mRNA abundance, stability, localization and translation, it contributes considerably to transcriptome diversity and gene expression regulation. RNA-seq has become a routine approach for transcriptome profiling, generating an unprecedented amount of data that can be used to identify and quantify APA site usage. A number of computational approaches for identifying APA sites and/or dynamic APA events from RNA-seq data have emerged in the literature; they provide valuable yet preliminary results that should be refined to yield credible guidelines for the scientific community. In this review, we provide a comprehensive overview of the currently available computational approaches. We also conducted an objective benchmarking analysis, using RNA-seq data sets from different species (human, mouse and Arabidopsis) and simulated data sets, to systematically evaluate 11 representative methods. Our benchmarking study showed that the overall performance of all tools investigated is moderate, indicating that there is still considerable scope to improve the prediction of APA sites and dynamic APA events from RNA-seq data. In particular, prediction results from individual tools differ considerably, and only a limited number of predicted APA sites or genes are common among different tools. Accordingly, we offer advice on how to assess the reliability of the obtained results. We also propose practical recommendations on which method suits which scenario, and discuss implications and future directions relevant to profiling APA from RNA-seq data.
Pages: 1261-1276
Number of pages: 16