Prediction of Poly(A) Sites by Poly(A) Read Mapping

被引:14
作者
Bonfert, Thomas [1 ]
Friedel, Caroline C. [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Inst Informat, Munich, Germany
关键词
3' UNTRANSLATED REGIONS; ALTERNATIVE POLYADENYLATION; RNA-SEQ; GENE-EXPRESSION; MESSENGER-RNAS; CLEAVAGE; LANDSCAPE; SEQUENCES; ALIGNMENT; TRANSCRIPTION;
D O I
10.1371/journal.pone.0170914
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA-seq reads containing part of the poly(A) tail of transcripts (denoted as poly(A) reads) provide the most direct evidence for the position of poly(A) sites in the genome. However, due to reduced coverage of poly(A) tails by reads, poly(A) reads are not routinely identified during RNA-seq mapping. Nevertheless, recent studies for several herpesviruses successfully employed mapping of poly(A) reads to identify herpesvirus poly(A) sites using different strategies and customized programs. To more easily allow such analyses without requiring additional programs, we integrated poly(A) read mapping and prediction of poly(A) sites into our RNA-seq mapping program ContextMap 2. The implemented approach essentially generalizes previously used poly(A) read mapping approaches and combines them with the context-based approach of ContextMap 2 to take into account information provided by other reads aligned to the same location. Poly(A) read mapping using ContextMap 2 was evaluated on real-life data from the ENCODE project and compared against a competing approach based on transcriptome assembly (KLEAT). This showed high positive predictive value for our approach, evidenced also by the presence of poly(A) signals, and considerably lower runtime than KLEAT. Although sensitivity is low for both methods, we show that this is in part due to a high extent of spurious results in the gold standard set derived from RNAPET data. Sensitivity improves for poly(A) sites of known transcripts or determined with a more specific poly(A) sequencing protocol and increases with read coverage on transcript ends. Finally, we illustrate the usefulness of the approach in a high read coverage scenario by a re-analysis of published data for herpes simplex virus 1. Thus, with current trends towards increasing sequencing depth and read length, poly(A) read mapping will prove to be increasingly useful and can now be performed automatically during RNA-seq mapping with ContextMap 2.
引用
收藏
页数:32
相关论文
共 48 条
[1]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]   KSHV 2.0: A Comprehensive Annotation of the Kaposi's Sarcoma-Associated Herpesvirus Genome Using NextGeneration Sequencing Reveals Novel Genomic and Functional Features [J].
Arias, Carolina ;
Weisburd, Ben ;
Stern-Ginossar, Noam ;
Mercier, Alexandre ;
Madrid, Alexis S. ;
Bellare, Priya ;
Holdorf, Meghan ;
Weissman, Jonathan S. ;
Ganem, Don .
PLOS PATHOGENS, 2014, 10 (01)
[3]   Patterns of variant polyadenylation signal usage in human genes [J].
Beaudoing, E ;
Freier, S ;
Wyatt, JR ;
Claverie, JM ;
Gautheret, D .
GENOME RESEARCH, 2000, 10 (07) :1001-1010
[4]  
Birol I, 2015, BIOCOMPUT-PAC SYM, P347
[5]   Herpes simplex virus DNA replication [J].
Boehmer, PE ;
Lehman, IR .
ANNUAL REVIEW OF BIOCHEMISTRY, 1997, 66 :347-384
[6]   ContextMap 2: fast and accurate context-based RNA-seq mapping [J].
Bonfert, Thomas ;
Kirner, Evelyn ;
Csaba, Gergely ;
Zimmer, Ralf ;
Friedel, Caroline C. .
BMC BIOINFORMATICS, 2015, 16
[7]   Pervasive Transcription of a Herpesvirus Genome Generates Functionally Important RNAs [J].
Canny, Susan P. ;
Reese, Tiffany A. ;
Johnson, L. Steven ;
Zhang, Xin ;
Kambal, Amal ;
Duan, Erning ;
Liu, Catherine Y. ;
Virgin, Herbert W. .
MBIO, 2014, 5 (02)
[8]   A quantitative atlas of polyadenylation in five mammals [J].
Derti, Adnan ;
Garrett-Engele, Philip ;
MacIsaac, Kenzie D. ;
Stevens, Richard C. ;
Sriram, Shreedharan ;
Chen, Ronghua ;
Rohl, Carol A. ;
Johnson, Jason M. ;
Babak, Tomas .
GENOME RESEARCH, 2012, 22 (06) :1173-1183
[9]   Mechanisms and Consequences of Alternative Polyadenylation [J].
Di Giammartino, Dafne Campigli ;
Nishida, Kensei ;
Manley, James L. .
MOLECULAR CELL, 2011, 43 (06) :853-866
[10]   Landscape of transcription in human cells [J].
Djebali, Sarah ;
Davis, Carrie A. ;
Merkel, Angelika ;
Dobin, Alex ;
Lassmann, Timo ;
Mortazavi, Ali ;
Tanzer, Andrea ;
Lagarde, Julien ;
Lin, Wei ;
Schlesinger, Felix ;
Xue, Chenghai ;
Marinov, Georgi K. ;
Khatun, Jainab ;
Williams, Brian A. ;
Zaleski, Chris ;
Rozowsky, Joel ;
Roeder, Maik ;
Kokocinski, Felix ;
Abdelhamid, Rehab F. ;
Alioto, Tyler ;
Antoshechkin, Igor ;
Baer, Michael T. ;
Bar, Nadav S. ;
Batut, Philippe ;
Bell, Kimberly ;
Bell, Ian ;
Chakrabortty, Sudipto ;
Chen, Xian ;
Chrast, Jacqueline ;
Curado, Joao ;
Derrien, Thomas ;
Drenkow, Jorg ;
Dumais, Erica ;
Dumais, Jacqueline ;
Duttagupta, Radha ;
Falconnet, Emilie ;
Fastuca, Meagan ;
Fejes-Toth, Kata ;
Ferreira, Pedro ;
Foissac, Sylvain ;
Fullwood, Melissa J. ;
Gao, Hui ;
Gonzalez, David ;
Gordon, Assaf ;
Gunawardena, Harsha ;
Howald, Cedric ;
Jha, Sonali ;
Johnson, Rory ;
Kapranov, Philipp ;
King, Brandon .
NATURE, 2012, 489 (7414) :101-108