Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways

被引:18
作者
Archer, Nathan [1 ]
Walsh, Mark D. [1 ]
Shahrezaei, Vahid [2 ]
Hebenstreit, Daniel [1 ]
机构
[1] Univ Warwick, Sch Life Sci, Coventry CV4 7AL, W Midlands, England
[2] Imperial Coll, Dept Math, London SW7 2AZ, England
基金
英国生物技术与生命科学研究理事会;
关键词
SINGLE-CELL TRANSCRIPTOMICS; NONUNIFORM READ DISTRIBUTION; GENE-EXPRESSION; REVERSE-TRANSCRIPTASE; HIGHLY PARALLEL; SEQUENCING DATA; QUANTIFICATION; GENOME; DEGRADATION; CHALLENGES;
D O I
10.1016/j.cels.2016.10.012
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Experimental procedures for preparing RNA-seq and single-cell (sc) RNA-seq libraries are based on assumptions regarding their underlying enzymatic reactions. Here, we show that the fairness of these assumptions varies within libraries: coverage by sequencing reads along and between transcripts exhibits characteristic, protocol-dependent biases. To understand the mechanistic basis of this bias, we present an integrated modeling framework that infers the relationship between enzyme reactions during library preparation and the characteristic coverage patterns observed for different protocols. Analysis of new and existing (sc) RNA-seq data from six different library preparation protocols reveals that polymerase processivity is the mechanistic origin of coverage biases. We apply our framework to demonstrate that lowering incubation temperature increases processivity, yield, and (sc) RNA-seq sensitivity in all protocols. We also provide correction factors based on our model for increasing accuracy of transcript quantification in existing samples prepared at standard temperatures. In total, our findings improve our ability to accurately reflect in vivo transcript abundances in (sc) RNA-seq libraries.
引用
收藏
页码:467 / +
页数:25
相关论文
共 57 条
[1]   Comparative analysis of RNA sequencing methods for degraded or low-input samples [J].
Adiconis, Xian ;
Borges-Rivera, Diego ;
Satija, Rahul ;
DeLuca, David S. ;
Busby, Michele A. ;
Berlin, Aaron M. ;
Sivachenko, Andrey ;
Thompson, Dawn Anne ;
Wysoker, Alec ;
Fennell, Timothy ;
Gnirke, Andreas ;
Pochet, Nathalie ;
Regev, Aviv ;
Levin, Joshua Z. .
NATURE METHODS, 2013, 10 (07) :623-+
[2]  
[Anonymous], 2005, Nat Methods, V2, P151, DOI DOI 10.1038/NMETH0205-151
[3]  
Bambara RA, 1995, METHOD ENZYMOL, V262, P270
[4]   Technical Variations in Low-Input RNA-seq Methodologies [J].
Bhargava, Vipul ;
Head, Steven R. ;
Ordoukhanian, Phillip ;
Mercola, Mark ;
Subramaniam, Shankar .
SCIENTIFIC REPORTS, 2014, 4
[5]   High processivity of the reverse transcriptase from a non-long terminal repeat retrotransposon [J].
Bibillo, A ;
Eickbush, TH .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (38) :34836-34845
[6]   rQuant.web: a tool for RNA-Seq-based transcript quantitation [J].
Bohnert, Regina ;
Raetsch, Gunnar .
NUCLEIC ACIDS RESEARCH, 2010, 38 :W348-W351
[7]   Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells [J].
Deng, Qiaolin ;
Ramskold, Daniel ;
Reinius, Bjorn ;
Sandberg, Rickard .
SCIENCE, 2014, 343 (6167) :193-196
[8]   Single-stranded DNA library preparation for the sequencing of ancient or damaged DNA [J].
Gansauge, Marie-Theres ;
Meyer, Matthias .
NATURE PROTOCOLS, 2013, 8 (04) :737-748
[9]   Modelling and simulating generic RNA-Seq experiments with the flux simulator [J].
Griebel, Thasso ;
Zacher, Benedikt ;
Ribeca, Paolo ;
Raineri, Emanuele ;
Lacroix, Vincent ;
Guigo, Roderic ;
Sammeth, Michael .
NUCLEIC ACIDS RESEARCH, 2012, 40 (20) :10073-10083
[10]  
Grün D, 2014, NAT METHODS, V11, P637, DOI [10.1038/NMETH.2930, 10.1038/nmeth.2930]