Viral population analysis and minority-variant detection using short read next-generation sequencing

被引:135
作者
Watson, Simon J. [1 ]
Welkers, Matthijs R. A. [2 ]
Depledge, Daniel P. [3 ]
Coulter, Eve [3 ]
Breuer, Judith M. [3 ]
de Jong, Menno D. [2 ]
Kellam, Paul [1 ,3 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[2] Univ Amsterdam, Acad Med Ctr, Dept Med Microbiol, NL-1105 AZ Amsterdam, Netherlands
[3] UCL, MRC UCL Ctr Med Mol Virol, Div Infect & Immun, London WC1E 6BT, England
基金
英国惠康基金;
关键词
Quality Assessment of Short Read Pipeline; influenza virus; minority-variant analysis; deep-sequencing; population dynamics; QUALITY; HIV-1;
D O I
10.1098/rstb.2012.0205
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA viruses within infected individuals exist as a population of evolutionary-related variants. Owing to evolutionary change affecting the constitution of this population, the frequency and/or occurrence of individual viral variants can show marked or subtle fluctuations. Since the development of massively parallel sequencing platforms, such viral populations can now be investigated to unprecedented resolution. A critical problem with such analyses is the presence of sequencing-related errors that obscure the identification of true biological variants present at low frequency. Here, we report the development and assessment of the Quality Assessment of Short Read (QUASR) Pipeline (http://sourceforge.net/projects/quasr) specific for virus genome short read analysis that minimizes sequencing errors from multiple deep-sequencing platforms, and enables post-mapping analysis of the minority variants within the viral population. QUASR significantly reduces the error-related noise in deep-sequencing datasets, resulting in increased mapping accuracy and reduction of erroneous mutations. Using QUASR, we have determined influenza virus genome dynamics in sequential samples from an in vitro evolution of 2009 pandemic H1N1 (A/H1N1/09) influenza from samples sequenced on both the Roche 454 GSFLX and Illumina GAIIx platforms. Importantly, concordance between the 454 and Illumina sequencing allowed unambiguous minority-variant detection and accurate determination of virus population turnover in vitro.
引用
收藏
页数:8
相关论文
共 19 条
[1]   Analysis of high-depth sequence data for studying viral diversity: a comparison of next generation sequencing platforms using Segminator II [J].
Archer, John ;
Baillie, Greg ;
Watson, Simon J. ;
Kellam, Paul ;
Rambaut, Andrew ;
Robertson, David L. .
BMC BIOINFORMATICS, 2012, 13
[2]   Detection of low-frequency pretherapy chemokine (CXC motif) receptor 4 (CXCR4)-using HIV-1 with ultra-deep pyrosequencing [J].
Archer, John ;
Braverman, Michael S. ;
Taillon, Bruce E. ;
Desany, Brian ;
James, Ian ;
Harrigan, P. Richard ;
Lewis, Marilyn ;
Robertson, David L. .
AIDS, 2009, 23 (10) :1209-1218
[3]   Evolutionary Dynamics of Local Pandemic H1N1/2009 Influenza Virus Lineages Revealed by Whole-Genome Analysis [J].
Baillie, Gregory J. ;
Galiano, Monica ;
Agapow, Paul-Michael ;
Myers, Richard ;
Chiam, Rachael ;
Gall, Astrid ;
Palser, Anne L. ;
Watson, Simon J. ;
Hedge, Jessica ;
Underwood, Anthony ;
Platt, Steven ;
McLean, Estelle ;
Pebody, Richard G. ;
Rambaut, Andrew ;
Green, Jonathan ;
Daniels, Rod ;
Pybus, Oliver G. ;
Kellam, Paul ;
Zambon, Maria .
JOURNAL OF VIROLOGY, 2012, 86 (01) :11-18
[4]   Detection of Inferred CCR5-and CXCR4-Using HIV-1 Variants and Evolutionary Intermediates Using Ultra-Deep Pyrosequencing [J].
Bunnik, Evelien M. ;
Swenson, Luke C. ;
Edo-Matas, Diana ;
Huang, Wei ;
Dong, Winnie ;
Frantzell, Arne ;
Petropoulos, Christos J. ;
Coakley, Eoin ;
Schuitemaker, Hanneke ;
Harrigan, P. Richard ;
van't Wout, Angelique B. .
PLOS PATHOGENS, 2011, 7 (06)
[5]   Viral population estimation using pyrosequencing [J].
Eriksson, Nicholas ;
Pachter, Lior ;
Mitsuya, Yumi ;
Rhee, Soo-Yon ;
Wang, Chunlin ;
Gharizadeh, Baback ;
Ronaghi, Mostafa ;
Shafer, Robert W. ;
Beerenwinkel, Niko .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (05)
[6]   Accuracy and quality of massively parallel DNA pyrosequencing [J].
Huse, Susan M. ;
Huber, Julie A. ;
Morrison, Hilary G. ;
Sogin, Mitchell L. ;
Mark Welch, David .
GENOME BIOLOGY, 2007, 8 (07)
[7]   Fast and accurate short read alignment with Burrows-Wheeler transform [J].
Li, Heng ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (14) :1754-1760
[8]   The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data [J].
McKenna, Aaron ;
Hanna, Matthew ;
Banks, Eric ;
Sivachenko, Andrey ;
Cibulskis, Kristian ;
Kernytsky, Andrew ;
Garimella, Kiran ;
Altshuler, David ;
Gabriel, Stacey ;
Daly, Mark ;
DePristo, Mark A. .
GENOME RESEARCH, 2010, 20 (09) :1297-1303
[9]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Sequencing technologies - the next generation [J].
Metzker, Michael L. .
NATURE REVIEWS GENETICS, 2010, 11 (01) :31-46
[10]   The population genetics and evolutionary epidemiology of RNA viruses [J].
Moya, A ;
Holmes, EC ;
González-Candelas, F .
NATURE REVIEWS MICROBIOLOGY, 2004, 2 (04) :279-288