Detecting Selective Sweeps from Pooled Next-Generation Sequencing Samples

被引:66
作者
Boitard, Simon [1 ]
Schloetterer, Christian [2 ]
Nolte, Viola [2 ]
Pandey, Ram Vinay [2 ]
Futschik, Andreas [3 ]
机构
[1] INRA, Lab Genet Cellulaire, F-31326 Castanet Tolosan, France
[2] Vetmeduni Vienna, Inst Populat Genet, Vienna, Austria
[3] Univ Vienna, Inst Stat & Decis Support Syst, Vienna, Austria
基金
奥地利科学基金会;
关键词
selective sweeps; next-generation sequencing; pooled DNA; Drosophila; allele frequency spectrum; hidden Markov model; DROSOPHILA-MELANOGASTER; DNA; DISCOVERY; FRAMEWORK; VARIANTS;
D O I
10.1093/molbev/mss090
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Due to its cost effectiveness, next-generation sequencing of pools of individuals (Pool-Seq) is becoming a popular strategy for characterizing variation in population samples. Because Pool-Seq provides genome-wide SNP frequency data, it is possible to use them for demographic inference and/or the identification of selective sweeps. Here, we introduce a statistical method that is designed to detect selective sweeps from pooled data by accounting for statistical challenges associated with Pool-Seq, namely sequencing errors and random sampling among chromosomes. This allows for an efficient use of the information: all base calls are included in the analysis, but the higher credibility of regions with higher coverage and base calls with better quality scores is accounted for. Computer simulations show that our method efficiently detects sweeps even at very low coverage (0.5x per chromosome). Indeed, the power of detecting sweeps is similar to what we could expect from sequences of individual chromosomes. Since the inference of selective sweeps is based on the allele frequency spectrum (AFS), we also provide a method to accurately estimate the AFS provided that the quality scores for the sequence reads are reliable. Applying our approach to Pool-Seq data from Drosophila melanogaster, we identify several selective sweep signatures on chromosome X that include some previously well-characterized sweeps like the wapl region.
引用
收藏
页码:2177 / 2186
页数:10
相关论文
共 30 条
[21]  
Kim Y, 2002, GENETICS, V160, P765
[22]   On Optimal Pooling Designs to Identify Rare Variants Through Massive Resequencing [J].
Lee, Joon Sang ;
Choi, Murim ;
Yan, Xiting ;
Lifton, Richard P. ;
Zhao, Hongyu .
GENETIC EPIDEMIOLOGY, 2011, 35 (03) :139-147
[23]   Inferring the demographic history and rate of adaptive substitution in Drosophila [J].
Li, Haipeng ;
Stephan, Wolfgang .
PLOS GENETICS, 2006, 2 (10) :1580-1589
[24]   A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data [J].
Li, Heng .
BIOINFORMATICS, 2011, 27 (21) :2987-2993
[25]   The Sequence Alignment/Map format and SAMtools [J].
Li, Heng ;
Handsaker, Bob ;
Wysoker, Alec ;
Fennell, Tim ;
Ruan, Jue ;
Homer, Nils ;
Marth, Gabor ;
Abecasis, Goncalo ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (16) :2078-2079
[26]   Fast and accurate short read alignment with Burrows-Wheeler transform [J].
Li, Heng ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (14) :1754-1760
[27]   Genomic scans for selective sweeps using SNP data [J].
Nielsen, R ;
Williamson, S ;
Kim, Y ;
Hubisz, MJ ;
Clark, AG ;
Bustamante, C .
GENOME RESEARCH, 2005, 15 (11) :1566-1575
[28]   Whole-genome resequencing reveals loci under selection during chicken domestication [J].
Rubin, Carl-Johan ;
Zody, Michael C. ;
Eriksson, Jonas ;
Meadows, Jennifer R. S. ;
Sherwood, Ellen ;
Webster, Matthew T. ;
Jiang, Lin ;
Ingman, Max ;
Sharpe, Ted ;
Ka, Sojeong ;
Hallbook, Finn ;
Besnier, Francois ;
Carlborg, Orjan ;
Bed'hom, Bertrand ;
Tixier-Boichard, Michele ;
Jensen, Per ;
Siegel, Paul ;
Lindblad-Toh, Kerstin ;
Andersson, Leif .
NATURE, 2010, 464 (7288) :587-U145
[29]   SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data [J].
Wei, Zhi ;
Wang, Wei ;
Hu, Pingzhao ;
Lyon, Gholson J. ;
Hakonarson, Hakon .
NUCLEIC ACIDS RESEARCH, 2011, 39 (19)
[30]   Localizing recent adaptive evolution in the human genome [J].
Williamson, Scott H. ;
Hubisz, Melissa J. ;
Clark, Andrew G. ;
Payseur, Bret A. ;
Bustamante, Carlos D. ;
Nielsen, Rasmus .
PLOS GENETICS, 2007, 3 (06) :901-915