Detecting Selective Sweeps from Pooled Next-Generation Sequencing Samples

被引:66
作者
Boitard, Simon [1 ]
Schloetterer, Christian [2 ]
Nolte, Viola [2 ]
Pandey, Ram Vinay [2 ]
Futschik, Andreas [3 ]
机构
[1] INRA, Lab Genet Cellulaire, F-31326 Castanet Tolosan, France
[2] Vetmeduni Vienna, Inst Populat Genet, Vienna, Austria
[3] Univ Vienna, Inst Stat & Decis Support Syst, Vienna, Austria
基金
奥地利科学基金会;
关键词
selective sweeps; next-generation sequencing; pooled DNA; Drosophila; allele frequency spectrum; hidden Markov model; DROSOPHILA-MELANOGASTER; DNA; DISCOVERY; FRAMEWORK; VARIANTS;
D O I
10.1093/molbev/mss090
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Due to its cost effectiveness, next-generation sequencing of pools of individuals (Pool-Seq) is becoming a popular strategy for characterizing variation in population samples. Because Pool-Seq provides genome-wide SNP frequency data, it is possible to use them for demographic inference and/or the identification of selective sweeps. Here, we introduce a statistical method that is designed to detect selective sweeps from pooled data by accounting for statistical challenges associated with Pool-Seq, namely sequencing errors and random sampling among chromosomes. This allows for an efficient use of the information: all base calls are included in the analysis, but the higher credibility of regions with higher coverage and base calls with better quality scores is accounted for. Computer simulations show that our method efficiently detects sweeps even at very low coverage (0.5x per chromosome). Indeed, the power of detecting sweeps is similar to what we could expect from sequences of individual chromosomes. Since the inference of selective sweeps is based on the allele frequency spectrum (AFS), we also provide a method to accurately estimate the AFS provided that the quality scores for the sequence reads are reliable. Applying our approach to Pool-Seq data from Drosophila melanogaster, we identify several selective sweep signatures on chromosome X that include some previously well-characterized sweeps like the wapl region.
引用
收藏
页码:2177 / 2186
页数:10
相关论文
共 30 条
  • [1] Achaz G., 2009, GENETICS, V179, P1409
  • [2] Multiplexed shotgun genotyping for rapid and efficient genetic mapping
    Andolfatto, Peter
    Davison, Dan
    Erezyilmaz, Deniz
    Hu, Tina T.
    Mast, Joshua
    Sunayama-Morita, Tomoko
    Stern, David L.
    [J]. GENOME RESEARCH, 2011, 21 (04) : 610 - 617
  • [3] A statistical method for the detection of variants from next-generation resequencing of DNA pools
    Bansal, Vikas
    [J]. BIOINFORMATICS, 2010, 26 (12) : i318 - i324
  • [4] Evidence for a selective sweep in the wapl region of Drosophila melanogaster
    Beisswanger, S
    Stephan, W
    De Lorenzo, D
    [J]. GENETICS, 2006, 172 (01) : 265 - 274
  • [5] Detecting Selective Sweeps: A New Approach Based on Hidden Markov Models
    Boitard, Simon
    Schloetterer, Christian
    Futschik, Andreas
    [J]. GENETICS, 2009, 181 (04) : 1567 - 1578
  • [6] Reduced sleep in Drosophila shaker mutants
    Cirelli, C
    Bushey, D
    Hill, S
    Huber, R
    Kreber, R
    Ganetzky, B
    Tononi, G
    [J]. NATURE, 2005, 434 (7037) : 1087 - 1092
  • [7] A framework for variation discovery and genotyping using next-generation DNA sequencing data
    DePristo, Mark A.
    Banks, Eric
    Poplin, Ryan
    Garimella, Kiran V.
    Maguire, Jared R.
    Hartl, Christopher
    Philippakis, Anthony A.
    del Angel, Guillermo
    Rivas, Manuel A.
    Hanna, Matt
    McKenna, Aaron
    Fennell, Tim J.
    Kernytsky, Andrew M.
    Sivachenko, Andrey Y.
    Cibulskis, Kristian
    Gabriel, Stacey B.
    Altshuler, David
    Daly, Mark J.
    [J]. NATURE GENETICS, 2011, 43 (05) : 491 - +
  • [8] Substantial biases in ultra-short read data sets from high-throughput DNA sequencing
    Dohm, Juliane C.
    Lottaz, Claudio
    Borodina, Tatiana
    Himmelbauer, Heinz
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
  • [9] Druley TE, 2009, NAT METHODS, V6, P263, DOI [10.1038/NMETH.1307, 10.1038/nmeth.1307]
  • [10] A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species
    Elshire, Robert J.
    Glaubitz, Jeffrey C.
    Sun, Qi
    Poland, Jesse A.
    Kawamoto, Ken
    Buckler, Edward S.
    Mitchell, Sharon E.
    [J]. PLOS ONE, 2011, 6 (05):