Next-generation sequencing for molecular ecology: a caveat regarding pooled samples

被引:44
作者
Anderson, Eric C. [1 ,2 ]
Skaug, Hans J. [2 ,3 ]
Barshis, Daniel J. [1 ]
机构
[1] NOAA, Fisheries Ecol Div, SW Fisheries Sci Ctr, Natl Marine Fisheries Serv, Santa Cruz, CA 95060 USA
[2] Univ Calif Santa Cruz, Dept Appl Math & Stat SOE2, Santa Cruz, CA 95064 USA
[3] Univ Bergen, Dept Math, N-5020 Bergen, Norway
关键词
compound multinomial distribution; outlier analysis; population divergence; SNP discovery; SNP DISCOVERY; DIFFERENTIATION; ACCURACY; MODEL;
D O I
10.1111/mec.12609
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We develop a model based on the Dirichlet-compound multinomial distribution (CMD) and Ewens sampling formula to predict the fraction of SNP loci that will appear fixed for alternate alleles between two pooled samples drawn from the same underlying population. We apply this model to next-generation sequencing (NGS) data from Baltic Sea herring recently published by (Corander etal., , Molecular Ecology, 2931-2940), and show that there are many more fixed loci than expected in the absence of genetic structure. However, we show through coalescent simulations that the degree of population structure required to explain the fraction of alternatively fixed SNPs is extraordinarily high and that the surplus of fixed loci is more likely a consequence of limited representation of individual gene copies in the pooled samples, than it is of population structure. Our analysis signals that the use of NGS on pooled samples to identify divergent SNPs warrants caution. With pooled samples, it is hard to diagnose when an NGS experiment has gone awry; especially when NGS data on pooled samples are of low read depth with a limited number of individuals, it may be worthwhile to temper claims of unexpected population differentiation from pooled samples, pending verification with more reliable methods or stricter adherence to recommended sampling designs for pooled sequencing e.g. Futschik & Schlotterer , Genetics, 186, 207; Gautier etal., , Molecular Ecology, 3766-3779). Analysis of the data and diagnosis of problems is easier and more reliable (and can be less costly) with individually barcoded samples. Consequently, for some scenarios, individual barcoding may be preferable to pooling of samples.
引用
收藏
页码:502 / 512
页数:11
相关论文
共 36 条
  • [1] Single-Nucleotide Polymorphisms (SNPs) under Diversifying Selection Provide Increased Accuracy and Precision in Mixed-Stock Analyses of Sockeye Salmon from the Copper River, Alaska
    Ackerman, Michael W.
    Habicht, Christopher
    Seeb, Lisa W.
    [J]. TRANSACTIONS OF THE AMERICAN FISHERIES SOCIETY, 2011, 140 (03) : 865 - 881
  • [2] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [3] Assessing the power of informative subsets of loci for population assignment: standard methods are upwardly biased
    Anderson, E. C.
    [J]. MOLECULAR ECOLOGY RESOURCES, 2010, 10 (04) : 701 - 710
  • [4] Anderson EC, 2013, DRYAD DIGITAL REPOSI, DOI 10.5061/dryad.cp8t9
  • [5] An improved method for predicting the accuracy of genetic stock identification
    Anderson, Eric C.
    Waples, Robin S.
    Kalinowski, Steven T.
    [J]. CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 2008, 65 (07) : 1475 - 1486
  • [6] Multiplexed shotgun genotyping for rapid and efficient genetic mapping
    Andolfatto, Peter
    Davison, Dan
    Erezyilmaz, Deniz
    Hu, Tina T.
    Mast, Joshua
    Sunayama-Morita, Tomoko
    Stern, David L.
    [J]. GENOME RESEARCH, 2011, 21 (04) : 610 - 617
  • [7] [Anonymous], 1997, Discrete Multivariate Distributions
  • [8] [Anonymous], 1965, P INTERNAT RES SEM S
  • [9] Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers
    Baird, Nathan A.
    Etter, Paul D.
    Atwood, Tressa S.
    Currey, Mark C.
    Shiver, Anthony L.
    Lewis, Zachary A.
    Selker, Eric U.
    Cresko, William A.
    Johnson, Eric A.
    [J]. PLOS ONE, 2008, 3 (10):
  • [10] Genome-wide analysis of a long-term evolution experiment with Drosophila
    Burke, Molly K.
    Dunham, Joseph P.
    Shahrestani, Parvin
    Thornton, Kevin R.
    Rose, Michael R.
    Long, Anthony D.
    [J]. NATURE, 2010, 467 (7315) : 587 - U111