Microhaplotypes provide increased power from short-read DNA sequences for relationship inference

被引:109
作者
Baetscher, Diana S. [1 ,2 ]
Clemento, Anthony J. [2 ,3 ]
Ng, Thomas C. [2 ,4 ]
Anderson, Eric C. [2 ]
Garza, John C. [1 ,2 ,3 ]
机构
[1] Univ Calif Santa Cruz, Dept Ocean Sci, Santa Cruz, CA 95064 USA
[2] Natl Marine Fisheries Serv, Southwest Fisheries Sci Ctr, Santa Cruz, CA 95060 USA
[3] Univ Calif Santa Cruz, Inst Marine Sci, Santa Cruz, CA 95064 USA
[4] Univ Calif Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
基金
美国国家科学基金会;
关键词
high-throughput DNA sequencing; microhaplotype; parentage; population genetics; relationship inference; SINGLE-NUCLEOTIDE POLYMORPHISMS; POPULATION; PARENTAGE; MARKERS; SALMON; SNPS; MICROSATELLITES; RECONSTRUCTION; HAPLOTYPES; DISCOVERY;
D O I
10.1111/1755-0998.12737
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The accelerating rate at which DNA sequence data are now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by these sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine microhaplotypes and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false-positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full-sibling and single parent-offspring pair identification. We also show how the identification of half-sibling pairs requires so much data that physical linkage becomes a consideration, and that most published studies that attempt to do so are dramatically underpowered. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.
引用
收藏
页码:296 / 305
页数:10
相关论文
共 45 条
[1]   Large-scale parentage analysis reveals reproductive patterns and heritability of spawn timing in a hatchery population of steelhead (Oncorhynchus mykiss) [J].
Abadia-Cardoso, Alicia ;
Anderson, Eric C. ;
Pearse, Devon E. ;
Garza, John Carlos .
MOLECULAR ECOLOGY, 2013, 22 (18) :4733-4746
[2]   Discovery and characterization of single-nucleotide polymorphisms in steelhead/rainbow trout, Oncorhynchus mykiss [J].
Abadia-Cardoso, Alicia ;
Clemento, Anthony J. ;
Garza, John Carlos .
MOLECULAR ECOLOGY RESOURCES, 2011, 11 :31-49
[3]   The power of single-nucleotide polymorphisms for large-scale parentage inference [J].
Anderson, EC ;
Garza, JC .
GENETICS, 2006, 172 (04) :2567-2582
[4]   Harnessing the power of RADseq for ecological and evolutionary genomics [J].
Andrews, Kimberly R. ;
Good, Jeffrey M. ;
Miller, Michael R. ;
Luikart, Gordon ;
Hohenlohe, Paul A. .
NATURE REVIEWS GENETICS, 2016, 17 (02) :81-92
[5]   Close-Kin Mark-Recapture [J].
Bravington, Mark V. ;
Skaug, Hans J. ;
Anderson, Eric C. .
STATISTICAL SCIENCE, 2016, 31 (02) :259-274
[6]   The utility of single nucleotide polymorphisms in inferences of population history [J].
Brumfield, RT ;
Beerli, P ;
Nickerson, DA ;
Edwards, SV .
TRENDS IN ECOLOGY & EVOLUTION, 2003, 18 (05) :249-256
[7]   Genotyping-in-Thousands by sequencing (GT-seq): A cost effective SNP genotyping method based on custom amplicon sequencing [J].
Campbell, Nathan R. ;
Harmon, Stephanie A. ;
Narum, Shawn R. .
MOLECULAR ECOLOGY RESOURCES, 2015, 15 (04) :855-867
[8]   Stacks: an analysis tool set for population genomics [J].
Catchen, Julian ;
Hohenlohe, Paul A. ;
Bassham, Susan ;
Amores, Angel ;
Cresko, William A. .
MOLECULAR ECOLOGY, 2013, 22 (11) :3124-3140
[9]   Discovery and characterization of single nucleotide polymorphisms in Chinook salmon, Oncorhynchus tshawytscha [J].
Clemento, A. J. ;
Abadia-Cardoso, A. ;
Starks, H. A. ;
Garza, J. C. .
MOLECULAR ECOLOGY RESOURCES, 2011, 11 :50-66
[10]  
Dent R, 2012, PLOS ONE, V7, DOI [10.1371/journal.pone.0036889, 10.1371/journal.pone.0037135]