Paralogs are revealed by proportion of heterozygotes and deviations in read ratios in genotyping-by-sequencing data from natural populations

被引:162
作者
McKinney, Garrett J. [1 ]
Waples, Ryan K. [1 ,2 ]
Seeb, Lisa W. [1 ]
Seeb, James E. [1 ]
机构
[1] Univ Washington, Sch Aquat & Fishery Sci, 1122 NE Boat St,Box 355020, Seattle, WA 98195 USA
[2] Univ Copenhagen, Dept Biol, Bioinformat Ctr, DK-2200 Copenhagen, Denmark
关键词
Chinook salmon; genome duplication; genotyping-by-sequencing; natural populations; paralog; RADseq; WHOLE-GENOME DUPLICATION; GENE DUPLICATION; RAINBOW-TROUT; SNP DISCOVERY; LINKAGE MAP; EVOLUTION; INHERITANCE; PROVIDES; RADSEQ; FISH;
D O I
10.1111/1755-0998.12613
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Whole-genome duplications have occurred in the recent ancestors of many plants, fish, and amphibians, resulting in a pervasiveness of paralogous loci and the potential for both disomic and tetrasomic inheritance in the same genome. Paralogs can be difficult to reliably genotype and are often excluded from genotyping-by-sequencing(GBS) analyses; however, removal requires paralogs to be identified which is difficult without a reference genome. We present a method for identifying paralogs in natural populations by combining two properties of duplicated loci: (i) the expected frequency of heterozygotes exceeds that for singleton loci, and (ii) within heterozygotes, observed read ratios for each allele in GBS data will deviate from the 1: 1 expected for singleton (diploid) loci. These deviations are often not apparent within individuals, particularly when sequence coverage is low; but, we postulated that summing allele reads for each locus over all heterozygous individuals in a population would provide sufficient power to detect deviations at those loci. We identified paralogous loci in three species: Chinook salmon (Oncorhynchus tshawytscha) which retains regions with ongoing residual tetrasomy on eight chromosome arms following a recent whole-genome duplication, mountain barberry (Berberis alpina) which has a large proportion of paralogs that arose through an unknown mechanism, and dusky parrotfish (Scarus niger) which has largely rediploidized following an ancient whole-genome duplication. Importantly, this approach only requires the genotype and allele-specific read counts for each individual, information which is readily obtained from most GBS analysis pipelines.
引用
收藏
页码:656 / 669
页数:14
相关论文
共 63 条
[21]   Don't throw the baby out with the bathwater: identifying and mapping paralogs in salmonids [J].
Dufresne, France .
MOLECULAR ECOLOGY RESOURCES, 2016, 16 (01) :7-9
[22]   Recent progress and challenges in population genetics of polyploid organisms: an overview of current state-of-the-art molecular and statistical tools [J].
Dufresne, France ;
Stift, Marc ;
Vergilino, Roland ;
Mable, Barbara K. .
MOLECULAR ECOLOGY, 2014, 23 (01) :40-69
[23]   PyRAD: assembly of de novo RADseq loci for phylogenetic analyses [J].
Eaton, Deren A. R. .
BIOINFORMATICS, 2014, 30 (13) :1844-1849
[24]   Short reads and nonmodel species: exploring the complexities of next-generation sequence assembly and SNP discovery in the absence of a reference genome [J].
Everett, M. V. ;
Grau, E. D. ;
Seeb, J. E. .
MOLECULAR ECOLOGY RESOURCES, 2011, 11 :93-108
[25]   GENETIC-VARIABILITY IN SPECIES WITH EXTENSIVE GENE DUPLICATION - THE TETRAPLOID CATOSTOMID FISHES [J].
FERRIS, SD ;
WHITT, GS .
AMERICAN NATURALIST, 1980, 115 (05) :650-666
[26]   Genome-wide association reveals genetic basis for the propensity to migrate in wild populations of rainbow and steelhead trout [J].
Hecht, Benjamin C. ;
Campbell, Nathan R. ;
Holecek, Dean E. ;
Narum, Shawn R. .
MOLECULAR ECOLOGY, 2013, 22 (11) :3061-3076
[27]   Genomic patterns of introgression in rainbow and westslope cutthroat trout illuminated by overlapping paired-end RAD sequencing [J].
Hohenlohe, Paul A. ;
Day, Mitch D. ;
Amish, Stephen J. ;
Miller, Michael R. ;
Kamps-Hughes, Nick ;
Boyer, Matthew C. ;
Muhlfeld, Clint C. ;
Allendorf, Fred W. ;
Johnson, Eric A. ;
Luikart, Gordon .
MOLECULAR ECOLOGY, 2013, 22 (11) :3002-3013
[28]   Next-generation RAD sequencing identifies thousands of SNPs for assessing hybridization between rainbow and westslope cutthroat trout [J].
Hohenlohe, Paul A. ;
Amish, Stephen J. ;
Catchen, Julian M. ;
Allendorf, Fred W. ;
Luikart, Gordon .
MOLECULAR ECOLOGY RESOURCES, 2011, 11 :117-122
[29]   Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype [J].
Jaillon, O ;
Aury, JM ;
Brunet, F ;
Petit, JL ;
Stange-Thomann, N ;
Mauceli, E ;
Bouneau, L ;
Fischer, C ;
Ozouf-Costaz, C ;
Bernot, A ;
Nicaud, S ;
Jaffe, D ;
Fisher, S ;
Lutfalla, G ;
Dossat, C ;
Segurens, B ;
Dasilva, C ;
Salanoubat, M ;
Levy, M ;
Boudet, N ;
Castellano, S ;
Anthouard, R ;
Jubin, C ;
Castelli, V ;
Katinka, M ;
Vacherie, B ;
Biémont, C ;
Skalli, Z ;
Cattolico, L ;
Poulain, J ;
de Berardinis, V ;
Cruaud, C ;
Duprat, S ;
Brottier, P ;
Coutanceau, JP ;
Gouzy, J ;
Parra, G ;
Lardier, G ;
Chapple, C ;
McKernan, KJ ;
McEwan, P ;
Bosak, S ;
Kellis, M ;
Volff, JN ;
Guigó, R ;
Zody, MC ;
Mesirov, J ;
Lindblad-Toh, K ;
Birren, B ;
Nusbaum, C .
NATURE, 2004, 431 (7011) :946-957
[30]   Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae [J].
Kellis, M ;
Birren, BW ;
Lander, ES .
NATURE, 2004, 428 (6983) :617-624