Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics

被引:804
作者
Rochette, Nicolas C. [1 ]
Rivera-Colon, Angel G. [1 ]
Catchen, Julian M. [1 ]
机构
[1] Univ Illinois, Dept Evolut Ecol & Behav, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
bioinformatics; conservation genetics; genotype calling; haplotype phasing; population genetics; restriction-site associated DNA sequencing; RESTRICTION ENZYME; HARDY-WEINBERG; DE-NOVO; DNA; GENETICS; TOOL; ADAPTATION; DISCOVERY; FRAMEWORK; EVOLUTION;
D O I
10.1111/mec.15253
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For half a century population genetics studies have put type II restriction endonucleases to work. Now, coupled with massively-parallel, short-read sequencing, the family of RAD protocols that wields these enzymes has generated vast genetic knowledge from the natural world. Here, we describe the first software natively capable of using paired-end sequencing to derive short contigs from de novo RAD data. Stacks version 2 employs a de Bruijn graph assembler to build and connect contigs from forward and reverse reads for each de novo RAD locus, which it then uses as a reference for read alignments. The new architecture allows all the individuals in a metapopulation to be considered at the same time as each RAD locus is processed. This enables a Bayesian genotype caller to provide precise SNPs, and a robust algorithm to phase those SNPs into long haplotypes, generating RAD loci that are 400-800 bp in length. To prove its recall and precision, we tested the software with simulated data and compared reference-aligned and de novo analyses of three empirical data sets. Our study shows that the latest version of Stacks is highly accurate and outperforms other software in assembling and genotyping paired-end de novo data sets.
引用
收藏
页码:4737 / 4754
页数:18
相关论文
共 78 条
[1]   RAD Capture (Rapture): Flexible and Efficient Sequence-Based Genotyping [J].
Ali, Omar A. ;
O'Rourke, Sean M. ;
Amish, Stephen J. ;
Meek, Mariah H. ;
Luikart, Gordon ;
Jeffres, Carson ;
Miller, Michael R. .
GENETICS, 2016, 202 (02) :389-+
[2]   Genome Evolution and Meiotic Maps by Massively Parallel DNA Sequencing: Spotted Gar, an Outgroup for the Teleost Genome Duplication [J].
Amores, Angel ;
Catchen, Julian ;
Ferrara, Allyse ;
Fontenot, Quenton ;
Postlethwait, John H. .
GENETICS, 2011, 188 (04) :799-U79
[3]   Harnessing the power of RADseq for ecological and evolutionary genomics [J].
Andrews, Kimberly R. ;
Good, Jeffrey M. ;
Miller, Michael R. ;
Luikart, Gordon ;
Hohenlohe, Paul A. .
NATURE REVIEWS GENETICS, 2016, 17 (02) :81-92
[4]  
Aronesty, 2011, EA UTILS COMMAND LIN
[5]   Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers [J].
Baird, Nathan A. ;
Etter, Paul D. ;
Atwood, Tressa S. ;
Currey, Mark C. ;
Shiver, Anthony L. ;
Lewis, Zachary A. ;
Selker, Eric U. ;
Cresko, William A. ;
Johnson, Eric A. .
PLOS ONE, 2008, 3 (10)
[6]   Repeated Selection of Alternatively Adapted Haplotypes Creates Sweeping Genomic Remodeling in Stickleback [J].
Bassham, Susan ;
Catchen, Julian ;
Lescak, Emily ;
von Hippel, Frank A. ;
Cresko, William A. .
GENETICS, 2018, 209 (03) :921-939
[7]   Genomic signals of selection predict climate-driven population declines in a migratory bird [J].
Bay, Rachael A. ;
Harrigan, Ryan J. ;
Le Underwood, Vinh ;
Gibbs, H. Lisle ;
Smith, Thomas B. ;
Ruegg, Kristen .
SCIENCE, 2018, 359 (6371) :83-+
[8]  
Bird CE, 2011, CRUSTACEAN ISS, V19, P31
[9]   Amplified fragment length polymorphism (AFLP): a review of the procedure and its applications [J].
Blears, MJ ;
De Grandis, SA ;
Lee, H ;
Trevors, JT .
JOURNAL OF INDUSTRIAL MICROBIOLOGY & BIOTECHNOLOGY, 1998, 21 (03) :99-114
[10]  
BOTSTEIN D, 1980, AM J HUM GENET, V32, P314