SimRAD: an R package for simulation-based prediction of the number of loci expected in RADseq and similar genotyping by sequencing approaches

被引:96
作者
Lepais, Olivier [1 ,2 ]
Weir, Jason T. [3 ,4 ]
机构
[1] INRA, UMR 1224, St Pee Sur Nivelle, France
[2] Univ Pau & Pays Adour, UMR 1224, UFR Sci & Tech Cote Basque, F-64010 Pau, France
[3] Univ Toronto Scarborough, Dept Biol Sci, Toronto, ON M1C 1A4, Canada
[4] Univ Toronto Scarborough, Dept Ecol & Evolutionary Biol, Toronto, ON M1C 1A4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
GBS; genome complexity reduction; in silico digestion; next generation sequencing; restriction site associated DNA polymorphism; single nucleotide polymorphism; EEL ANGUILLA-ROSTRATA; SALMON SALMO-SALAR; ATLANTIC SALMON; GENOME; POLYMORPHISMS; DISCOVERY; RAINBOW; MARKERS; STACKS; TOOL;
D O I
10.1111/1755-0998.12273
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Application of high-throughput sequencing platforms in the field of ecology and evolutionary biology is developing quickly with the introduction of efficient methods to reduce genome complexity. Numerous approaches for genome complexity reduction have been developed using different combinations of restriction enzymes, library construction strategies and fragment size selection. As a result, the choice of which techniques to use may become cumbersome, because it is difficult to anticipate the number of loci resulting from each method. We developed SimRAD, an R package that performs in silico restriction enzyme digests and fragment size selection as implemented in most restriction site associated DNA polymorphism and genotyping by sequencing methods. In silico digestion is performed on a reference genome or on a randomly generated DNA sequence when no reference genome sequence is available. SimRAD accurately predicts the number of loci under alternative protocols when a reference genome sequence is available for the targeted species (or a close relative) but may be unreliable when no reference genome is available. SimRAD is also useful for fine-tuning a given protocol to adjust the number of targeted loci. Here, we outline the functionality of SimRAD and provide an illustrative example of the use of the package (available on the CRAN at http://cran.r-project.org/web/packages/SimRAD).
引用
收藏
页码:1314 / 1321
页数:8
相关论文
共 31 条
[1]   Catadromous eels continue to be slippery research subjects [J].
Avise, John C. .
MOLECULAR ECOLOGY, 2011, 20 (07) :1317-1319
[2]   Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers [J].
Baird, Nathan A. ;
Etter, Paul D. ;
Atwood, Tressa S. ;
Currey, Mark C. ;
Shiver, Anthony L. ;
Lewis, Zachary A. ;
Selker, Eric U. ;
Cresko, William A. ;
Johnson, Eric A. .
PLOS ONE, 2008, 3 (10)
[3]   Marker Density and Read Depth for Genotyping Populations Using Genotyping-by-Sequencing [J].
Beissinger, Timothy M. ;
Hirsch, Candice N. ;
Sekhon, Rajandeep S. ;
Foerster, Jillian M. ;
Johnson, James M. ;
Muttoni, German ;
Vaillancourt, Brieanne ;
Buell, C. Robin ;
Kaeppler, Shawn M. ;
de Leon, Natalia .
GENETICS, 2013, 193 (04) :1073-1081
[4]   Stacks: an analysis tool set for population genomics [J].
Catchen, Julian ;
Hohenlohe, Paul A. ;
Bassham, Susan ;
Amores, Angel ;
Cresko, William A. .
MOLECULAR ECOLOGY, 2013, 22 (11) :3124-3140
[5]   Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences [J].
Catchen, Julian M. ;
Amores, Angel ;
Hohenlohe, Paul ;
Cresko, William ;
Postlethwait, John H. .
G3-GENES GENOMES GENETICS, 2011, 1 (03) :171-182
[6]   Detection and genotyping of restriction fragment associated polymorphisms in polyploid crops with a pseudo-reference sequence: a case study in allotetraploid Brassica napus [J].
Chen, Xun ;
Li, Xuemin ;
Zhang, Bing ;
Xu, Jinsong ;
Wu, Zhikun ;
Wang, Bo ;
Li, Haitao ;
Younas, Muhammad ;
Huang, Lei ;
Luo, Yingfeng ;
Wu, Jiangsheng ;
Hu, Songnian ;
Liu, Kede .
BMC GENOMICS, 2013, 14
[7]   Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads [J].
Chong, Zechen ;
Ruan, Jue ;
Wu, Chung-I. .
BIOINFORMATICS, 2012, 28 (21) :2732-2737
[8]   Population genetics of the American eel (Anguilla rostrata): FST=0 and North Atlantic Oscillation effects on demographic fluctuations of a panmictic species [J].
Cote, Caroline L. ;
Gagnaire, Pierre-Alexandre ;
Bourret, Vincent ;
Verreault, Guy ;
Castonguay, Martin ;
Bernatchez, Louis .
MOLECULAR ECOLOGY, 2013, 22 (07) :1763-1776
[9]   Genome-wide genetic marker discovery and genotyping using next-generation sequencing [J].
Davey, John W. ;
Hohenlohe, Paul A. ;
Etter, Paul D. ;
Boone, Jason Q. ;
Catchen, Julian M. ;
Blaxter, Mark L. .
NATURE REVIEWS GENETICS, 2011, 12 (07) :499-510
[10]   Sequencing the genome of the Atlantic salmon (Salmo salar) [J].
Davidson, William S. ;
Koop, Ben F. ;
Jones, Steven J. M. ;
Iturra, Patricia ;
Vidal, Rodrigo ;
Maass, Alejandro ;
Jonassen, Inge ;
Lien, Sigbjorn ;
Omholt, Stig W. .
GENOME BIOLOGY, 2010, 11 (09)