Is RAD-seq suitable for phylogenetic inference? An in silico assessment and optimization

被引:135
作者
Cariou, Marie [1 ]
Duret, Laurent [1 ]
Charlat, Sylvain [1 ]
机构
[1] Univ Lyon 1, CNRS, Univ Lyon, Lab Biometrie & Biol Evolut,UMR 5558, F-69622 Villeurbanne, France
关键词
Bioinfomatics; phyloinfomatics; molecular evolution; phylogenetic theory and methods; phylogeography; EVOLUTION;
D O I
10.1002/ece3.512
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Inferring phylogenetic relationships between closely related taxa can be hindered by three factors: (1) the lack of informative molecular variation at short evolutionary timescale; (2) the lack of established markers in poorly studied taxa; and (3) the potential phylogenetic conflicts among different genomic regions due to incomplete lineage sorting or introgression. In this context, Restriction site Associated DNA sequencing (RAD-seq) seems promising as this technique can generate sequence data from numerous DNA fragments scattered throughout the genome, from a large number of samples, and without preliminary knowledge on the taxa under study. However, divergence beyond the within-species level will necessarily reduce the number of conserved and non-duplicated restriction sites, and therefore the number of loci usable for phylogenetic inference. Here, we assess the suitability of RAD-seq for phylogeny using a simulated experiment on the 12 Drosophila genomes, with divergence times ranging from 5 to 63 million years. These simulations show that RAD-seq allows the recovery of the known Drosophila phylogeny with strong statistical support, even for relatively ancient nodes. Notably, this conclusion is robust to the potentially confounding effects of sequencing errors, heterozygosity, and low coverage. We further show that clustering RAD-seq data using the BLASTN and SiLiX programs significantly improves the recovery of orthologous RAD loci compared with previously proposed approaches, especially for distantly related species. This study therefore validates the view that RAD sequencing is a powerful tool for phylogenetic inference.
引用
收藏
页码:846 / 852
页数:7
相关论文
共 19 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers [J].
Baird, Nathan A. ;
Etter, Paul D. ;
Atwood, Tressa S. ;
Currey, Mark C. ;
Shiver, Anthony L. ;
Lewis, Zachary A. ;
Selker, Eric U. ;
Cresko, William A. ;
Johnson, Eric A. .
PLOS ONE, 2008, 3 (10)
[3]   Linkage Mapping and Comparative Genomics Using Next-Generation RAD Sequencing of a Non-Model Organism [J].
Baxter, Simon W. ;
Davey, John W. ;
Johnston, J. Spencer ;
Shelton, Anthony M. ;
Heckel, David G. ;
Jiggins, Chris D. ;
Blaxter, Mark L. .
PLOS ONE, 2011, 6 (04)
[4]   Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences [J].
Catchen, Julian M. ;
Amores, Angel ;
Hohenlohe, Paul ;
Cresko, William ;
Postlethwait, John H. .
G3-GENES GENOMES GENETICS, 2011, 1 (03) :171-182
[5]   Evolution of genes and genomes on the Drosophila phylogeny [J].
Clark, Andrew G. ;
Eisen, Michael B. ;
Smith, Douglas R. ;
Bergman, Casey M. ;
Oliver, Brian ;
Markow, Therese A. ;
Kaufman, Thomas C. ;
Kellis, Manolis ;
Gelbart, William ;
Iyer, Venky N. ;
Pollard, Daniel A. ;
Sackton, Timothy B. ;
Larracuente, Amanda M. ;
Singh, Nadia D. ;
Abad, Jose P. ;
Abt, Dawn N. ;
Adryan, Boris ;
Aguade, Montserrat ;
Akashi, Hiroshi ;
Anderson, Wyatt W. ;
Aquadro, Charles F. ;
Ardell, David H. ;
Arguello, Roman ;
Artieri, Carlo G. ;
Barbash, Daniel A. ;
Barker, Daniel ;
Barsanti, Paolo ;
Batterham, Phil ;
Batzoglou, Serafim ;
Begun, Dave ;
Bhutkar, Arjun ;
Blanco, Enrico ;
Bosak, Stephanie A. ;
Bradley, Robert K. ;
Brand, Adrianne D. ;
Brent, Michael R. ;
Brooks, Angela N. ;
Brown, Randall H. ;
Butlin, Roger K. ;
Caggese, Corrado ;
Calvi, Brian R. ;
de Carvalho, A. Bernardo ;
Caspi, Anat ;
Castrezana, Sergio ;
Celniker, Susan E. ;
Chang, Jean L. ;
Chapple, Charles ;
Chatterji, Sourav ;
Chinwalla, Asif ;
Civetta, Alberto .
NATURE, 2007, 450 (7167) :203-218
[6]   RADSeq: next-generation population genetics [J].
Davey, John L. ;
Blaxter, Mark W. .
BRIEFINGS IN FUNCTIONAL GENOMICS, 2010, 9 (5-6) :416-423
[7]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[8]   Search and clustering orders of magnitude faster than BLAST [J].
Edgar, Robert C. .
BIOINFORMATICS, 2010, 26 (19) :2460-2461
[9]  
Emerson K. J., 2010, RESOLVING POSTGLACIA
[10]   Field guide to next-generation DNA sequencers [J].
Glenn, Travis C. .
MOLECULAR ECOLOGY RESOURCES, 2011, 11 (05) :759-769