Predictors of sequence capture in a large-scale anchored phylogenomics project

被引:9
作者
Nunes, Renato [1 ,2 ]
Storer, Caroline [3 ]
Doleck, Tenzing [1 ,2 ]
Kawahara, Akito Y. [3 ,4 ,5 ]
Pierce, Naomi E. [6 ]
Lohman, David J. [1 ,2 ,7 ]
机构
[1] CUNY, City Coll New York, Biol Dept, New York, NY 10017 USA
[2] CUNY, Grad Ctr, Program Biol, New York, NY 10017 USA
[3] Univ Florida, McGuire Ctr Lepidoptera & Biodivers, Florida Museum Nat Hist, Gainesville, FL USA
[4] Univ Florida, Entomol & Nematol Dept, Gainesville, FL USA
[5] Univ Florida, Dept Biol, Gainesville, FL USA
[6] Harvard Univ, Dept Organism & Evolutionary Biol & Museum Compara, Cambridge, MA USA
[7] Natl Museum Nat Hist, Entomol Sect, Manila, Philippines
基金
美国国家科学基金会;
关键词
anchored hybrid enrichment; historical DNA; hybrid capture; Lepidoptera; museomics; archival DNA; Papilionoidea; phylogenomics; SPECIES-TREE ESTIMATION; ULTRACONSERVED ELEMENTS; MOLECULAR PHYLOGENY; FIELD PRESERVATION; SOFTWARE PACKAGE; DNA; EVOLUTIONARY; SAMPLES; ENRICHMENT; MARKERS;
D O I
10.3389/fevo.2022.943361
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Next-generation sequencing (NGS) technologies have revolutionized phylogenomics by decreasing the cost and time required to generate sequence data from multiple markers or whole genomes. Further, the fragmented DNA of biological specimens collected decades ago can be sequenced with NGS, reducing the need for collecting fresh specimens. Sequence capture, also known as anchored hybrid enrichment, is a method to produce reduced representation libraries for NGS sequencing. The technique uses single-stranded oligonucleotide probes that hybridize with pre-selected regions of the genome that are sequenced via NGS, culminating in a dataset of numerous orthologous loci from multiple taxa. Phylogenetic analyses using these sequences have the potential to resolve deep and shallow phylogenetic relationships. Identifying the factors that affect sequence capture success could save time, money, and valuable specimens that might be destructively sampled despite low likelihood of sequencing success. We investigated the impacts of specimen age, preservation method, and DNA concentration on sequence capture (number of captured sequences and sequence quality) while accounting for taxonomy and extracted tissue type in a large-scale butterfly phylogenomics project. This project used two probe sets to extract 391 loci or a subset of 13 loci from over 6,000 butterfly specimens. We found that sequence capture is a resilient method capable of amplifying loci in samples of varying age (0-111 years), preservation method (alcohol, papered, pinned), and DNA concentration (0.020 ng/mu l - 316 ng/ul). Regression analyses demonstrate that sequence capture is positively correlated with DNA concentration. However, sequence capture and DNA concentration are negatively correlated with sample age and preservation method. Our findings suggest that sequence capture projects should prioritize the use of alcohol-preserved samples younger than 20 years old when available. In the absence of such specimens, dried samples of any age can yield sequence data, albeit with returns that diminish with increasing age.
引用
收藏
页数:16
相关论文
共 128 条
[1]   A New Approach Using Targeted Sequence Capture for Phylogenomic Studies across Cactaceae [J].
Acha, Serena ;
Majure, Lucas C. .
GENES, 2022, 13 (02)
[2]   Deletion of ultraconserved elements yields viable mice [J].
Ahituv, Nadav ;
Zhu, Yiwen ;
Visel, Axel ;
Holt, Amy ;
Afzal, Veena ;
Pennacchio, Len A. ;
Rubin, Edward M. .
PLOS BIOLOGY, 2007, 5 (09) :1906-1911
[3]   RAD Capture (Rapture): Flexible and Efficient Sequence-Based Genotyping [J].
Ali, Omar A. ;
O'Rourke, Sean M. ;
Amish, Stephen J. ;
Meek, Mariah H. ;
Luikart, Gordon ;
Jeffres, Carson ;
Miller, Michael R. .
GENETICS, 2016, 202 (02) :389-+
[4]   Phylogenomics from Whole Genome Sequences Using aTRAM [J].
Allen, Julie M. ;
Boyd, Bret ;
Nam-Phuong Nguyen ;
Vachaspati, Pranjal ;
Warnow, Tandy ;
Huang, Daisie I. ;
Grady, Patrick G. S. ;
Bell, Kayce C. ;
Cronk, Quentin C. B. ;
Mugisha, Lawrence ;
Pittendrigh, Barry R. ;
Soledad Leonardi, M. ;
Reed, David L. ;
Johnson, Kevin P. .
SYSTEMATIC BIOLOGY, 2017, 66 (05) :786-798
[5]   Whole Genome Shotgun Phylogenomics Resolves the Pattern and Timing of Swallowtail Butterfly Evolution [J].
Allio, Rmi ;
Scornavacca, Cline ;
Nabholz, Benoit ;
Clamens, Anne-Laure ;
Sperling, Felix A. H. ;
Condamine, Fabien L. .
SYSTEMATIC BIOLOGY, 2020, 69 (01) :38-60
[6]   A Guide to Carrying Out a Phylogenomic Target Sequence Capture Project [J].
Andermann, Tobias ;
Jimenez, Maria Fernanda Torres ;
Matos-Maravi, Pavel ;
Batista, Romina ;
Blanco-Pastor, Jose L. ;
Gustafsson, A. Lovisa S. ;
Kistler, Logan ;
Liberal, Isabel M. ;
Oxelman, Bengt ;
Bacon, Christine D. ;
Antonelli, Alexandre .
FRONTIERS IN GENETICS, 2020, 10
[7]   SECAPR-a bioinformatics pipeline for the rapid and user-friendly processing of targeted enriched Illumina sequences, from raw reads to alignments [J].
Andermann, Tobias ;
Cano, Angela ;
Zizka, Alexander ;
Bacon, Christine ;
Antonelli, Alexandre .
PEERJ, 2018, 6
[8]   COMPARATIVE STUDIES OF AVIAN DNA BY RESTRICTION FRAGMENT LENGTH POLYMORPHISM ANALYSIS - CONVENIENT PROCEDURES ON BLOOD-SAMPLES FROM LIVE BIRDS [J].
ARCTANDER, P .
JOURNAL FUR ORNITHOLOGIE, 1988, 129 (02) :205-216
[9]   Whole-Genome Alignment and Comparative Annotation [J].
Armstrong, Joel ;
Fiddes, Ian T. ;
Diekhans, Mark ;
Paten, Benedict .
ANNUAL REVIEW OF ANIMAL BIOSCIENCES, VOL 7, 2019, 7 :41-64
[10]   The use of museum samples for large-scale sequence capture: a study of congeneric horseshoe bats (family Rhinolophidae) [J].
Bailey, Sebastian E. ;
Mao, Xiuguang ;
Struebig, Monika ;
Tsagkogeorga, Georgia ;
Csorba, Gabor ;
Heaney, Lawrence R. ;
Sedlock, Jodi ;
Stanley, William ;
Rouillard, Jean-Marie ;
Rossiter, Stephen J. .
BIOLOGICAL JOURNAL OF THE LINNEAN SOCIETY, 2016, 117 (01) :58-70