An efficient and robust laboratory workflow and tetrapod database for larger scale environmental DNA studies

被引:32
作者
Axtner, Jan [1 ]
Crampton-Platt, Alex [1 ]
Hoerig, Lisa A. [1 ]
Mohamed, Azlan [1 ]
Xu, Charles C. Y. [2 ,3 ,4 ,5 ]
Yu, Douglas W. [2 ,6 ]
Wilting, Andreas [1 ]
机构
[1] Leibniz Inst Zoo & Wildlife Res, Dept Ecol Dynam, Alfred Kowalke Str 17, D-10315 Berlin, Germany
[2] Chinese Acad Sci, Kunming Inst Zool, State Key Lab Genet Resources & Evolut, 32 Jiaochang East Rd, Kunming 650223, Yunnan, Peoples R China
[3] Univ Groningen, Groningen Inst Evolutionary Life Sci, POB 11103, NL-9700 CC Groningen, Netherlands
[4] McGill Univ, Redpath Museum, 859 Sherbooke St West, Montreal, PQ H3A 2K6, Canada
[5] McGill Univ, Dept Biol, 859 Sherbooke St West, Montreal, PQ H3A 2K6, Canada
[6] Univ East Anglia, Sch Biol Sci, Norwich Res Pk, Norwich NR4 7TJ, Norfolk, England
基金
中国国家自然科学基金;
关键词
metabarcoding; invertebrate-derived; -DNA; environmental DNA; leeches; TAXONOMIC CLASSIFICATION; FALSE POSITIVES; IDENTIFICATION; PERFORMANCE; MOSQUITOS; LEECHES; SEARCH; ERRORS; MAFFT; LIMIT;
D O I
10.1093/gigascience/giz029
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The use of environmental DNA for species detection via metabarcoding is growing rapidly. We present a co-designed lab workflow and bioinformatic pipeline to mitigate the 2 most important risks of environmental DNA use: sample contamination and taxonomic misassignment. These risks arise from the need for polymerase chain reaction (PCR) amplification to detect the trace amounts of DNA combined with the necessity of using short target regions due to DNA degradation. Findings: Our high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 PCR replicates and 2 extraction replicates; (ii) using multi-markers (12S, 16S, CytB); (iii) a "twin-tagging," 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases. Because annotation errors in the reference sequences can result in taxonomic misassignment, we supply a protocol for curating sequence datasets. For some taxonomic groups and some markers, curation resulted in >50% of sequences being deleted from public reference databases, owing to (i) limited overlap between our target amplicon and reference sequences, (ii) mislabelling of reference sequences, and (iii) redundancy. Finally, we provide a bioinformatic pipeline to process amplicons and conduct PROTAX assignment and tested it on an invertebrate-derived DNA dataset from 1,532 leeches from Sabah, Malaysia. Twin-tagging allowed us to detect and exclude sequences with non-matching tags. The smallest DNA fragment (16S) amplified most frequently for all samples but was less powerful for discriminating at species rank. Using a stringent and lax acceptance criterion we found 162 (stringent) and 190 (lax) vertebrate detections of 95 (stringent) and 109 (lax) leech samples. Conclusions: Our metabarcoding workflow should help research groups increase the robustness of their results and therefore facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods.
引用
收藏
页数:17
相关论文
共 65 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   A comparative evaluation of sequence classification programs [J].
Bazinet, Adam L. ;
Cummings, Michael P. .
BMC BIOINFORMATICS, 2012, 13
[3]   Connecting Earth observation to high-throughput biodiversity data [J].
Bush, Alex ;
Sollmann, Rahel ;
Wilting, Andreas ;
Bohmann, Kristine ;
Cole, Beth ;
Balzter, Heiko ;
Martius, Christopher ;
Zlinszky, Andras ;
Calvignac-Spencer, Sebastien ;
Cobbold, Christina A. ;
Dawson, Terence P. ;
Emerson, Brent C. ;
Ferrier, Simon ;
Gilbert, M. Thomas P. ;
Herold, Martin ;
Jones, Laurence ;
Leendertz, Fabian H. ;
Matthews, Louise ;
Millington, James D. A. ;
Olson, John R. ;
Ovaskainen, Otso ;
Raffaelli, Dave ;
Reeve, Richard ;
Roedel, Mark-Oliver ;
Rodgers, Torrey W. ;
Snape, Stewart ;
Visseren-Hamakers, Ingrid ;
Vogler, Alfried P. ;
White, Piran C. L. ;
Wooster, Martin J. ;
Yu, Douglas W. .
NATURE ECOLOGY & EVOLUTION, 2017, 1 (07)
[4]   An invertebrate stomach's view on vertebrate ecology Certain invertebrates could be used as "vertebrate samplers" and deliver DNA-based information on many aspects of vertebrate ecology [J].
Calvignac-Spencer, Sebastien ;
Leendertz, Fabian H. ;
Gilbert, M. Thomas P. ;
Schubert, Grit .
BIOESSAYS, 2013, 35 (11) :1004-1013
[5]   Carrion fly-derived DNA as a tool for comprehensive and cost-effective assessment of mammalian biodiversity [J].
Calvignac-Spencer, Sebastien ;
Merkel, Kevin ;
Kutzner, Nadine ;
Kuehl, Hjalmar ;
Boesch, Christophe ;
Kappeler, Peter M. ;
Metzger, Sonja ;
Schubert, Grit ;
Leendertz, Fabian H. .
MOLECULAR ECOLOGY, 2013, 22 (04) :915-924
[6]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[7]  
Chamberlain Scott A, 2013, F1000Res, V2, P191, DOI 10.12688/f1000research.2-191.v1
[8]   Uses and Misuses of Environmental DNA in Biodiversity Science and Conservation [J].
Cristescu, Melania E. ;
Hebert, Paul D. N. .
ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS, VOL 49, 2018, 49 :209-230
[9]   Still looking for that woodpecker [J].
Dalton, Rex .
NATURE, 2010, 463 (7282) :718-719
[10]   ednaoccupancy: An r package for multiscale occupancy modelling of environmental DNA data [J].
Dorazio, Robert M. ;
Erickson, Richard A. .
MOLECULAR ECOLOGY RESOURCES, 2018, 18 (02) :368-380