Data, time and money: evaluating the best compromise for inferring molecular phylogenies of non-model animal taxa

被引:20
作者
Zaharias, Paul [1 ]
Pante, Eric [2 ]
Gey, Delphine [3 ]
Fedosov, Alexander E. [4 ]
Puillandre, Nicolas [1 ]
机构
[1] Univ Antilles, Sorbonne Univ, Inst Systemat Evolut Biodiversite ISYEB, Museum Natl Hist Nat,CNRS,EPHE, 43 Rue Cuvier,CP 26, F-75005 Paris, France
[2] La Rochelle Univ, UMR 7266 CNRS, Littoral Environm & Soc LIENSs, 2 Rue Olympe de Gouges, F-17042 La Rochelle, France
[3] Museum Natl Hist Nat, Acquisit & Anal Donnees Hist Nat 2AD UMS 2700, Paris, France
[4] Russian Acad Sci, AN Severtsov Inst Ecol & Evolut, Leninsky Prospect 33, Moscow 119071, Russia
基金
俄罗斯科学基金会;
关键词
Phylogenomics; Transcriptomics; High throughput sequencing; Sanger sequencing; Non-model taxa; Turridae; SEQUENCE CAPTURE; TREE; DNA; SYSTEMATICS; INTROGRESSION; GASTROPODA; BOOTSTRAP; ALIGNMENT; SIGNALS; MARKERS;
D O I
10.1016/j.ympev.2019.106660
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For over a decade now, High Throughput sequencing (HTS) approaches have revolutionized phylogenetics, both in terms of data production and methodology. While transcriptomes and (reduced) genomes are increasingly used, generating and analyzing HTS datasets remain expensive, time consuming and complex for most nonmodel taxa. Indeed, a literature survey revealed that 74% of the molecular phylogenetics trees published in 2018 are based on data obtained through Sanger sequencing. In this context, our goal was to identify the strategy that would represent the best compromise among costs, time and robustness of the resulting tree. We sequenced and assembled 32 transcriptomes of the marine mollusk family Turridae, considered as a typical non-model animal taxon. From these data, we extracted the loci most commonly used in gastropod phylogenies (cox1, 12S, 16S, 28S, h3 and 18S), full mitogenomes, and a reduced nuclear transcriptome representation. With each dataset, we reconstructed phylogenies and compared their robustness and accuracy. We discuss the impact of missing data and the use of statistical tests, tree metrics, and supertree and supermatrix methods to further improve phylogenetic data acquisition pipelines. We evaluated the overall costs (time and money) in order to identify the best compromise for phylogenetic data sampling in non-model animal taxa. Although sequencing full mitogenomes seems to constitute the best compromise both in terms of costs and node support, they are known to induce biases in phylogenetic reconstructions. Rather, we recommend to systematically include loci commonly used for phylogenetics and taxonomy (i.e. DNA barcodes, rRNA genes, full mitogenomes, etc.) among the other loci when designing baits for capture.
引用
收藏
页数:11
相关论文
共 83 条
[1]   Delimiting species of marine gastropods (Turridae, Conoidea) using RAD sequencing in an integrative taxonomy framework [J].
Abdelkrim, Jawad ;
Aznar-Cormano, Laetitia ;
Buge, Barbara ;
Fedosov, Alexander ;
Kantor, Yuri ;
Zaharias, Paul ;
Puillandre, Nicolas .
MOLECULAR ECOLOGY, 2018, 27 (22) :4591-4611
[2]   Exon-Capture-Based Phylogeny and Diversification of the Venomous Gastropods (Neogastropoda, Conoidea) [J].
Abdelkrim, Jawad ;
Aznar-Cormano, Laetitia ;
Fedosov, Alexander E. ;
Kantor, Yuri I. ;
Lozouet, Pierre ;
Phuong, Mark A. ;
Zaharias, Paul ;
Puillandre, Nicolas .
MOLECULAR BIOLOGY AND EVOLUTION, 2018, 35 (10) :2355-2374
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]   Studying sources of incongruence in arthropod molecular phylogenies: Sea spiders (Pycnogonida) as a case study [J].
Arabi, Juliette ;
Cruaud, Corinne ;
Couloux, Arnaud ;
Hassanin, Alexandre .
COMPTES RENDUS BIOLOGIES, 2010, 333 (05) :438-453
[5]   Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers [J].
Baird, Nathan A. ;
Etter, Paul D. ;
Atwood, Tressa S. ;
Currey, Mark C. ;
Shiver, Anthony L. ;
Lewis, Zachary A. ;
Selker, Eric U. ;
Cresko, William A. ;
Johnson, Eric A. .
PLOS ONE, 2008, 3 (10)
[6]   A New Orthology Assessment Method for Phylogenomic Data: Unrooted Phylogenetic Orthology [J].
Ballesteros, Jesus A. ;
Hormiga, Gustavo .
MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (08) :2117-2134
[7]   Disk covering methods improve phylogenomic analyses [J].
Bayzid, Md Shamsuzzoha ;
Hunt, Tyler ;
Warnow, Tandy .
BMC GENOMICS, 2014, 15
[8]   A review of long-branch attraction [J].
Bergsten, J .
CLADISTICS, 2005, 21 (02) :163-193
[9]  
BI K, 2012, BMC GENOMICS, V13
[10]  
BOGDANOWICZ D, 2012, EVOL BIOINFORM, V8