Phylogenomics from Whole Genome Sequences Using aTRAM

被引:61
|
作者
Allen, Julie M. [1 ]
Boyd, Bret [1 ,2 ]
Nam-Phuong Nguyen [3 ]
Vachaspati, Pranjal [4 ]
Warnow, Tandy [3 ,4 ,12 ]
Huang, Daisie I. [5 ]
Grady, Patrick G. S. [1 ]
Bell, Kayce C. [6 ,7 ]
Cronk, Quentin C. B. [5 ]
Mugisha, Lawrence [8 ,9 ]
Pittendrigh, Barry R. [10 ]
Soledad Leonardi, M. [11 ]
Reed, David L. [2 ]
Johnson, Kevin P. [1 ]
机构
[1] Univ Illinois, Illinois Nat Hist Survey, Urbana, IL 61801 USA
[2] Univ Florida, Florida Museum Nat Hist, Gainesville, FL 32611 USA
[3] Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL 61801 USA
[4] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[5] Univ British Columbia, Biodivers Res Ctr, Vancouver, BC V6T 1Z4, Canada
[6] Univ New Mexico, Dept Biol, Albuquerque, NM 87131 USA
[7] Univ New Mexico, Museum Southwestern Biol, Albuquerque, NM 87131 USA
[8] CEHA, Kampala, Uganda
[9] Makerere Univ, Anim Resources & Biosecur COVAB, Coll Vet Med, Kampala, Uganda
[10] Michigan State Univ, Dept Entomol, E Lansing, MI 48823 USA
[11] Ctr Nacl Patagen, Inst Biol Organismos Marinos, Puerto Madryn, Argentina
[12] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
aTRAM; gene assembly; genome sequencing; phylogenomics; ULTRACONSERVED ELEMENTS; READ ALIGNMENT; TREE; ENDOSYMBIONT; ENRICHMENT; THOUSANDS; NUCLEAR; RESOLVE; TAXA; AVES;
D O I
10.1093/sysbio/syw105
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes (< 1000 Mbp) it is feasible to sequence the entire genome at modest coverage (10-30x). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice ( Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close-to distantly related taxa at high to low levels of coverage. Both the concatenated analysis and the coalescent-based analysis produced the same tree topology, which was consistent with previously published results and resolved weakly supported nodes. These results demonstrate that this approach is successful at developing phylogenomic data sets from raw genome sequencing reads. Further, we found that with coverages above 5-10x, aTRAM was successful at assembling 80-90% of the contigs for both close and distantly related taxa. As sequencing costs continue to decline, we expect full genome sequencing will become more feasible for a wider array of organisms, and aTRAM will enable mining of these genomic data sets for an extensive variety of applications, including phylogenomics.
引用
收藏
页码:786 / 798
页数:13
相关论文
共 50 条
  • [1] Phylogenomics from low-coverage whole-genome sequencing
    Zhang, Feng
    Ding, Yinhuan
    Zhu, Chao-Dong
    Zhou, Xin
    Orr, Michael C.
    Scheu, Stefan
    Luan, Yun-Xia
    METHODS IN ECOLOGY AND EVOLUTION, 2019, 10 (04): : 507 - 517
  • [2] Phylogenomics using low-depth whole genome sequencing: A case study with the olive tribe
    Olofsson, Jill K.
    Cantera, Isabel
    Van de Paer, Celine
    Hong-Wa, Cynthia
    Zedane, Loubab
    Dunning, Luke T.
    Alberti, Adriana
    Christin, Pascal-Antoine
    Besnard, Guillaume
    MOLECULAR ECOLOGY RESOURCES, 2019, 19 (04) : 877 - 892
  • [3] A revised phylogeny of Boletaceae using whole genome sequences
    Tremble, Keaton
    Henkel, Terry
    Bradshaw, Alexander
    Domnauer, Colin
    Brown, Lyda M.
    Tham, Le Xuan
    Furci, Guliana
    Aime, M. Catherine
    Moncalvo, Jean-Marc
    Dentinger, Bryn
    MYCOLOGIA, 2024, 116 (03) : 392 - 408
  • [4] Whole Genome Shotgun Phylogenomics Resolves the Pattern and Timing of Swallowtail Butterfly Evolution
    Allio, Rmi
    Scornavacca, Cline
    Nabholz, Benoit
    Clamens, Anne-Laure
    Sperling, Felix A. H.
    Condamine, Fabien L.
    SYSTEMATIC BIOLOGY, 2020, 69 (01) : 38 - 60
  • [5] Beyond the whole genome consensus: Unravelling of PRRSV phylogenomics using next generation sequencing technologies
    Lu, Zen H.
    Archibald, Alan L.
    Ait-Ali, Tahar
    VIRUS RESEARCH, 2014, 194 : 167 - 174
  • [6] Molecular phylogenomics of the tribe Shoreeae (Dipterocarpaceae) using whole plastid genomes
    Heckenhauer, Jacqueline
    Paun, Ovidiu
    Chase, Mark W.
    Ashton, Peter S.
    Kamariah, A. S.
    Samuel, Rosabelle
    ANNALS OF BOTANY, 2019, 123 (05) : 857 - 865
  • [7] Unravelling hybridization in Phytophthora using phylogenomics and genome size estimation
    Van Poucke, Kris
    Haegeman, Annelies
    Goedefroit, Thomas
    Focquet, Fran
    Leus, Leen
    Jung, Marilia Horta
    Nave, Corina
    Redondo, Miguel Angel
    Husson, Claude
    Kostov, Kaloyan
    Lyubenova, Aneta
    Christova, Petya
    Chandelier, Anne
    Slavov, Slavcho
    de Cock, Arthur
    Bonants, Peter
    Werres, Sabine
    Palau, Jonas Oliva
    Marcais, Benoit
    Jung, Thomas
    Stenlid, Jan
    Ruttink, Tom
    Heungens, Kurt
    IMA FUNGUS, 2021, 12 (01)
  • [8] Detecting Phylogenetic Signals in Eukaryotic Whole Genome Sequences
    Cohen, Eyal
    Chor, Benny
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (08) : 945 - 956
  • [9] Putting the genome in insect phylogenomics
    Johnson, Kevin P.
    CURRENT OPINION IN INSECT SCIENCE, 2019, 36 : 111 - 117
  • [10] Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences
    Bryant, Josephine
    Chewapreecha, Claire
    Bentley, Stephen D.
    FUTURE MICROBIOLOGY, 2012, 7 (11) : 1283 - 1296