Phylogenomics from Whole Genome Sequences Using aTRAM

被引:61
|
作者
Allen, Julie M. [1 ]
Boyd, Bret [1 ,2 ]
Nam-Phuong Nguyen [3 ]
Vachaspati, Pranjal [4 ]
Warnow, Tandy [3 ,4 ,12 ]
Huang, Daisie I. [5 ]
Grady, Patrick G. S. [1 ]
Bell, Kayce C. [6 ,7 ]
Cronk, Quentin C. B. [5 ]
Mugisha, Lawrence [8 ,9 ]
Pittendrigh, Barry R. [10 ]
Soledad Leonardi, M. [11 ]
Reed, David L. [2 ]
Johnson, Kevin P. [1 ]
机构
[1] Univ Illinois, Illinois Nat Hist Survey, Urbana, IL 61801 USA
[2] Univ Florida, Florida Museum Nat Hist, Gainesville, FL 32611 USA
[3] Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL 61801 USA
[4] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[5] Univ British Columbia, Biodivers Res Ctr, Vancouver, BC V6T 1Z4, Canada
[6] Univ New Mexico, Dept Biol, Albuquerque, NM 87131 USA
[7] Univ New Mexico, Museum Southwestern Biol, Albuquerque, NM 87131 USA
[8] CEHA, Kampala, Uganda
[9] Makerere Univ, Anim Resources & Biosecur COVAB, Coll Vet Med, Kampala, Uganda
[10] Michigan State Univ, Dept Entomol, E Lansing, MI 48823 USA
[11] Ctr Nacl Patagen, Inst Biol Organismos Marinos, Puerto Madryn, Argentina
[12] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
aTRAM; gene assembly; genome sequencing; phylogenomics; ULTRACONSERVED ELEMENTS; READ ALIGNMENT; TREE; ENDOSYMBIONT; ENRICHMENT; THOUSANDS; NUCLEAR; RESOLVE; TAXA; AVES;
D O I
10.1093/sysbio/syw105
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes (< 1000 Mbp) it is feasible to sequence the entire genome at modest coverage (10-30x). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice ( Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close-to distantly related taxa at high to low levels of coverage. Both the concatenated analysis and the coalescent-based analysis produced the same tree topology, which was consistent with previously published results and resolved weakly supported nodes. These results demonstrate that this approach is successful at developing phylogenomic data sets from raw genome sequencing reads. Further, we found that with coverages above 5-10x, aTRAM was successful at assembling 80-90% of the contigs for both close and distantly related taxa. As sequencing costs continue to decline, we expect full genome sequencing will become more feasible for a wider array of organisms, and aTRAM will enable mining of these genomic data sets for an extensive variety of applications, including phylogenomics.
引用
收藏
页码:786 / 798
页数:13
相关论文
共 50 条
  • [31] Phylogenomics and historical biogeography of the cleptoparasitic bee genus Nomada (Hymenoptera: Apidae) using ultraconserved elements
    Odanaka, Katherine A.
    Branstetter, Michael G.
    Tobin, Kerrigan B.
    Rehan, Sandra M.
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2022, 170
  • [32] Phylogenomics and mitochondrial genome evolution of the gall-associated doryctine wasp genera (Hymenoptera: Braconidae)
    Samaca-Saenz, Ernesto
    Meza-Lazaro, Rubi N.
    Branstetter, Michael G.
    Zaldivar-Riveron, Alejandro
    SYSTEMATICS AND BIODIVERSITY, 2019, 17 (08) : 731 - 744
  • [33] The draft mitochondrial genome of Magnolia biondii and mitochondrial phylogenomics of angiosperms
    Dong, Shanshan
    Chen, Lu
    Liu, Yang
    Wang, Yaling
    Zhang, Suzhou
    Yang, Leilei
    Lang, Xiaoan
    Zhang, Shouzhou
    PLOS ONE, 2020, 15 (04):
  • [34] Genome Evolution and the Future of Phylogenomics of Non-Avian Reptiles
    Card, Daren C.
    Jennings, W. Bryan
    Edwards, Scott V.
    ANIMALS, 2023, 13 (03):
  • [35] Champagne: Automated Whole-Genome Phylogenomic Character Matrix Method Using Large Genomic Indels for Homoplasy-Free Inference
    Schull, James K.
    Turakhia, Yatish
    Hemker, James A.
    Dally, William J.
    Bejerano, Gill
    GENOME BIOLOGY AND EVOLUTION, 2022, 14 (03):
  • [36] A comprehensive biomedical variant catalogue based on whole genome sequences of 582 dogs and eight wolves
    Jagannathan, V
    Droegemueller, C.
    Leeb, T.
    Aguirre, Gustavo
    Andre, Catherine
    Bannasch, Danika
    Becker, Doreen
    Davis, Brian
    Drogemuller, Cord
    Ekenstedt, Kari
    Faller, Kiterie
    Forman, Oliver
    Friedenberg, Steve
    Furrow, Eva
    Giger, Urs
    Hitte, Christophe
    Hytonen, Marjo
    Lohi, Hannes
    Mellersh, Cathryn
    Mickelson, James R.
    Murgiano, Leonardo
    Oberbauer, Anita
    Schmutz, Sheila
    Schoenebeck, Jeffrey
    Summers, Kim
    van Steenbeek, Frank
    Wade, Claire
    ANIMAL GENETICS, 2019, 50 (06) : 695 - 704
  • [37] Whole genome sequences and annotation of Micrococcus luteus SUBG006, a novel phytopathogen of mango
    Rakhashiya, Purvi M.
    Patel, Pooja P.
    Thaker, Vrinda S.
    GENOMICS DATA, 2015, 6 : 10 - +
  • [38] Distinguishing mitochondrial DNA and NUMT sequences amplified with the precision ID mtDNA whole genome panel
    Cihlar, Jennifer Churchill
    Strobl, Christina
    Lagace, Robert
    Muenzler, Melissa
    Parson, Walther
    Budowle, Bruce
    MITOCHONDRION, 2020, 55 : 122 - 133
  • [39] Phylogenomics using Target-Restricted Assembly Resolves Intrageneric Relationships of Parasitic Lice (Phthiraptera: Columbicola)
    Boyd, Bret M.
    Allen, Julie M.
    Nam-Phuong Nguyen
    Sweet, Andrew D.
    Warnow, Tandy
    Shapiro, Michael D.
    Villa, Scott M.
    Bush, Sarah E.
    Clayton, Dale H.
    Johnson, Kevin P.
    SYSTEMATIC BIOLOGY, 2017, 66 (06) : 896 - 911
  • [40] Whole genome sequences of a free-living Pseudomonas sp strain ML96 isolated from a freshwater Maar Lake
    Li, Xiuling
    Blom, Jochen
    Zeng, Yonghui
    MARINE GENOMICS, 2015, 24 : 219 - 221