Detecting Phylogenetic Signals in Eukaryotic Whole Genome Sequences

Cited by: 9
Authors
Cohen, Eyal [1 ]
Chor, Benny [1 ]
Affiliations
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
Keywords
alignment-free sequence comparison; average common subsequence (ACS) method; reconstructing multicellular eukaryotic phylogeny; phylogenetic signal; whole genome phylogeny; MAXIMUM-LIKELIHOOD; TREE; DATABASE; MAMMALS; DISTANCES; ALIGNMENT;
DOI
10.1089/cmb.2012.0122
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Whole genome sequences are a rich source of molecular data, with potential for the discovery of novel evolutionary information. Yet many parts of these sequences are not known to be under evolutionary pressure and are therefore not conserved. Furthermore, no good model of whole genome evolution exists. Consequently, it is not clear a priori whether a meaningful phylogenetic signal exists and can be extracted from the sequences as a whole. Indeed, very few phylogenies have been reconstructed from these sequences. Prior to this work, only two reconstruction methods had been applied to large eukaryotic genomes: the K_r method (Haubold et al., 2009), which was applied to genomes of rather small diversity (Drosophila species), and the feature frequency profile (FFP) method (Sims et al., 2009a), which was applied to genomes of moderate diversity (mammals). We investigate the whole genome-based phylogenetic reconstruction question with respect to a much wider taxonomic sample. We apply K_r, FFP, and an alternative alignment-free method, the average common subsequence (ACS) method (Ulitsky et al., 2006), to 24 multicellular eukaryotes (vertebrates, invertebrates, and plants). We also apply ACS to the proteome sequences of these 24 taxa. We compare the resulting trees to a standard reference, the National Center for Biotechnology Information (NCBI) taxonomy tree. Trees produced by ACS(AA), based on proteomes, are in complete agreement with the NCBI tree. For the genome-based reconstruction, ACS(DNA) produces trees whose agreement with the NCBI tree is excellent to very good for divergence times up to 800 million years ago, medium at 1 billion years ago, and poor at 1.6 billion years ago. We conclude that whole genomes do carry a clear phylogenetic signal, yet this signal "saturates" with longer divergence times. Furthermore, of the few existing methods, ACS is the most capable of detecting this signal.
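The abstract names the ACS method but does not define it. As a rough illustration only, the sketch below assumes the formulation commonly attributed to Ulitsky et al. (2006): for each position in sequence A, take the length of the longest string starting there that also occurs somewhere in B, average these lengths, and turn the average into a length-normalized, symmetrized distance. The function names, the naive quadratic search, and the exact self-comparison correction term are assumptions made for this example, not the authors' implementation, which would need suffix-based indexing to handle whole genomes.

```python
import math


def longest_match_lengths(a: str, b: str) -> list[int]:
    """For each start i in a, length of the longest prefix of a[i:] found anywhere in b.

    Naive search for illustration; real genomes need a suffix array or suffix tree.
    """
    lengths = []
    for i in range(len(a)):
        k = 0
        while i + k < len(a) and a[i:i + k + 1] in b:
            k += 1
        lengths.append(k)
    return lengths


def acs_directed(a: str, b: str) -> float:
    """Directed ACS-style distance from a to b (assumed formulation)."""
    avg = sum(longest_match_lengths(a, b)) / len(a)
    if avg == 0:
        return math.inf  # no shared characters at all
    # Self-comparison term: comparing a with itself gives an average of (|a| + 1) / 2.
    self_avg = (len(a) + 1) / 2
    return math.log(len(b)) / avg - math.log(len(a)) / self_avg


def acs_distance(a: str, b: str) -> float:
    """Symmetrized distance: mean of the two directed values."""
    return 0.5 * (acs_directed(a, b) + acs_directed(b, a))


if __name__ == "__main__":
    # Toy sequences (hypothetical, not from the paper).
    x = "ACGTACGTGACC"
    y = "ACGTACGTGACG"  # differs from x in the last base
    z = "TTTTGGGGCCCA"  # shares only short strings with x
    print(acs_distance(x, y), acs_distance(x, z))
```

A matrix of such pairwise distances would then be passed to a distance-based tree-building method such as neighbor joining; which method the paper used is not stated in the abstract.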
Pages: 945-956
Page count: 12