Detecting Phylogenetic Signals in Eukaryotic Whole Genome Sequences

Cited by: 9
Authors
Cohen, Eyal [1]
Chor, Benny [1]
Affiliations
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
Keywords
alignment-free sequence comparison; average common subsequence (ACS) method; reconstructing multicellular eukaryotic phylogeny; phylogenetic signal; whole genome phylogeny; MAXIMUM-LIKELIHOOD; TREE; DATABASE; MAMMALS; DISTANCES; ALIGNMENT;
DOI
10.1089/cmb.2012.0122
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Whole genome sequences are a rich source of molecular data, with a potential for the discovery of novel evolutionary information. Yet, many parts of these sequences are not known to be under evolutionary pressure and, thus, are not conserved. Furthermore, a good model for whole genome evolution does not exist. Consequently, it is not a priori clear whether a meaningful phylogenetic signal exists and can be extracted from the sequences as a whole. Indeed, very few phylogenies were reconstructed based on these sequences. Prior to this work, only two reconstruction methods were applied to large eukaryotic genomes: the K-r method (Haubold et al., 2009), which was applied to genomes of rather small diversity (Drosophila species), and the feature frequency profile (FFP) method (Sims et al., 2009a), which was applied to genomes of moderate diversity (mammals). We investigate the whole genome-based phylogenetic reconstruction question with respect to a much wider taxonomic sample. We apply K-r, FFP, and an alternative alignment-free method, the average common subsequence (ACS) (Ulitsky et al., 2006), to 24 multicellular eukaryotes (vertebrates, invertebrates, and plants). We also apply ACS to the proteome sequences of these 24 taxa. We compare the resulting trees to a standard reference, the National Center for Biotechnology Information (NCBI) taxonomy tree. Trees produced by ACS(AA), based on proteomes, are in complete agreement with the NCBI tree. For the genome-based reconstruction, ACS(DNA) produces trees whose agreement with the NCBI tree is excellent to very good for divergence times up to 800 million years ago, medium at 1 billion years ago, and poor at 1.6 billion years ago. We conclude that whole genomes do carry a clear phylogenetic signal, yet this signal "saturates" with longer divergence times. Furthermore, among the few existing methods, ACS is the most capable of detecting this signal.
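The abstract names the ACS method without describing it. As a rough illustration only, the sketch below follows the formulation commonly attributed to Ulitsky et al. (2006): for each position in sequence A, take the length of the longest substring starting there that also occurs somewhere in B, average those lengths, and turn the average into a length-normalized, symmetrized distance. The formula, the function names, and the toy sequences are assumptions for illustration and are not taken from this record; a practical implementation would use suffix arrays or suffix trees rather than this naive substring search.

```python
import math


def longest_match_lengths(a: str, b: str) -> list[int]:
    """For each position i in a, the length of the longest prefix of a[i:]
    that occurs anywhere in b (naive search, for illustration only)."""
    lengths = []
    for i in range(len(a)):
        k = 0
        # If a[i:i+k+1] occurs in b, every shorter prefix does too,
        # so extending one character at a time is sufficient.
        while i + k < len(a) and a[i:i + k + 1] in b:
            k += 1
        lengths.append(k)
    return lengths


def acs_directed(a: str, b: str) -> float:
    """Directed ACS-style distance from a to b (assumed formulation)."""
    mean_len = sum(longest_match_lengths(a, b)) / len(a)
    if mean_len == 0:
        return math.inf  # no shared characters at all
    # log|b| / L(a, b), minus an approximate self-comparison correction term.
    return math.log(len(b)) / mean_len - 2 * math.log(len(a)) / len(a)


def acs_distance(a: str, b: str) -> float:
    """Symmetrized distance: average of the two directed values."""
    return (acs_directed(a, b) + acs_directed(b, a)) / 2


if __name__ == "__main__":
    similar = acs_distance("ACGTACGTTGCA", "ACGTACGATGCA")
    dissimilar = acs_distance("ACGTACGTTGCA", "GGGGCCCCGGGG")
    print(similar < dissimilar)
```

In a pipeline like the one the abstract outlines, pairwise distances of this kind would be collected into a matrix and passed to a distance-based tree-building method; the record does not say which one the authors used.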
Pages: 945 - 956
Page count: 12
Related papers
50 in total
  • [21] The Complete Chloroplast Genome Sequences of 14 Curcuma Species: Insights Into Genome Evolution and Phylogenetic Relationships Within Zingiberales
    Liang, Heng
    Zhang, Yan
    Deng, Jiabin
    Gao, Gang
    Ding, Chunbang
    Zhang, Li
    Yang, Ruiwu
    FRONTIERS IN GENETICS, 2020, 11
  • [22] Phylogenetic analysis of the eukaryotic RNA (cytosine-5)-methyltransferases
    Pavlopoulou, Athanasia
    Kossida, Sophia
    GENOMICS, 2009, 93 (04) : 350 - 357
  • [23] Comparative phylogenetic analyses of Chinese Horsfieldia (Myristicaceae) using complete chloroplast genome sequences
    Cai, Chao-Nan
    Ma, Hui
    Ci, Xiu-Qin
    Conran, John G.
    Li, Jie
    JOURNAL OF SYSTEMATICS AND EVOLUTION, 2021, 59 (03) : 504 - 514
  • [24] The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses
    Zhang, Yanjun
    Du, Liuwen
    Liu, Ao
    Chen, Jianjun
    Wu, Li
    Hu, Weiming
    Zhang, Wei
    Kim, Kyunghee
    Lee, Sang-Choon
    Yang, Tae-Jin
    Wang, Ying
    FRONTIERS IN PLANT SCIENCE, 2016, 7
  • [25] Extracting phylogenetic signal and accounting for bias in whole-genome data sets supports the Ctenophora as sister to remaining Metazoa
    Borowiec, Marek L.
    Lee, Ernest K.
    Chiu, Joanna C.
    Plachetzki, David C.
    BMC GENOMICS, 2015, 16
  • [26] AN EVALUATION OF THE HYBRID SPECIATION HYPOTHESIS FOR XIPHOPHORUS CLEMENCIAE BASED ON WHOLE GENOME SEQUENCES
    Schumer, Molly
    Cui, Rongfeng
    Boussau, Bastien
    Walter, Ronald
    Rosenthal, Gil
    Andolfatto, Peter
    EVOLUTION, 2013, 67 (04) : 1155 - 1168
  • [27] Discrete Wavelet Packet Transform Based Discriminant Analysis for Whole Genome Sequences
    Huang, Hsin-Hsiung
    Girimurugan, Senthil Balaji
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (02)
  • [28] Monophyly of clade III nematodes is not supported by phylogenetic analysis of complete mitochondrial genome sequences
    Park, Joong-Ki
    Sultana, Tahera
    Lee, Sang-Hwa
    Kang, Seokha
    Kim, Hyong Kyu
    Min, Gi-Sik
    Eom, Keeseon S.
    Nadler, Steven A.
    BMC GENOMICS, 2011, 12
  • [29] Detecting taxonomic and phylogenetic signals in equid cheek teeth: towards new palaeontological and archaeological proxies
    Cucchi, T.
    Mohaseb, A.
    Peigne, S.
    Debue, K.
    Orlando, L.
    Mashkour, M.
    ROYAL SOCIETY OPEN SCIENCE, 2017, 4 (04)
  • [30] Phylogenomic Resolution of the Phylogeny of Laurasiatherian Mammals: Exploring Phylogenetic Signals within Coding and Noncoding Sequences
    Chen, Meng-Yun
    Liang, Dan
    Zhang, Peng
    GENOME BIOLOGY AND EVOLUTION, 2017, 9 (08) : 1998 - 2012