Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade

被引:2
|
作者
Polzin, Kenneth [1 ]
Rokas, Antonis [1 ]
机构
[1] Vanderbilt Univ, Dept Biol Sci, Nashville, TN 37235 USA
来源
PLOS ONE | 2014年 / 9卷 / 03期
基金
美国国家科学基金会;
关键词
GENOME-WIDE ANALYSIS; REPLACEMENTS; PHYLOGENETICS; COELOMATA; SEQUENCES; HOMOPLASY; EVOLUTION; ANIMALS; GENUS;
D O I
10.1371/journal.pone.0092213
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
When inferring phylogenetic relationships, not all sites in a sequence alignment are equally informative. One recently proposed approach that takes advantage of this inequality relies on sites that contain amino acids whose replacement requires multiple substitutions. Identifying these so-called RGC_CAM substitutions (after Rare Genomic Changes as Conserved Amino acids-Multiple substitutions) requires that, first, at any given site in the amino acid sequence alignment, there must be a minimum of two different amino acids; second, each amino acid must be present in at least two taxa; and third, the amino acids must require a minimum of two nucleotide substitutions to replace each other. Although theory suggests that RGC_CAM substitutions are expected to be rare and less likely to be homoplastic, the informativeness of RGC_CAM substitutions has not been extensively evaluated in biological data sets. We investigated the quality of RGC_CAM substitutions by examining their degree of homoplasy and internode certainty in nearly 2.7 million aligned amino acid sites from 5,261 proteins from five species belonging to the yeast Saccharomyces sensu stricto clade whose phylogeny is well-established. We identified 2,647 sites containing RGC_CAM substitutions, a number that contrasts sharply with the 100,887 sites containing RGC_non-CAM substitutions (i.e., changes between amino acids that require only a single nucleotide substitution). We found that RGC_CAM substitutions had significantly lower homoplasy than RGC_non-CAM ones; specifically RGC_CAM substitutions showed a per-site average homoplasy index of 0.100, whereas RGC_non-CAM substitutions had a homoplasy index of 0.215. Internode certainty values were also higher for sites containing RGC_CAM substitutions than for RGC_non-CAM ones. These results suggest that RGC_ CAM substitutions possess a strong phylogenetic signal and are useful markers for phylogenetic inference despite their rarity.
引用
收藏
页数:6
相关论文
共 4 条
  • [1] Evaluating Ortholog Prediction Algorithms in a Yeast Model Clade
    Salichos, Leonidas
    Rokas, Antonis
    PLOS ONE, 2011, 6 (04):
  • [2] Analysis of rare amino acid replacements supports the coelomata clade
    Rogozin, Igor B.
    Wolf, Yuri I.
    Carmel, Liran
    Koonin, Eugene V.
    MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (12) : 2594 - 2597
  • [3] Ecdysozoan clade rejected by genome-wide analysis of rare amino acid replacements
    Rogozin, Igor B.
    Wolf, Yuri I.
    Carmel, Liran
    Koonin, Eugene V.
    MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (04) : 1080 - 1090
  • [4] Pharmacodynamic Evaluation of Zoliflodacin Treatment of Neisseria gonorrhoeae Strains With Amino Acid Substitutions in the Zoliflodacin Target GyrB Using a Dynamic Hollow Fiber Infection Model
    Jacobsson, Susanne
    Golparian, Daniel
    Oxelbark, Joakim
    Franceschi, Francois
    Brown, David
    Louie, Arnold
    Drusano, George
    Unemo, Magnus
    FRONTIERS IN PHARMACOLOGY, 2022, 13