Gene loss rate: a probabilistic measure for the conservation of eukaryotic genes

被引:24
作者
Borenstein, Elhanan [1 ]
Shlomi, Tomer
Ruppin, Eytan
Sharan, Roded
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Sch Med, IL-69978 Tel Aviv, Israel
基金
以色列科学基金会;
关键词
D O I
10.1093/nar/gkl792
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The rate of conservation of a gene in evolution is believed to be correlated with its biological importance. Recent studies have devised various conservation measures for genes and have shown that they are correlated with several biological characteristics of functional importance. Specifically, the state-of-the-art propensity for gene loss (PGL) measure was shown to be strongly correlated with gene essentiality and its number of protein-protein interactions (PPIs). The observed correlation between conservation and functional importance varies however between conservation measures, underscoring the need for accurate and general measures for the rate of gene conservation. Here we develop a novel maximum-likelihood approach to computing the rate in which a gene is lost in evolution, motivated by the same principles as those underlying PGL. However, in difference to PGL which considers only the most parsimonious ancestral states of the internal nodes of the phylogenetic tree relating the species, our approach weighs in a probabilistic manner all possible ancestral states, and includes the branch length information as part of the probabilistic model. In application to data of 16 eukaryotic genomes, our approach shows higher correlations with experimental data than PGL, including data on gene lethality, level of connectivity in a PPI network and coherence within functionally related genes.
引用
收藏
页数:8
相关论文
共 34 条
  • [1] [Anonymous], 2003, BMC EVOL BIOL
  • [2] Comparative genomics and disorder prediction identify biologically relevant SH3 protein interactions
    Beltrao, P
    Serrano, L
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (03) : 202 - 211
  • [3] CSUROS M, 2006, 10 ANN INT C RES COM, P206
  • [4] Deng Minghua, 2003, Pac Symp Biocomput, P140
  • [5] DUDLEY A, 2005, MOL SYST BIOL
  • [6] Durbin R., 1998, Biological sequence analysis: Probabilistic models of proteins and nucleic acids
  • [7] PHYLOGENETIC ANALYSIS UNDER DOLLOS LAW
    FARRIS, JS
    [J]. SYSTEMATIC ZOOLOGY, 1977, 26 (01): : 77 - 88
  • [8] EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH
    FELSENSTEIN, J
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) : 368 - 376
  • [9] Evolutionary rate in the protein interaction network
    Fraser, HB
    Hirsh, AE
    Steinmetz, LM
    Scharfe, C
    Feldman, MW
    [J]. SCIENCE, 2002, 296 (5568) : 750 - 752
  • [10] An insect molecular clock dates the origin of the insects and accords with palaeontological and biogeographic landmarks
    Gaunt, MW
    Miles, MA
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (05) : 748 - 761