Accuracy and power of Bayes prediction of amino acid sites under positive selection

被引:303
作者
Anisimova, M
Bielawski, JP
Yang, ZH
机构
[1] UCL, Dept Biol, Galton Lab, London WC1E 6BT, England
[2] UCL, Ctr Math & Phys Life Sci & Expt Biol, London WC1E 6BT, England
关键词
Bayes inference; likelihood; nonsynonymous-synonymous rate ratio; positive selection; posterior probability;
D O I
10.1093/oxfordjournals.molbev.a004152
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Bayes prediction quantifies uncertainty by assigning posterior probabilities. It Was used to identify amino acids in a protein under recurrent diversifying selection indicated by higher nonsynonymous, (d(N)) than synonymous (d(S)) substitution rates or by omega = d(N)/d(S) > 1. Parameters were estimated by maximum likelihood under a codon substitution model that assumed several classes of sites with different w ratios. The Bayes theorem was used to calculate the posterior probabilities of each site falling into these site classes. Here. we evaluate the performance of Bayes prediction of amino acids under positive selection by computer simulation. We measured the accuracy by the proportion of predicted sites that were truly under selection and the power by the proportion of true positively selected sites that were predicted by the method. The accuracy was slightly better for longer sequences, whereas the power was largely unaffected by the increase in sequence length. Both accuracy and power were higher for medium or highly diverged sequences than for similar sequences. We found that accuracy and power were unacceptably low when data contained only a few highly similar sequences. However, sampling a large number of lineage improved the performance substantially. Even for very similar sequences. accuracy and Power can he high if over 100 taxa are used in the analysis. We make the following recommendations: (1) prediction of positive selection sites is not feasible for a few closely related sequences: (2) using it large number of lineages is the best way to improve the accuracy and power of the prediction: and (3) multiple models of heterogeneous selective pressures among sites should he applied in real data analysis.
引用
收藏
页码:950 / 958
页数:9
相关论文
共 28 条
  • [1] MOLECULAR RESURRECTION OF AN EXTINCT ANCESTRAL PROMOTER FOR MOUSE L1
    ADEY, NB
    TOLLEFSBOL, TO
    SPARKS, AB
    EDGELL, MH
    HUTCHISON, CA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (04) : 1569 - 1573
  • [2] Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution
    Anisimova, M
    Bielawski, JP
    Yang, ZH
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (08) : 1585 - 1592
  • [3] Rapid evolution in plant chitinases: Molecular targets of selection in plant-pathogen coevolution
    Bishop, JG
    Dean, AM
    Mitchell-Olds, T
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (10) : 5322 - 5327
  • [4] Positive selection on the H3 hemagglutinin gene of human influenza virus A
    Bush, RM
    Fitch, WM
    Bender, CA
    Cox, NJ
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (11) : 1457 - 1465
  • [5] Angiotensin II-forming activity in a reconstructed ancestral chymase
    Chandrasekharan, UM
    Sanker, S
    Glynias, MJ
    Karnik, SS
    Husain, A
    [J]. SCIENCE, 1996, 271 (5248) : 502 - 505
  • [6] Chang BSW, 2002, METHOD ENZYMOL, V343, P274
  • [7] Protein engineering reveals ancient adaptive replacements in isocitrate dehydrogenase
    Dean, AM
    Golding, GB
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (07) : 3104 - 3109
  • [8] Large-scale search for genes on which positive selection may operate
    Endo, T
    Ikeo, K
    Gojobori, T
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (05) : 685 - 690
  • [9] Evidence for positive selection in the capsid protein-coding region of the foot-and-mouth disease virus (FMDV) subjected to experimental passage regimens
    Fares, MA
    Moya, A
    Escarmís, C
    Baranowski, E
    Domingo, E
    Barrio, E
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (01) : 10 - 21
  • [10] Long term trends in the evolution of H(3) HA1 human influenza type A
    Fitch, WM
    Bush, RM
    Bender, CA
    Cox, NJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (15) : 7712 - 7718