Maximum-likelihood approach for gene family evolution under functional divergence

被引:225
作者
Gu, X [1 ]
机构
[1] Iowa State Univ, Ctr Bioinformat & Biol Stat, Dept Zool Genet, Ames, IA 50011 USA
关键词
gene family evolution; functional divergence; gene (genome) duplication; maximum likelihood; functional prediction;
D O I
10.1093/oxfordjournals.molbev.a003824
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
According to the observed alignment pattern (i.e., amino acid configuration), we studied two basic types of functional divergence of a protein family. Type I functional divergence after gene duplication results in altered functional constraints (i.e., different evolutionary rate) between duplicate genes, whereas type II results in no altered functional constraints but radical change in amino acid property between them (e.g., charge, hydrophobicity, etc.). Two statistical approaches, i.e., the subtree likelihood and the whole-tree likelihood, were developed for estimating the coefficients of (type I or type II) functional divergence. Numerical algorithms for obtaining maximum-likelihood estimates are also provided. Moreover. a posterior-based site-specific profile is implemented to predict critical amino acid residues that are responsible for type I and/or type II functional divergence after gene duplication. We compared the current likelihood with a fast method developed previously by examples; both show similar results. For handling altered functional constraints (type I functional divergence) in the large gene family with many member genes (clusters), which appears to be a normal case in postgenomics, the subtree likelihood provides a solution that is computationally feasible and robust against the uncertainty of the phylogeny. The cost of this feasibility is the approximation when frequencies of amino acids are very skewed. The potential bias and correction are discussed.
引用
收藏
页码:453 / 464
页数:12
相关论文
共 35 条
[1]   Predicting functions from protein sequences - where are the bottlenecks? [J].
Bork, P ;
Koonin, EV .
NATURE GENETICS, 1998, 18 (04) :313-318
[2]   Modeling residue usage in aligned protein sequences via maximum likelihood [J].
Bruno, WJ .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (10) :1368-1374
[3]   A METHOD TO PREDICT FUNCTIONAL RESIDUES IN PROTEINS [J].
CASARI, G ;
SANDER, C ;
VALENCIA, A .
NATURE STRUCTURAL BIOLOGY, 1995, 2 (02) :171-178
[4]  
DAYHOFF MO, 1978, ATLAS PROTEIN SEQ S3, V5, P342
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[7]   The structural basis of molecular adaptation [J].
Golding, GB ;
Dean, AM .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (04) :355-369
[8]   Statistical methods for testing functional divergence after gene duplication [J].
Gu, X .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (12) :1664-1674
[9]  
GU X, 1995, MOL BIOL EVOL, V12, P546
[10]   A simple method for estimating the parameter of substitution rate variation among sites [J].
Gu, X ;
Zhang, JZ .
MOLECULAR BIOLOGY AND EVOLUTION, 1997, 14 (11) :1106-1113