Patterns of gene duplication in Saccharomyces cerevisiae and Caenorhabditis elegans

被引:25
作者
Cavalcanti, ARO
Ferreira, R
Gu, ZL
Li, WH
机构
[1] Univ Chicago, Dept Ecol & Evolut, Chicago, IL 60637 USA
[2] Univ Fed Pernambuco, Dept Quim Fundamental, Recife, Brazil
关键词
gene duplication; block duplication; protein families; database cleaning; whole-genome duplication;
D O I
10.1007/s00239-002-2377-2
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this paper we present a new method for detecting block duplications in a genome. It is more stringent than previous ones in that it requires a more rigorous definition of paralogous genes and that it requires the paralogous proteins on the two blocks to be contiguous. In addition, it provides three criterion choices: (1) the same composition (i.e., having the same paralogues in the two windows), (2) the same composition and gene order, and (3) the same composition, gene order, and gene orientation. The method is completely automated, requiring no visual inspection as in previous methods. We applied it to analyze the complete genomes of S. cerevisiae and C. elegans. In yeast we detected fewer duplicated blocks than previously reported. In C. elegans, however, we detected more block duplications than previously reported, indicating that although our method has a more stringent definition of block duplication than previous ones, it may be more sensitive in detection because it considers every possible window rather than only fixed nonoverlapping windows. Our results show that block duplication is a common phenomenon in both organisms. The patterns of block duplication in the two species are, however, markedly different. The yeast shows much more extensive block duplication than the nematode, with some chromosomes having more than 40% of the duplications derived from block duplications. Moreover, in the yeast the majority of block duplications occurred between chromosomes, while in the nematode most block duplications occurred within chromosomes.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 23 条
[1]  
Doolittle R.F., 1986, Of Urfs and Orfs: A Primer on How to Analyze Derived Amino Acid Sequences
[2]   THE MULTIPLICITY OF DOMAINS IN PROTEINS [J].
DOOLITTLE, RF .
ANNUAL REVIEW OF BIOCHEMISTRY, 1995, 64 :287-314
[3]   Gene duplication and the structure of eukaryotic genomes [J].
Friedman, R ;
Hughes, AL .
GENOME RESEARCH, 2001, 11 (03) :373-381
[4]   Extent of gene duplication in the genomes of Drosophila, nematode, and yeast [J].
Gu, ZL ;
Cavalcanti, A ;
Chen, FC ;
Bouman, P ;
Li, WH .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (03) :256-262
[5]  
Haldane JBS, 1932, The causes of evolution, by
[6]   HIDDEN MARKOV-MODELS IN COMPUTATIONAL BIOLOGY - APPLICATIONS TO PROTEIN MODELING [J].
KROGH, A ;
BROWN, M ;
MIAN, IS ;
SJOLANDER, K ;
HAUSSLER, D .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 235 (05) :1501-1531
[7]  
Li W.-H., 1997, MOL EVOLUTION
[8]   The evolutionary fate and consequences of duplicate genes [J].
Lynch, M ;
Conery, JS .
SCIENCE, 2000, 290 (5494) :1151-1155
[9]  
MORIYAMA EN, 1993, GENETICS, V134, P847
[10]  
NEI M, 1986, MOL BIOL EVOL, V3, P418