Extent of gene duplication in the genomes of Drosophila, nematode, and yeast

被引:389
作者
Gu, ZL
Cavalcanti, A
Chen, FC
Bouman, P
Li, WH
机构
[1] Univ Chicago, Dept Ecol & Evolut, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
关键词
gene families; gene duplication rate; database cleaning; codon usage bias;
D O I
10.1093/oxfordjournals.molbev.a004079
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We conducted a detailed analysis of duplicate genes in three complete genoms: yeast, Drosophila, and Caenorhabditis elegans. For two proteins belonging to the same family we used the criteria ( 1 ) their similarity is greater than or equal to1 ( 1 = 30% if L greater than or equal to 150 a.a. and I = 0.01n + 4.8L(-0.321) (exp) if L < 150 a.a.. where n = 6 and L is the length of the alignable region), and (2) the length of the alignable region between the two sequences is greater than or equal to80% of the longer protein, We found it very important to delete isoforms (caused by alternative splicing). same genes with different names. and proteins derived from repetitive elements. We estimated that there Acre 530, 674, and 1,219 protein families in yeast. Drosophila. and C. elegans, respectively. so, as expected, cast has the smallest number of duplicate genes. However. for the duplicate pairs with the number of substitutions per synonymous site (K-s) < 0.01, Drosophila has only seven pairs. whereas beast has 58 pairs and nematode has 153 pairs. After considering the possible effects of codon usage bias and gene coil version. these numbers became 6, 55, and 147, respectively. Thus. Drosophila appears to have much fewer young duplicate genes than do yeast and nematode. The larger numbers of duplicate pairs with K-s < 0.01 in yeast and C. elegans were probably largely caused by block duplications. At any rate. it is clear that the genome of Drosophila melanogaster has undergone few gene duplications. in the recent past and has much fewer gene families than C. elegans.
引用
收藏
页码:256 / 262
页数:7
相关论文
共 30 条
  • [1] Mutation pressure, natural selection, and the evolution of base composition in Drosophila
    Akashi, H
    Kliman, RM
    Eyre-Walker, A
    [J]. GENETICA, 1998, 102-3 (0) : 49 - 60
  • [2] Genomes were forged by massive bombardments with retroelements and retrosequences
    Brosius, J
    [J]. GENETICA, 1999, 107 (1-3) : 209 - 238
  • [3] Doolittle R.F., 1986, Of Urfs and Orfs: A Primer on How to Analyze Derived Amino Acid Sequences
  • [4] THE MULTIPLICITY OF DOMAINS IN PROTEINS
    DOOLITTLE, RF
    [J]. ANNUAL REVIEW OF BIOCHEMISTRY, 1995, 64 : 287 - 314
  • [5] Dunn KA, 2001, GENETICS, V157, P295
  • [6] Gene duplication and the structure of eukaryotic genomes
    Friedman, R
    Hughes, AL
    [J]. GENOME RESEARCH, 2001, 11 (03) : 373 - 381
  • [8] Analysis of the genome sequence of the flowering plant Arabidopsis thaliana
    Kaul, S
    Koo, HL
    Jenkins, J
    Rizzo, M
    Rooney, T
    Tallon, LJ
    Feldblyum, T
    Nierman, W
    Benito, MI
    Lin, XY
    Town, CD
    Venter, JC
    Fraser, CM
    Tabata, S
    Nakamura, Y
    Kaneko, T
    Sato, S
    Asamizu, E
    Kato, T
    Kotani, H
    Sasamoto, S
    Ecker, JR
    Theologis, A
    Federspiel, NA
    Palm, CJ
    Osborne, BI
    Shinn, P
    Conway, AB
    Vysotskaia, VS
    Dewar, K
    Conn, L
    Lenz, CA
    Kim, CJ
    Hansen, NF
    Liu, SX
    Buehler, E
    Altafi, H
    Sakano, H
    Dunn, P
    Lam, B
    Pham, PK
    Chao, Q
    Nguyen, M
    Yu, GX
    Chen, HM
    Southwick, A
    Lee, JM
    Miranda, M
    Toriumi, MJ
    Davis, RW
    [J]. NATURE, 2000, 408 (6814) : 796 - 815
  • [10] HIDDEN MARKOV-MODELS IN COMPUTATIONAL BIOLOGY - APPLICATIONS TO PROTEIN MODELING
    KROGH, A
    BROWN, M
    MIAN, IS
    SJOLANDER, K
    HAUSSLER, D
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1994, 235 (05) : 1501 - 1531