An evolutionary model of a complementary circular code

被引:14
作者
Arques, DG
Fallot, JP
Michel, CJ
机构
[1] UNIV MARNE LA VALLEE, EQUIPE BIOL THEOR, INST GASPARD MONGE, F-93160 NOISY LE GRAND, FRANCE
[2] UNIV FRANCHE COMTE, INST UNIV TECHNOL BELFORT MONTBELIARD, EQUIPE BIOL THEOR, F-90016 BELFORT, FRANCE
关键词
PROTEIN CODING REGIONS; DNA-SEQUENCES; GENETIC-CODE; TRYPANOSOME MITOCHONDRIA; RNA; SIMULATION; ORIGIN;
D O I
10.1006/jtbi.1996.0305
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The subset X(q) = {AAC,AAT,ACC,ATC,ATT,CAG,CTC,CTG,GAA,GAC,GAG,GAT,GCC,GGC GGT,GTA,GTC,GTT,TAC,TTC} of 20 trinucleotides has a preferential occurrence in frame 0 (a reading frame established by the ATG start trinucleotide) of protein (coding) genes of both prokaryotes and eukaryotes. This subset X(0) has the rarity property (6 x 10(-8)) to be a complementary maximal circular code with two permutated maximal circular codes X(1) and X(2) in frames 1 and 2 respectively (frame 0 shifted by one and two nucleotides respectively in the 5'-3' direction). X(0) is called a C-3 code. A quantitative study of these three subsets X(0), X(1) and X(2) in the three frames 0, 1 and 2 of eukaryotic protein genes shows that their occurrence frequencies are constant functions of the trinucleotide positions in the sequences. The frequencies of X(0), X(1) and X(2) in frame 0 of the eukaryotic protein genes are 48.5%, 29% and 22.5% respectively. These properties are not observed in the 5' and 3' regions of eukaryotes where X(0), X(1) and X(2) occur with variable frequencies around the random value (1/3). Several frequency asymmetries unexpectedly observed, e.g. the frequency difference between X(1) and X(2) in the frame 0, are related to a new property of the C-3 code X(0) involving substitutions. An evolutionary model at three parameters (p, q, k) based on an independent mixing of the 20 codons (trinucieotides in frame 0) of X(0) with equiprobability (1/20) followed by k approximate to 5 substitutions per codon in the three codon sites in proportions p approximate to 0.1, q approximate to 0.1 and r = 1 - p - q approximate to 0.8 respectively, retrieves the frequencies of X(0), X(1) and X(2) observed in the three flames of protein genes and explains these asymmetries. (C) 1997 Academic Press Limited.
引用
收藏
页码:241 / 253
页数:13
相关论文
共 30 条
[1]   A MODEL OF DNA-SEQUENCE EVOLUTION [J].
ARQUES, DG ;
MICHEL, CJ .
BULLETIN OF MATHEMATICAL BIOLOGY, 1990, 52 (06) :741-772
[2]   A PURINE PYRIMIDINE MOTIF VERIFYING AN IDENTICAL PRESENCE IN ALMOST ALL GENE TAXONOMIC GROUPS [J].
ARQUES, DG ;
MICHEL, CJ .
JOURNAL OF THEORETICAL BIOLOGY, 1987, 128 (04) :457-461
[3]   ANALYTICAL EXPRESSION OF THE PURINE/PYRIMIDINE AUTOCORRELATION FUNCTION AFTER AND BEFORE RANDOM MUTATIONS [J].
ARQUES, DG ;
MICHEL, CJ .
MATHEMATICAL BIOSCIENCES, 1994, 123 (01) :103-125
[4]   A complementary circular code in the protein coding genes [J].
Arques, DG ;
Michel, CJ .
JOURNAL OF THEORETICAL BIOLOGY, 1996, 182 (01) :45-58
[5]   IDENTIFICATION AND SIMULATION OF NEW NONRANDOM STATISTICAL PROPERTIES COMMON TO DIFFERENT EUKARYOTIC GENE SUBPOPULATIONS [J].
ARQUES, DG ;
MICHEL, CJ .
BIOCHIMIE, 1993, 75 (05) :399-407
[6]   A SIMULATION OF THE GENETIC PERIODICITIES MODULO-2 AND MODULO-3 WITH PROCESSES OF NUCLEOTIDE INSERTIONS AND DELETIONS [J].
ARQUES, DG ;
MICHEL, CJ .
JOURNAL OF THEORETICAL BIOLOGY, 1992, 156 (01) :113-127
[7]  
Beal M.-P., 1993, CODAGE SYMBOLIQUE
[8]   THE ORIGIN AND EVOLUTION OF THE GENETIC-CODE [J].
BELAND, P ;
ALLEN, TFH .
JOURNAL OF THEORETICAL BIOLOGY, 1994, 170 (04) :359-365
[9]   RNA-EDITING IN TRYPANOSOME MITOCHONDRIA [J].
BENNE, R .
BIOCHIMICA ET BIOPHYSICA ACTA, 1989, 1007 (02) :131-139
[10]   MAJOR TRANSCRIPT OF THE FRAMESHIFTED COXLL GENE FROM TRYPANOSOME MITOCHONDRIA CONTAINS 4 NUCLEOTIDES THAT ARE NOT ENCODED IN THE DNA [J].
BENNE, R ;
VANDENBURG, J ;
BRAKENHOFF, JPJ ;
SLOOF, P ;
VANBOOM, JH ;
TROMP, MC .
CELL, 1986, 46 (06) :819-826