Comprehensive analysis of the base composition around the transcription start site in Metazoa

被引:56
作者
Aerts, S [1 ]
Thijs, G
Dabrowski, M
Moreau, Y
De Moor, B
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD, Louvain, Belgium
[2] M Nencki Inst Expt Biol, Lab Transcript Regulat, PL-02093 Warsaw, Poland
关键词
D O I
10.1186/1471-2164-5-34
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The transcription start site of a metazoan gene remains poorly understood, mostly because there is no clear signal present in all genes. Now that several sequenced metazoan genomes have been annotated, we have been able to compare the base composition around the transcription start site for all annotated genes across multiple genomes. Results: The most prominent feature in the base compositions is a significant local variation in G+C content over a large region around the transcription site. The change is present in all animal phyla but the extent of variation is different between distinct classes of vertebrates, and the shape of the variation is completely different between vertebrates and arthropods. Furthermore, the height of the variation correlates with CpG frequencies in vertebrates but not in invertebrates and it also correlates with gene expression, especially in mammals. We also detect GC and AT skews in all clades (where %G is not equal to %C or %A is not equal to %T respectively) but these occur in a more confined region around the transcription start site and in the coding region. conclusions: the dramatic changes in nucleotide composition in humans are a consequence of CpG nucleotide frequencies and of gene expression, the changes in Fugu could point to primordial CpG islands, and the changes in the fly are of a totally different kind and unrelated to dinucleotide frequencies.
引用
收藏
页数:10
相关论文
共 37 条
  • [1] Transcription-induced mutations: Increase in C to T mutations in the nontranscribed strand during transcription in Escherichia coli
    Beletskii, A
    Bhagwat, AS
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (24) : 13919 - 13924
  • [2] The human genome: Organization and evolutionary history
    Bernardi, G
    [J]. ANNUAL REVIEW OF GENETICS, 1995, 29 : 445 - 476
  • [3] Molecular biology - DNA methylation de novo
    Bird, A
    [J]. SCIENCE, 1999, 286 (5448) : 2287 - 2288
  • [4] DNA METHYLATION AND THE FREQUENCY OF CPG IN ANIMAL DNA
    BIRD, AP
    [J]. NUCLEIC ACIDS RESEARCH, 1980, 8 (07) : 1499 - 1504
  • [5] CPG-RICH ISLANDS AND THE FUNCTION OF DNA METHYLATION
    BIRD, AP
    [J]. NATURE, 1986, 321 (6067) : 209 - 213
  • [6] A global analysis of Caenorhabditis elegans operons
    Blumenthal, T
    Evans, D
    Link, CD
    Guffanti, A
    Lawson, D
    Thierry-Mieg, J
    Thierry-Mieg, D
    Chiu, WL
    Duke, K
    Kiraly, M
    Kim, SK
    [J]. NATURE, 2002, 417 (6891) : 851 - 854
  • [7] Predicting gene regulatory elements in silico on a genomic scale
    Brazma, A
    Jonassen, I
    Vilo, J
    Ukkonen, E
    [J]. GENOME RESEARCH, 1998, 8 (11) : 1202 - 1215
  • [8] MatchMiner: a tool for batch navigation among gene and gene product identifiers
    Bussey, KJ
    Kane, D
    Sunshine, M
    Narasimhan, S
    Nishizuka, S
    Reinhold, WC
    Zeeberg, B
    Ajay
    Weinstein, JN
    [J]. GENOME BIOLOGY, 2003, 4 (04)
  • [9] CHARGAFF E, 1951, FED PROC, V10, P654
  • [10] MOLECULAR-BASIS OF BASE SUBSTITUTION HOTSPOTS IN ESCHERICHIA-COLI
    COULONDRE, C
    MILLER, JH
    FARABAUGH, PJ
    GILBERT, W
    [J]. NATURE, 1978, 274 (5673) : 775 - 780