An evaluation of new criteria for CpG islands in the human genome as gene markers

被引:179
作者
Wang, Y [1 ]
Leung, FCC [1 ]
机构
[1] Univ Hong Kong, Dept Zool, Pokfulam, Hong Kong, Peoples R China
关键词
D O I
10.1093/bioinformatics/bth059
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Recently, more stringent criteria for CpG islands have been introduced to exclude Alu repeats, thereby enabling a higher proportion of CpG islands associating with genes to be identified. Using these new criteria, several types of associations between CpG islands and genes were investigated to further establish the importance of CpG islands as gene markers. Results: The CpG islands were searched by CpGIE, a java software program developed for CpG island identification. CpGIE was advanced in identification accuracy compared with other tools. According to our results, about 70% of the identified CpG islands were associating with the human genes and over half of them are in the promoters. Furthermore, the investigation of genes in the confirmed gene model showed that 56% of them had a CpG island overlapping the transcription start sites. In comparison, the new criteria were found capable of filtering a large fraction of Alu repeats that was identified as CpG islands by the generally accepted criteria within the genes, but very few CpG islands associating with the promoters were affected. The genes in the predicted gene model were not obviously associated with CpG islands, suggesting that CpG islands can be used to evaluate the accuracy of gene annotation.
引用
收藏
页码:1170 / 1177
页数:8
相关论文
共 18 条
[1]   NUMBER OF CPG ISLANDS AND GENES IN HUMAN AND MOUSE [J].
ANTEQUERA, F ;
BIRD, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (24) :11995-11999
[2]   CpG islands as genomic footprints of promoters that are associated with replication origins [J].
Antequera, F ;
Bird, A .
CURRENT BIOLOGY, 1999, 9 (17) :R661-R667
[3]   DNA methylation patterns and epigenetic memory [J].
Bird, A .
GENES & DEVELOPMENT, 2002, 16 (01) :6-21
[4]   CONSERVATION OF THE ORGANIZATION OF 5 TIGHTLY CLUSTERED GENES OVER 600 MILLION YEARS OF DIVERGENT EVOLUTION [J].
COLOMBO, P ;
YON, J ;
GARSON, K ;
FRIED, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (14) :6358-6362
[5]   Computational identification of promoters and first exons in the human genome [J].
Davuluri, RV ;
Grosse, I ;
Zhang, MQ .
NATURE GENETICS, 2001, 29 (04) :412-417
[6]   CPG ISLANDS IN GENES SHOWING TISSUE-SPECIFIC EXPRESSION [J].
EDWARDS, YH .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1990, 326 (1235) :207-215
[7]   CPG ISLANDS IN VERTEBRATE GENOMES [J].
GARDINERGARDEN, M ;
FROMMER, M .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (02) :261-282
[8]   THE MOUSE SURFEIT LOCUS CONTAINS A CLUSTER OF 6 GENES ASSOCIATED WITH 4 CPG-RICH ISLANDS IN 32 KILOBASES OF GENOMIC DNA [J].
HUXLEY, C ;
FRIED, M .
MOLECULAR AND CELLULAR BIOLOGY, 1990, 10 (02) :605-614
[9]   Large-scale human promoter mapping using CpG islands [J].
Ioshikhes, IP ;
Zhang, MQ .
NATURE GENETICS, 2000, 26 (01) :61-63
[10]   CPG ISLANDS AS GENE MARKERS IN THE HUMAN GENOME [J].
LARSEN, F ;
GUNDERSEN, G ;
LOPEZ, R ;
PRYDZ, H .
GENOMICS, 1992, 13 (04) :1095-1107