SNPs Occur in Regions with Less Genomic Sequence Conservation

Cited by: 54
Authors
Castle, John C. [1 ]
Affiliations
[1] Rosetta Inpharmat LLC, Seattle, WA USA
Keywords
GENETIC-VARIATION; MOUSE GENOME; DATABASE; SITES; CONSEQUENCES; EUKARYOTES; MUTATIONS; EVOLUTION; SILENCE; MAMMALS;
DOI
10.1371/journal.pone.0020660
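Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code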
07 ; 0710 ; 09 ;
Abstract
Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3' splice sites, 5' splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3' UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionary conservation.
Pages: 12
Related papers
27 in total
[1]   Genomics and the future of conservation genetics [J].
Allendorf, Fred W. ;
Hohenlohe, Paul A. ;
Luikart, Gordon .
NATURE REVIEWS GENETICS, 2010, 11 (10) :697-709
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, Debbie A. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]   Mutation rate variation in multicellular eukaryotes: causes and consequences [J].
Baer, Charles F. ;
Miyamoto, Michael M. ;
Denver, Dee R. .
NATURE REVIEWS GENETICS, 2007, 8 (08) :619-631
[4]   Listening to silence and understanding nonsense: Exonic mutations that affect splicing [J].
Cartegni, L ;
Chew, SL ;
Krainer, AR .
NATURE REVIEWS GENETICS, 2002, 3 (04) :285-298
[5]   Hearing silence: non-neutral evolution at synonymous sites in mammals [J].
Chamary, JV ;
Parmley, JL ;
Hurst, LD .
NATURE REVIEWS GENETICS, 2006, 7 (02) :98-108
[6]   Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: Structure-based assessment of amino acid variation [J].
Chasman, D ;
Adams, RM .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) :683-706
[7]   Natural selection on human microRNA binding sites inferred from SNP data [J].
Chen, Kevin ;
Rajewsky, Nikolaus .
NATURE GENETICS, 2006, 38 (12) :1452-1456
[8]   WebLogo: A sequence logo generator [J].
Crooks, GE ;
Hon, G ;
Chandonia, JM ;
Brenner, SE .
GENOME RESEARCH, 2004, 14 (06) :1188-1190
[9]   Single nucleotide polymorphism-based validation of exonic splicing enhancers [J].
Fairbrother, WG ;
Holste, D ;
Burge, CB ;
Sharp, PA .
PLOS BIOLOGY, 2004, 2 (09) :1388-1395
[10]   MicroRNA targeting specificity in mammals: Determinants beyond seed pairing [J].
Grimson, Andrew ;
Farh, Kyle Kai-How ;
Johnston, Wendy K. ;
Garrett-Engele, Philip ;
Lim, Lee P. ;
Bartel, David P. .
MOLECULAR CELL, 2007, 27 (01) :91-105