Partial short-read sequencing of a highly inbred Iberian pig and genomics inference thereof

被引:14
作者
Esteve-Codina, A. [1 ]
Kofler, R. [2 ,3 ]
Himmelbauer, H. [2 ]
Ferretti, L. [1 ,4 ]
Vivancos, A. P. [2 ]
Groenen, M. A. M. [5 ]
Folch, J. M. [1 ]
Rodriguez, M. C. [6 ]
Perez-Enciso, M. [1 ,7 ]
机构
[1] Univ Autonoma Barcelona, Dept Ciencia Anim & Aliments, Fac Vet, Bellaterra 08193, Spain
[2] Univ Pompeu Fabra, Ctr Genom Regulat CRG, Barcelona, Spain
[3] Max Planck Inst Mol Genet, Berlin, Germany
[4] CRAG, Dept Anim Sci, Bellaterra, Spain
[5] Univ Wageningen & Res Ctr, Anim Breeding & Genom Ctr, Wageningen, Netherlands
[6] Inst Nacl Invest & Tecnol Agr Alimentaria INIA, Dept Mejora Genet Anim, Madrid, Spain
[7] ICREA, Barcelona, Spain
关键词
Iberian pig; next generation sequencing; nucleotide diversity; Pig; SNP DISCOVERY; RECOMBINATION RATES; WHOLE-GENOME; QTL; DIVERSITY; MUTATION;
D O I
10.1038/hdy.2011.13
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Despite dramatic reduction in sequencing costs with the advent of next generation sequencing technologies, obtaining a complete mammalian genome sequence at sufficient depth is still costly. An alternative is partial sequencing. Here, we have sequenced a reduced representation library of an Iberian sow from the Guadyerbas strain, a highly inbred strain that has been used in numerous QTL studies because of its extreme phenotypic characteristics. Using the Illumina Genome Analyzer II (San Diego, CA, USA), we resequenced similar to 1% of the genome with average 4 x depth, identifying 68 778 polymorphisms. Of these, 55 457 were putative fixed differences with respect to the assembly, based on the genome of a Duroc pig, and 13 321 were heterozygous positions within Guadyerbas. Despite being highly inbred, the estimate of heterozygosity within Guadyerbas was similar to 0.78 kb(-1) in autosomes, after correcting for low depth. Nucleotide variability was consistently higher at the telomeric regions than on the rest of the chromosome, likely a result of increased recombination rates. Further, variability was 50% lower in the X-chromosome than in autosomes, which may be explained by a recent bottleneck or by selection. We divided the whole genome in 500 kb windows and we analyzed overrepresented gene ontology terms in regions of low and high variability. Multi organism process, pigmentation and cell killing were overrepresented in high variability regions and metabolic process ontology, within low variability regions. Further, a genome wide Hudson-Kreitman-Aguade test was carried out per window; overall, variability was in agreement with neutral expectations. Heredity (2011) 107, 256-264; doi: 10.1038/hdy.2011.13; published online 16 March 2011
引用
收藏
页码:256 / 264
页数:9
相关论文
共 33 条
[1]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]  
AMARAL A, 2011, PLOS ONE IN PRESS
[3]   Application of massive parallel sequencing to whole genome SNP discovery in the porcine genome [J].
Amaral, Andreia J. ;
Megens, Hendrik-Jan ;
Kerstens, Hindrik H. D. ;
Heuven, Henri C. M. ;
Dibbits, Bert ;
Crooijmans, Richard P. M. A. ;
Den Dunnen, Johan T. ;
Groenen, Martien A. M. .
BMC GENOMICS, 2009, 10 :374
[4]  
[Anonymous], 2010, GENETICS POPULATIONS
[5]   Fast and flexible simulation of DNA sequence data [J].
Chen, Gary K. ;
Marjoram, Paul ;
Wall, Jeffrey D. .
GENOME RESEARCH, 2009, 19 (01) :136-142
[6]   To Pool, or Not to Pool? [J].
Cutler, David J. ;
Jensen, Jeffrey D. .
GENETICS, 2010, 186 (01) :41-43
[7]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[8]   The different levels of genetic diversity in sex chromosomes and autosomes [J].
Ellegren, Hans .
TRENDS IN GENETICS, 2009, 25 (06) :278-284
[9]   mlRho - a program for estimating the population mutation and recombination rates from shotgun-sequenced diploid genomes [J].
Haubold, Bernhard ;
Pfaffelhuber, Peter ;
Lynch, Michael .
MOLECULAR ECOLOGY, 2010, 19 :277-284
[10]   Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals [J].
Hellmann, Ines ;
Mang, Yuan ;
Gu, Zhiping ;
Li, Peter ;
de la Vega, Francisco M. ;
Clark, Andrew G. ;
Nielsen, Rasmus .
GENOME RESEARCH, 2008, 18 (07) :1020-1029