The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium

被引:50
作者
Akey, JM [1 ]
Zhang, K
Xiong, MM
Jin, L
机构
[1] Univ Cincinnati, Ctr Genome Informat, Cincinnati, OH 45267 USA
[2] Univ Texas, Ctr Human Genet, Houston, TX 77030 USA
关键词
ascertainment bias; linkage disequilibrium; SNPs; coalescent;
D O I
10.1093/molbev/msg032
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
At present there is tremendous interest in characterizing the magnitude and distribution of linkage disequilibrium (LD) throughout the human genome, which will provide the necessary foundation for genome-wide LD analyses and facilitate detailed evolutionary studies. To this end, a human high-density single-nucleotide polymorphism (SNP) marker map has been constructed. Many of the SNPs on this map, however, were identified by sampling a small number of chromosomes from a single population, and inferences drawn from studies using such SNPs may be influenced by ascertainment bias (AB). Through extensive simulations, we have found that AB is a potentially significant problem in estimating and comparing LD within and between populations. Specifically, the magnitude of AB is a function of the SNP discovery strategy, number of chromosomes used for SNP discovery, population genetic characteristics of the particular genomic region considered, amount of gene flow between populations, and demographic history of the populations. We demonstrate that a balanced SNP discovery strategy (where equal numbers of chromosomes are sampled from multiple subpopulations) is the optimal study design for generating broadly applicable SNP resources. Finally, we validate our theoretical predictions by comparing our results to publicly available data from ten genes sequenced in 24 African American and 23 European American individuals.
引用
收藏
页码:232 / 242
页数:11
相关论文
共 51 条
  • [41] Global patterns of linkage disequilibrium at the CD4 locus and modern human origins
    Tishkoff, SA
    Dietzsch, E
    Speed, W
    Pakstis, AJ
    Kidd, JR
    Cheung, K
    BonneTamir, B
    SantachiaraBenerecetti, AS
    Moral, P
    Krings, M
    Paabo, S
    Watson, E
    Risch, N
    Jenkins, T
    Kidd, KK
    [J]. SCIENCE, 1996, 271 (5254) : 1380 - 1387
  • [42] Wakeley J, 1999, GENETICS, V153, P1863
  • [43] The discovery of single-nucleotide polymorphisms - and inferences about human demographic history
    Wakeley, J
    Nielsen, R
    Liu-Cordero, SN
    Ardlie, K
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (06) : 1332 - 1347
  • [44] Linkage disequilibrium and the mapping of complex human traits
    Weiss, KM
    Clark, AG
    [J]. TRENDS IN GENETICS, 2002, 18 (01) : 19 - 24
  • [45] Indirect measures of gene flow and migration:: FST≠1/(4Nm+1)
    Whitlock, MC
    McCauley, DE
    [J]. HEREDITY, 1999, 82 (2) : 117 - 125
  • [46] Population genetic structure of variable drug response
    Wilson, JF
    Weale, ME
    Smith, AC
    Gratrix, F
    Fletcher, B
    Thomas, MG
    Bradman, N
    Goldstein, DB
    [J]. NATURE GENETICS, 2001, 29 (03) : 265 - 269
  • [47] Sampling SNPs
    Yang, ZY
    Wong, GKS
    Eberle, MA
    Kibukawa, M
    Passey, DA
    Hughes, WR
    Kruglyak, L
    Yu, J
    [J]. NATURE GENETICS, 2000, 26 (01) : 13 - 14
  • [48] Comparison of human genetic and sequence-based physical maps
    Yu, A
    Zhao, CF
    Fan, Y
    Jang, WH
    Mungall, AJ
    Deloukas, P
    Olsen, A
    Doggett, NA
    Ghebranious, N
    Broman, KW
    Weber, JL
    [J]. NATURE, 2001, 409 (6822) : 951 - 953
  • [49] Global patterns of human DNA sequence variation in a 10-kb region on chromosome 1
    Yu, N
    Fu, YX
    Sambuughin, N
    Ramsay, M
    Jenkins, T
    Leskinen, E
    Patthy, L
    Jorde, LB
    Kuromori, T
    Li, WH
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (02) : 214 - 222
  • [50] Statistical inference of sequence-dependent mutation rates
    Zavolan, M
    Kepler, TB
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2001, 11 (06) : 612 - 615