Handling linkage disequilibrium in qualitative trait linkage analysis using dense SNPs: a two-step strategy

被引:11
作者
Cho, Kelly [1 ]
Dupuis, Josee [1 ]
机构
[1] Boston Univ, Sch Publ Hlth, Dept Biostat, Boston, MA USA
关键词
SINGLE-NUCLEOTIDE POLYMORPHISMS; I ERROR; MARKERS; POWER; MICROSATELLITES;
D O I
10.1186/1471-2156-10-44
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: In affected sibling pair linkage analysis, the presence of linkage disequilibrium (LD) has been shown to lead to overestimation of the number of alleles shared identity-by-descent (IBD) among sibling pairs when parents are ungenotyped. This inflation results in spurious evidence for linkage even when the markers and the disease locus are not linked. In our study, we first theoretically evaluate how inflation in IBD probabilities leads to overestimation of a nonparametric linkage (NPL) statistic under the assumption of linkage equilibrium. Next, we propose a two-step processing strategy in order to systematically evaluate approaches to handle LD. Based on the observed inflation of expected logarithm of the odds ratio (LOD) from our theoretical exploration, we implemented our proposed two-step processing strategy. Step 1 involves three techniques to filter a dense set of markers. In step 2, we use the selected subset of markers from step 1 and apply four different methods of handling LD among dense markers: 1) marker thinning (MT); 2) recursive elimination; 3) SNPLINK; and 4) LD modeling approach in MERLIN. We evaluate relative performance of each method through simulation. Results: We observed LOD score inflation only when the parents were ungenotyped. For a given number of markers, all approaches evaluated for each type of LD threshold performed similarly; however, RE approach was the only one that eliminated the LOD score bias. Our simulation results indicate a reduction of approximately 75% to complete elimination of the LOD score inflation while maintaining the information content (IC) when setting a tolerable squared correlation coefficient LD threshold (r(2)) above 0.3 for or 2 SNPs per cM using MT. Conclusion: We have established a theoretical basis of how inflated IBD information among dense markers overestimates a NPL statistic. The two-step processing strategy serves as a useful framework to systematically evaluate relative performance of different methods to handle LD.
引用
收藏
页数:8
相关论文
共 18 条
[1]   Handling marker-marker linkage disequilibrium: Pedigree analysis with clustered markers [J].
Abecasis, GR ;
Wigginton, JE .
AMERICAN JOURNAL OF HUMAN GENETICS, 2005, 77 (05) :754-767
[2]   Merlin-rapid analysis of dense genetic maps using sparse gene flow trees [J].
Abecasis, GR ;
Cherny, SS ;
Cookson, WO ;
Cardon, LR .
NATURE GENETICS, 2002, 30 (01) :97-101
[3]   Direct power comparisons between simple LOD scores and NPL scores for linkage analysis in complex diseases [J].
Abreu, PC ;
Greenberg, DA ;
Hodge, SE .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (03) :847-857
[4]   High-density SNP analysis of 642 Caucasian families with rheumatoid arthritis identifies two new linkage regions on 11p12 and 2q33 [J].
Amos, C. I. ;
Chen, W. V. ;
Lee, A. ;
Li, W. ;
Kern, M. ;
Lundsten, R. ;
Batliwalla, F. ;
Wener, M. ;
Remmers, E. ;
Kastner, D. A. ;
Criswell, L. A. ;
Seldin, M. F. ;
Gregersen, P. K. .
GENES AND IMMUNITY, 2006, 7 (04) :277-286
[5]   Linkage disequilibrium inflates type I error rates in multipoint linkage analysis when parental genotypes are missing [J].
Boyles, AL ;
Scott, WK ;
Martin, ER ;
Schmidt, S ;
Li, YJ ;
Ashley-Koch, A ;
Bass, MP ;
Schmidt, M ;
Pericak-Vance, MA ;
Speer, MC ;
Hauser, ER .
HUMAN HEREDITY, 2005, 59 (04) :220-227
[6]   EFFECTS OF MIS-SPECIFYING GENETIC-PARAMETERS IN LOD SCORE ANALYSIS [J].
CLERGETDARPOUX, F ;
BONAITIPELLIE, C ;
HOCHEZ, J .
BIOMETRICS, 1986, 42 (02) :393-399
[7]  
Haldane JBS, 1919, J GENET, V8, P299
[8]   Ignoring linkage disequilibrium among tightly linked markers induces false-positive evidence of linkage for affected sib pair analysis [J].
Huang, QQ ;
Shete, S ;
Amos, CI .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (06) :1106-1112
[9]   Examining the effect of linkage disequilibrium between markers on the Type I error rate and power of Nonparametric multipoint linkage analysis of two-generation and multigenerational pedigrees in the presence of missing genotype data [J].
Kim, Yoonhee ;
Duggal, Priya ;
Gillanders, Elizabeth M. ;
Kim, Ho ;
Bailey-Wilson, Joan E. .
GENETIC EPIDEMIOLOGY, 2008, 32 (01) :41-51
[10]  
Kruglyak L, 1996, AM J HUM GENET, V58, P1347