PedBLIMP: Extending Linear Predictors to Impute Genotypes in Pedigrees

被引:4
作者
Chen, Wenan [1 ]
Schaid, Daniel J. [1 ]
机构
[1] Mayo Clin, Dept Hlth Sci Res, Div Biomed Stat & Informat, Rochester, MN 55905 USA
基金
美国国家卫生研究院;
关键词
linear predictor; genotype imputation; linkage disequilibrium; identity by descent; GENOME-WIDE ASSOCIATION; LINKAGE DISEQUILIBRIUM; MISSING GENOTYPES; IMPUTATION; INFERENCE; DESCENT; IDENTITY; SPARSE; PHASE; MODEL;
D O I
10.1002/gepi.21838
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Recently, Wen and Stephens (Wen and Stephens [2010] Ann Appl Stat 4(3): 1158-1182) proposed a linear predictor, called BLIMP, that uses conditional multivariate normal moments to impute genotypes with accuracy similar to current state-of-the-art methods. One novelty is that it regularized the estimated covariance matrix based on a model from population genetics. We extended multivariate moments to impute genotypes in pedigrees. Our proposed method, PedBLIMP, utilizes both the linkage-disequilibrium (LD) information estimated from external panel data and the pedigree structure or identity-by-descent (IBD) information. The proposed method was evaluated on a pedigree design where some individuals were genotyped with dense markers and the rest with sparse markers. We found that incorporating the pedigree/IBD information can improve imputation accuracy compared to BLIMP. Because rare variants usually have low LD with other single-nucleotide polymorphisms (SNPs), incorporating pedigree/IBD information largely improved imputation accuracy for rare variants. We also compared PedBLIMP with IMPUTE2 and GIGI. Results show that when sparse markers are in a certain density range, our method can outperform both IMPUTE2 and GIGI. (C) 2014 Wiley Periodicals, Inc.
引用
收藏
页码:531 / 541
页数:11
相关论文
共 32 条
[1]   Merlin-rapid analysis of dense genetic maps using sparse gene flow trees [J].
Abecasis, GR ;
Cherny, SS ;
Cookson, WO ;
Cardon, LR .
NATURE GENETICS, 2002, 30 (01) :97-101
[2]   A Fast, Powerful Method for Detecting Identity by Descent [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2011, 88 (02) :173-182
[3]   Simultaneous Genotype Calling and Haplotype Phasing Improves Genotype Accuracy and Reduces False-Positive Associations for Genome-wide Association Studies [J].
Browning, Brian L. ;
Yu, Zhaoxia .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 85 (06) :847-861
[4]   A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) :210-223
[5]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[6]   In silico method for inferring genotypes in pedigrees [J].
Burdick, Joshua T. ;
Chen, Wei-Min ;
Abecasis, Goncalo R. ;
Cheung, Vivian G. .
NATURE GENETICS, 2006, 38 (09) :1002-1004
[7]   Family-based association tests for genomewide association scans [J].
Chen, Wei-Min ;
Abecasis, Goncalo R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :913-926
[8]   A Statistical Framework to Guide Sequencing Choices in Pedigrees [J].
Cheung, Charles Y. K. ;
Marchani, Elizabeth ;
Wijsman, Ellen M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2014, 94 (02) :257-267
[9]   GIGI: An Approach to Effective Imputation of Dense Genotypes on Large Pedigrees [J].
Cheung, Charles Y. K. ;
Thompson, Elizabeth A. ;
Wijsman, Ellen M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 92 (04) :504-516
[10]   Genotype imputation via matrix completion [J].
Chi, Eric C. ;
Zhou, Hua ;
Chen, Gary K. ;
Del Vecchyo, Diego Ortega ;
Lange, Kenneth .
GENOME RESEARCH, 2013, 23 (03) :509-518