Penalized estimation of haplotype frequencies

Cited by: 16
Authors
Ayers, Kristin L. [1]
Lange, Kenneth [1, 2, 3]
Affiliations
[1] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
DOI
10.1093/bioinformatics/btn236
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: Low haplotype diversity and linkage disequilibrium are the rule in short genomic segments. This fact suggests that parsimony should be enforced in estimation of haplotype frequencies. The current article introduces a diversity penalty that automatically discards potential haplotypes with low explanatory power. The standard EM algorithm for haplotype frequency estimation can accommodate the penalty if one passes over to a more general minorize-maximize (MM) scheme for estimation.
Results: Our new MM algorithm converges in fewer iterations, eliminates marginal haplotypes from further consideration and reduces the computational complexity of each iteration. Estimation by the MM algorithm also improves haplotyping and genotype imputation compared to naive application of the EM algorithm. Thus, the MM algorithm is a useful substitute for the EM algorithm. Compared to the most sophisticated current methods of haplotyping and genotype imputation, the MM algorithm is slightly less accurate but at least an order of magnitude faster.
Pages: 1596-1602
Number of pages: 7