A General Approach for Haplotype Phasing across the Full Spectrum of Relatedness
Cited by: 418
Authors:
O'Connell, Jared [1,2]
Gurdasani, Deepti [3,4]
Delaneau, Olivier [2]
Pirastu, Nicola [5]
Ulivi, Sheila [6]
Cocca, Massimiliano [7]
Traglia, Michela [7]
Huang, Jie [3]
Huffman, Jennifer E. [8]
Rudan, Igor [9]
McQuillan, Ruth [9]
Fraser, Ross M. [9]
Campbell, Harry [9]
Polasek, Ozren [10]
Asiki, Gershim [11]
Ekoru, Kenneth [12]
Hayward, Caroline [8]
Wright, Alan F. [8]
Vitart, Veronique [8]
Navarro, Pau [8]
Zagury, Jean-Francois [12]
Wilson, James F. [9]
Toniolo, Daniela [7]
Gasparini, Paolo [5]
Soranzo, Nicole [3]
Sandhu, Manjinder S. [3,4]
Marchini, Jonathan [1,2]
Affiliations:
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford, England
[2] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
[3] Wellcome Trust Sanger Inst, Hinxton, England
[4] Univ Cambridge, Dept Publ Hlth & Primary Care, Cambridge, England
[5] Univ Trieste, IRCCS Burlo Garofolo, Inst Maternal & Child Hlth, Trieste, Italy
[6] IRCCS Burlo Garofolo, Inst Maternal & Child Hlth, Trieste, Italy
[7] Ist Sci San Raffaele, Div Genet & Cell Biol, I-20132 Milan, Italy
[8] Univ Edinburgh, MRC, Human Genet Unit, Inst Genet & Mol Med, Edinburgh, Midlothian, Scotland
[9] Univ Edinburgh, Ctr Populat Hlth Sci, Edinburgh, Midlothian, Scotland
[10] Univ Split, Fac Med, Split, Croatia
[11] UVRI, MRC, Uganda Res Unit AIDS, Entebbe, Uganda
[12] Conservatoire Natl Arts & Metiers, Lab Genom Bioinformat & Applicat EA4627, Paris, France
Funding:
Wellcome Trust (UK);
Medical Research Council (UK);
Keywords:
GENOME-WIDE ASSOCIATION;
GENOTYPE IMPUTATION;
LINKAGE ANALYSIS;
POPULATION;
DESCENT;
RECOMBINATION;
INFERENCE;
IDENTITY;
RECONSTRUCTION;
SEQUENCE;
DOI:
10.1371/journal.pgen.1004234
Chinese Library Classification:
Q3 [Genetics];
Discipline classification codes:
071007;
090102;
Abstract:
Many existing cohorts contain a range of relatedness between genotyped individuals, either by design or by chance. Haplotype estimation in such cohorts is a central step in many downstream analyses. Using genotypes from six cohorts from isolated populations and two cohorts from non-isolated populations, we have investigated the performance of different phasing methods designed for nominally 'unrelated' individuals. We find that SHAPEIT2 produces much lower switch error rates in all cohorts compared to other methods, including those designed specifically for isolated populations. In particular, when large amounts of IBD sharing are present, SHAPEIT2 infers close to perfect haplotypes. Based on these results we have developed a general strategy for phasing cohorts with any level of implicit or explicit relatedness between individuals. First, SHAPEIT2 is run ignoring all explicit family information. We then apply a novel HMM method (duoHMM) to combine the SHAPEIT2 haplotypes with any family information to infer the inheritance pattern of each meiosis at all sites across each chromosome. This allows the correction of switch errors and the detection of recombination events and genotyping errors. We show that the method detects numbers of recombination events that align very well with expectations based on genetic maps, and that it infers far fewer spurious recombination events than Merlin. The method can also detect genotyping errors and infer recombination events in otherwise uninformative families, such as trios and duos. The detected recombination events can be used in association scans for recombination phenotypes. The method provides a simple and unified approach to haplotype estimation that will be of interest to researchers in the fields of human, animal and plant genetics.
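The abstract compares methods by switch error rate. As a rough illustration only, the sketch below computes that metric under its usual definition: among heterozygous sites, the fraction of consecutive pairs whose phase relative to the truth flips. The function name `switch_error_rate` and the 0/1 allele encoding are assumptions made for this example; this is not the paper's evaluation code, and it assumes the true and inferred haplotypes imply the same genotypes.

```python
from typing import Sequence, Tuple

# A pair of haplotypes, each a sequence of 0/1 alleles (hypothetical encoding).
Haplotypes = Tuple[Sequence[int], Sequence[int]]


def switch_error_rate(truth: Haplotypes, inferred: Haplotypes) -> float:
    """Fraction of consecutive heterozygous site pairs with a phase switch."""
    t0, t1 = truth
    i0, _ = inferred
    # Only heterozygous sites carry phase information.
    het = [k for k in range(len(t0)) if t0[k] != t1[k]]
    if len(het) < 2:
        return 0.0
    # At each het site, does inferred haplotype 0 match true haplotype 0?
    orientation = [i0[k] == t0[k] for k in het]
    # A switch is a change of orientation between neighbouring het sites.
    switches = sum(a != b for a, b in zip(orientation, orientation[1:]))
    return switches / (len(het) - 1)


truth = ([0, 0, 0, 0, 1], [1, 1, 1, 1, 1])      # last site is homozygous
inferred = ([0, 0, 1, 1, 1], [1, 1, 0, 0, 1])   # phase flips after site 2
rate = switch_error_rate(truth, inferred)
```

Because only changes in orientation are counted, swapping the two inferred haplotypes wholesale gives a rate of zero, which matches the convention that haplotype labels are arbitrary.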
Pages: 21