Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank

被引:44
|
作者
Hofmeister, Robin J. [1 ]
Ribeiro, Diogo M. [1 ]
Rubinacci, Simone [1 ]
Delaneau, Olivier [1 ]
机构
[1] Univ Lausanne, Dept Computat Biol, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
LINKAGE DISEQUILIBRIUM; GENOTYPE IMPUTATION; WIDE ASSOCIATION;
D O I
10.1038/s41588-023-01415-w
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
SHAPEIT5, a phasing method that accurately processes large sequencing datasets, was applied on the UK Biobank whole-genome and whole-exome sequencing data to generate reference panels of haplotypes that boost imputation accuracy and enable the detection of compound heterozygous loss-of-function events for 549 genes. Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets and applied it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. We demonstrate that SHAPEIT5 phases rare variants with low switch error rates of below 5% for variants present in just 1 sample out of 100,000. Furthermore, we outline a method for phasing singletons, which, although less precise, constitutes an important step towards future developments. We then demonstrate that the use of UKB as a reference panel improves the accuracy of genotype imputation, which is even more pronounced when phased with SHAPEIT5 compared with other methods. Finally, we screen the UKB data for loss-of-function compound heterozygous events and identify 549 genes where both gene copies are knocked out. These genes complement current knowledge of gene essentiality in the human genome.
引用
收藏
页码:1243 / +
页数:23
相关论文
共 50 条
  • [41] Allele-specific copy-number discovery from whole-genome and whole-exome sequencing
    Wang, WeiBo
    Wang, Wei
    Sun, Wei
    Crowley, James J.
    Szatkiewicz, Jin P.
    NUCLEIC ACIDS RESEARCH, 2015, 43 (14)
  • [42] Genetic Basis of Pancreas Cancer Development and Progression: Insights from Whole-Exome and Whole-Genome Sequencing
    Iacobuzio-Donahue, Christine A.
    Velculescu, Victor E.
    Wolfgang, Christopher L.
    Hruban, Ralph H.
    CLINICAL CANCER RESEARCH, 2012, 18 (16) : 4257 - 4265
  • [43] Development and Validation of Clinical Whole-Exome and Whole-Genome Sequencing for Detection of Germline Variants in Inherited Disease
    Hegde, Madhuri
    Santani, Avni
    Mao, Rong
    Ferreira-Gonzalez, Andrea
    Weck, Karen E.
    Voelkerding, Karl V.
    ARCHIVES OF PATHOLOGY & LABORATORY MEDICINE, 2017, 141 (06) : 798 - 805
  • [44] Improving population scale statistical phasing with whole-genome sequencing data
    Wertenbroek, Rick
    Hofmeister, Robin J.
    Xenarios, Ioannis
    Thoma, Yann
    Delaneau, Olivier
    PLOS GENETICS, 2024, 20 (07):
  • [45] Whole-Exome Sequencing and Whole-Genome Sequencing in Critically Ill Neonates Suspected to Have Single-Gene Disorders
    Smith, Laurie D.
    Willig, Laurel K.
    Kingsmore, Stephen F.
    COLD SPRING HARBOR PERSPECTIVES IN MEDICINE, 2016, 6 (02):
  • [46] Opportunities and challenges of whole-genome and -exome sequencing
    Petersen, Britt-Sabina
    Fredrich, Broder
    Hoeppner, Marc P.
    Ellinghaus, David
    Franke, Andre
    BMC GENETICS, 2017, 18
  • [47] Opportunities and challenges of whole-genome and -exome sequencing
    Britt-Sabina Petersen
    Broder Fredrich
    Marc P. Hoeppner
    David Ellinghaus
    Andre Franke
    BMC Genetics, 18
  • [48] Whole-exome imputation within UK Biobank powers rare coding variant association and fine-mapping analyses
    Alison R. Barton
    Maxwell A. Sherman
    Ronen E. Mukamel
    Po-Ru Loh
    Nature Genetics, 2021, 53 : 1260 - 1269
  • [49] Whole-exome imputation within UK Biobank powers rare coding variant association and fine-mapping analyses
    Barton, Alison R.
    Sherman, Maxwell A.
    Mukamel, Ronen E.
    Loh, Po-Ru
    NATURE GENETICS, 2021, 53 (08) : 1260 - +
  • [50] KIR Genotyping Using Whole Exome Sequencing Data in TCGA and UK Biobank
    Gao, G. F.
    Li, B.
    JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2020, 68 : S309 - S309