Scalable and Robust Regression Methods for Phenome-Wide Association Analysis on Large-Scale Biobank Data

被引:5
|
作者
Bi, Wenjian [1 ,2 ,3 ]
Lee, Seunggeun [4 ]
机构
[1] Peking Univ, Sch Basic Med Sci, Dept Med Genet, Beijing, Peoples R China
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Ctr Stat Genet, Ann Arbor, MI 48109 USA
[4] Seoul Natl Univ, Grad Sch Data Sci, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
phenome-wide association studies; electronic health records-EHR; saddlepoint approximation; biobank data analysis; unbalanced phenotypic distribution; genetic relatedness; mixed model approaches; GENE-ENVIRONMENT INTERACTION; MIXED-MODEL ANALYSIS; RARE VARIANTS; POPULATION-STRUCTURE; COMMON DISEASES; SEQUENCING DATA; TESTS; METAANALYSIS; SURVIVAL; POWER;
D O I
10.3389/fgene.2021.682638
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the advances in genotyping technologies and electronic health records (EHRs), large biobanks have been great resources to identify novel genetic associations and gene-environment interactions on a genome-wide and even a phenome-wide scale. To date, several phenome-wide association studies (PheWAS) have been performed on biobank data, which provides comprehensive insights into many aspects of human genetics and biology. Although inspiring, PheWAS on large-scale biobank data encounters new challenges including computational burden, unbalanced phenotypic distribution, and genetic relationship. In this paper, we first discuss these new challenges and their potential impact on data analysis. Then, we summarize approaches that are scalable and robust in GWAS and PheWAS. This review can serve as a practical guide for geneticists, epidemiologists, and other medical researchers to identify genetic variations associated with health-related phenotypes in large-scale biobank data analysis. Meanwhile, it can also help statisticians to gain a comprehensive and up-to-date understanding of the current technical tool development.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Phenome-wide heritability analysis of the UK Biobank
    Ge, Tian
    Chen, Chia-Yen
    Neale, Benjamin M.
    Sabuncu, Mert R.
    Smoller, Jordan W.
    PLOS GENETICS, 2017, 13 (04):
  • [2] PYPE: A pipeline for phenome-wide association and Mendelian randomization in investigator-driven biobank scale analysis
    Dalal, Taykhoom
    Patel, Chirag J.
    PATTERNS, 2024, 5 (06):
  • [3] Phenome-wide association studies go large
    Louisa Flintoft
    Nature Reviews Genetics, 2014, 15 : 2 - 2
  • [4] Large-scale phenome-wide scan in twins using electronic health records
    Hebbring, Scott J.
    Ye, Zhan
    Pathak, Jyotishman
    Mayer, John
    Cheng, Yijing
    Schrodi, Steven J.
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 555 - 556
  • [5] Enhanced rare disease mapping for phenome-wide genetic association in the UK Biobank
    Matthew T. Patrick
    Redina Bardhi
    Wei Zhou
    James T. Elder
    Johann E. Gudjonsson
    Lam C. Tsoi
    Genome Medicine, 14
  • [6] Enhanced rare disease mapping for phenome-wide genetic association in the UK Biobank
    Patrick, Matthew T.
    Bardhi, Redina
    Zhou, Wei
    Elder, James T.
    Gudjonsson, Johann E.
    Tsoi, Lam C.
    GENOME MEDICINE, 2022, 14 (01)
  • [7] DISEASE GENETICS Phenome-wide association studies go large
    Flintoft, Louisa
    NATURE REVIEWS GENETICS, 2014, 15 (01) : 2 - 2
  • [8] Large-Scale Phenome-Wide Association Study of PCSK9 Variants Demonstrates Protection Against Ischemic Stroke
    Rao, Abhiram S.
    Lindholm, Daniel
    Rivas, Manuel A.
    Knowles, Joshua W.
    Montgomery, Stephen B.
    Ingelsson, Erik
    CIRCULATION-GENOMIC AND PRECISION MEDICINE, 2018, 11 (07): : e002162
  • [9] Comorbidities in lichen planus by phenome-wide association study in two biobank population cohorts
    Fromme, Malin
    Schneider, Carolin, V
    Schlapbach, Christoph
    Cazzaniga, Simone
    Trautwein, Christian
    Rader, Dan J.
    Borradori, Luca
    Strnad, Pavel
    BRITISH JOURNAL OF DERMATOLOGY, 2022, 187 (05) : 722 - 729
  • [10] Analysis of genome-wide association data by large-scale Bayesian logistic regression
    Yuanjia Wang
    Nanshi Sha
    Yixin Fang
    BMC Proceedings, 3 (Suppl 7)