Discovering non-additive heritability using additive GWAS summary statistics

被引:0
作者
Pattillo Smith, Samuel [1 ,2 ,3 ,4 ]
Darnell, Gregory [1 ,5 ]
Udwin, Dana [6 ]
Stamp, Julian [1 ]
Harpak, Arbel [3 ,4 ]
Ramachandran, Sohini [1 ,2 ,7 ]
Crawford, Lorin [1 ,6 ,8 ]
机构
[1] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[2] Brown Univ, Dept Ecol & Evolutionary Biol, Providence, RI USA
[3] Univ Texas Austin, Dept Integrat Biol, Austin, TX USA
[4] Univ Texas Austin, Dept Populat Hlth, Austin, TX USA
[5] Brown Univ, Inst Computat & Expt Res Math, Providence, RI USA
[6] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[7] Brown Univ, Data Sci Inst, Providence, RI USA
[8] Microsoft, Cambridge, MA 02142 USA
来源
ELIFE | 2024年 / 13卷
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
interactions; summary statistics; non-additive effects; heritability; BAYESIAN VARIABLE SELECTION; HUMAN COMPLEX TRAITS; MISSING HERITABILITY; SNP-HERITABILITY; REGRESSION; ASSOCIATION; DOMINANCE; EPISTASIS; ARCHITECTURE; VARIANTS;
D O I
10.7554/eLife.90459; 10.7554/eLife.90459.sa1; 10.7554/eLife.90459.sa2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
LD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a cis-interaction score (i.e. interactions between a focal variant and proximal variants) recovers genetic variance that is not captured by LDSC. For each of the 25 traits analyzed in the UK Biobank and BioBank Japan, i-LDSC detects additional variation contributed by genetic interactions. The i-LDSC software and its application to these biobanks represent a step towards resolving further genetic contributions of sources of non-additive genetic effects to complex trait variation.
引用
收藏
页数:32
相关论文
共 70 条
  • [1] FlashPCA2: principal component analysis of Biobank-scale genotype datasets
    Abraham, Gad
    Qiu, Yixuan
    Inouye, Michael
    [J]. BIOINFORMATICS, 2017, 33 (17) : 2776 - 2778
  • [2] Barreto H., 2005, Introductory Econometrics: Using Monte Carlo Simulation with Microsoft Excel, DOI DOI 10.1017/CBO9780511809231
  • [3] Bulik-Sullivan B, 2020, LDSC (LD score). v1.0.1
  • [4] An atlas of genetic correlations across human diseases and traits
    Bulik-Sullivan, Brendan
    Finucane, Hilary K.
    Anttila, Verneri
    Gusev, Alexander
    Day, Felix R.
    Loh, Po-Ru
    Duncan, Laramie
    Perry, John R. B.
    Patterson, Nick
    Robinson, Elise B.
    Daly, Mark J.
    Price, Alkes L.
    Neale, Benjamin M.
    [J]. NATURE GENETICS, 2015, 47 (11) : 1236 - +
  • [5] LD Score regression distinguishes confounding from polygenicity in genome-wide association studies
    Bulik-Sullivan, Brendan K.
    Loh, Po-Ru
    Finucane, Hilary K.
    Ripke, Stephan
    Yang, Jian
    Patterson, Nick
    Daly, Mark J.
    Price, Alkes L.
    Neale, Benjamin M.
    [J]. NATURE GENETICS, 2015, 47 (03) : 291 - +
  • [6] The UK Biobank resource with deep phenotyping and genomic data
    Bycroft, Clare
    Freeman, Colin
    Petkova, Desislava
    Band, Gavin
    Elliott, Lloyd T.
    Sharp, Kevin
    Motyer, Allan
    Vukcevic, Damjan
    Delaneau, Olivier
    O'Connell, Jared
    Cortes, Adrian
    Welsh, Samantha
    Young, Alan
    Effingham, Mark
    McVean, Gil
    Leslie, Stephen
    Allen, Naomi
    Donnelly, Peter
    Marchini, Jonathan
    [J]. NATURE, 2018, 562 (7726) : 203 - +
  • [7] Scalable Variational Inference for Bayesian Variable Selection in Regression, and Its Accuracy in Genetic Association Studies
    Carbonetto, Peter
    Stephens, Matthew
    [J]. BAYESIAN ANALYSIS, 2012, 7 (01): : 73 - 107
  • [8] Second-generation PLINK: rising to the challenge of larger and richer datasets
    Chang, Christopher C.
    Chow, Carson C.
    Tellier, Laurent C. A. M.
    Vattikuti, Shashaank
    Purcell, Shaun M.
    Lee, James J.
    [J]. GIGASCIENCE, 2015, 4
  • [9] Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
    Cheng, Wei
    Ramachandran, Sohini
    Crawford, Lorin
    [J]. PLOS GENETICS, 2020, 16 (06):
  • [10] Crawford L, 2024, Software Heritage