Genome-wide Analysis of Large-scale Longitudinal Outcomes using Penalization —GALLOP algorithm

被引:0
|
作者
Karolina Sikorska
Emmanuel Lesaffre
Patrick J. F. Groenen
Fernando Rivadeneira
Paul H. C. Eilers
机构
[1] Netherlands Cancer Institute,Department of Biometrics
[2] Leuven University,Leuven Biostatistics and Statistical Bioinformatics Centre
[3] Erasmus University,Erasmus School of Economics
[4] Erasmus Medical Centre,Department of Internal Medicine
[5] Erasmus Medical Centre,Department of Biostatistics
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Genome-wide association studies (GWAS) with longitudinal phenotypes provide opportunities to identify genetic variations associated with changes in human traits over time. Mixed models are used to correct for the correlated nature of longitudinal data. GWA studies are notorious for their computational challenges, which are considerable when mixed models for thousands of individuals are fitted to millions of SNPs. We present a new algorithm that speeds up a genome-wide analysis of longitudinal data by several orders of magnitude. It solves the equivalent penalized least squares problem efficiently, computing variances in an initial step. Factorizations and transformations are used to avoid inversion of large matrices. Because the system of equations is bordered, we can re-use components, which can be precomputed for the mixed model without a SNP. Two SNP effects (main and its interaction with time) are obtained. Our method completes the analysis a thousand times faster than the R package lme4, providing an almost identical solution for the coefficients and p-values. We provide an R implementation of our algorithm.
引用
收藏
相关论文
共 50 条
  • [1] Genome-wide Analysis of Large-scale Longitudinal Outcomes using Penalization - GALLOP algorithm
    Sikorska, Karolina
    Lesaffre, Emmanuel
    Groenen, Patrick J. F.
    Rivadeneira, Fernando
    Eilers, Paul H. C.
    SCIENTIFIC REPORTS, 2018, 8
  • [2] Analysis of Genome-Wide Association Studies with Multiple Outcomes Using Penalization
    Liu, Jin
    Huang, Jian
    Ma, Shuangge
    PLOS ONE, 2012, 7 (12):
  • [3] Fast Principal Component Analysis of Large-Scale Genome-Wide Data
    Abraham, Gad
    Inouye, Michael
    PLOS ONE, 2014, 9 (04):
  • [4] Secure large-scale genome-wide association studies using homomorphic encryption
    Blatt, Marcelo
    Gusev, Alexander
    Polyakov, Yuriy
    Goldwasser, Shafi
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (21) : 11608 - 11613
  • [5] Bayesian hierarchical hypothesis testing in large-scale genome-wide association analysis
    Samaddar, Anirban
    Maiti, Tapabrata
    de los Campos, Gustavo
    GENETICS, 2024, 228 (04)
  • [6] Analysis of genome-wide association data by large-scale Bayesian logistic regression
    Yuanjia Wang
    Nanshi Sha
    Yixin Fang
    BMC Proceedings, 3 (Suppl 7)
  • [7] Bifidobacteriaceae diversity in the human microbiome from a large-scale genome-wide analysis
    Pasolli, Edoardo
    Mauriello, Italia Elisa
    Avagliano, Michele
    Cavaliere, Sara
    De Filippis, Francesca
    Ercolini, Danilo
    CELL REPORTS, 2024, 43 (12):
  • [8] Genome-wide association studies and large-scale collaborations in epidemiology
    Psaty, Bruce M.
    Hofman, Albert
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2010, 25 (08) : 525 - 529
  • [9] Genome-wide association studies and large-scale collaborations in epidemiology
    Bruce M. Psaty
    Albert Hofman
    European Journal of Epidemiology, 2010, 25 : 525 - 529
  • [10] SPAGRM: effectively controlling for sample relatedness in large-scale genome-wide association studies of longitudinal traits
    Xu, He
    Ma, Yuzhuo
    Xu, Lin-lin
    Li, Yin
    Liu, Yufei
    Li, Ying
    Zhou, Xu-jie
    Zhou, Wei
    Lee, Seunggeun
    Zhang, Peipei
    Yue, Weihua
    Bi, Wenjian
    NATURE COMMUNICATIONS, 2025, 16 (01)