Federated Principal Component Analysis for Genome-Wide Association Studies

被引:9
|
作者
Hartebrodt, Anne [1 ]
Nasirigerdeh, Reza [2 ]
Blumenthal, David B. [3 ]
Rottger, Richard [1 ]
机构
[1] Univ Southern Denmark, Odense, Denmark
[2] Tech Univ Munich, Munich, Germany
[3] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Germany
关键词
ALGORITHMS;
D O I
10.1109/ICDM51629.2021.00127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) has emerged as a privacy-aware alternative to centralized data analysis, especially for biomedical analyses such as genome-wide association studies (GWAS). The data remains with the owner, which enables studies previously impossible due to privacy protection regulations. Principal component analysis (PCA) is a frequent preprocessing step in GWAS, where the eigenvectors of the sample-by-sample covariance matrix are used as covariates in the statistical tests. Therefore, a federated version of PCA suitable for vertical data partitioning is required for federated GWAS. Existing federated PCA algorithms exchange the complete sample eigenvectors, a potential privacy breach. In this paper, we present a federated PCA algorithm for vertically partitioned data which does not exchange the sample eigenvectors and is hence suitable for federated GWAS. We show that it outperforms existing federated solutions in terms of convergence behavior and scalability. Additionally, we provide a user-friendly privacy-aware web tool to promote acceptance of federated PCA among GWAS researchers.
引用
收藏
页码:1090 / 1095
页数:6
相关论文
共 50 条
  • [41] Guidelines for Genome-Wide Association Studies
    Barsh, Gregory S.
    Copenhaver, Gregory P.
    Gibson, Greg
    Williams, Scott M.
    PLOS GENETICS, 2012, 8 (07):
  • [42] Genome-wide association studies in neurology
    Tan, Meng-Shan
    Jiang, Teng
    Tan, Lan
    Yu, Jin-Tai
    ANNALS OF TRANSLATIONAL MEDICINE, 2014, 2 (12)
  • [43] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2009, 2 (4-5) : 191 - 191
  • [44] The decline of genome-wide association studies
    Jordan, Bertrand
    M S-MEDECINE SCIENCES, 2009, 25 (05): : 537 - 539
  • [45] Genome-wide association studies in pharmacogenomics
    Ann K. Daly
    Nature Reviews Genetics, 2010, 11 : 241 - 246
  • [46] Genome-wide association studies with metabolomics
    Adamski, Jerzy
    GENOME MEDICINE, 2012, 4
  • [47] Autism and Genome-Wide Association Studies
    Celec, Peter
    Ostatnikova, Daniela
    ACTIVITAS NERVOSA SUPERIOR REDIVIVA, 2010, 52 (01):
  • [48] Genome-wide association studies: a primer
    Corvin, A.
    Craddock, N.
    Sullivan, P. F.
    PSYCHOLOGICAL MEDICINE, 2010, 40 (07) : 1063 - 1077
  • [49] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2010, 3 (4-6) : 144 - 150
  • [50] Replication in Genome-Wide Association Studies
    Kraft, Peter
    Zeggini, Eleftheria
    Ioannidis, John P. A.
    STATISTICAL SCIENCE, 2009, 24 (04) : 561 - 573