Low-Dimensional Genotype Embeddings for Predictive Models

被引:0
|
作者
Sultan, Syed Fahad [1 ]
Guo, Xingzhi [2 ]
Skiena, Steven [2 ]
机构
[1] Furman Univ, Greenville, SC 29613 USA
[2] SUNY Stony Brook, Stony Brook, NY USA
来源
13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022 | 2022年
关键词
genotype; embeddings; privacy-preserving;
D O I
10.1145/3535508.3545507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop methods for constructing low-dimensional vector representations (embeddings) of large-scale genotyping data, capable of reducing genotypes of hundreds of thousands of SNPs to 100-dimensional embeddings that retain substantial predictive power for inferring medical phenotypes. We demonstrate that embedding-based models yield an average F-score of 0.605 on a test of ten phenoypes (including BMI prediction, genetic relatedness, and depression) versus 0.339 for baseline models. Genotype embeddings also hold promise for creating sharing data while preserving subject anonymity: we show that they retain substantial predictive power even after anonymization by adding Gaussian noise to each dimension.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Equilibration in low-dimensional quantum matrix models
    R. Hübener
    Y. Sekino
    J. Eisert
    Journal of High Energy Physics, 2015
  • [42] Efficient Low-Dimensional Compression of Overparameterized Models
    Kwon, Soo Min
    Zhang, Zekai
    Song, Dogyoon
    Balzano, Laura
    Qu, Qing
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [43] Low-dimensional models of coherent structures in turbulence
    Holmes, PJ
    Lumley, JL
    Berkooz, G
    Mattingly, JC
    Wittenberg, RW
    PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 1997, 287 (04): : 337 - 384
  • [44] On low-dimensional Galerkin models for fluid flow
    Rempfer, D
    THEORETICAL AND COMPUTATIONAL FLUID DYNAMICS, 2000, 14 (02) : 75 - 88
  • [45] EMG-Based Control of a Robot Arm Using Low-Dimensional Embeddings
    Artemiadis, Panagiotis K.
    Kyriakopoulos, Kostas J.
    IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (02) : 393 - 398
  • [46] Solution of low-dimensional constrained model predictive control problems
    Gupta, YP
    ISA TRANSACTIONS, 2004, 43 (04) : 499 - 508
  • [47] Low-dimensional models of thin film fluid dynamics
    Physics Letters. Section A: General, Atomic and Solid State Physics, 1996, 212 (1-2):
  • [48] Direct estimation of low-dimensional components in additive models
    Fan, JQ
    Härdle, W
    Mammen, E
    ANNALS OF STATISTICS, 1998, 26 (03): : 943 - 971
  • [49] DROPLET THEORY OF LOW-DIMENSIONAL ISING-MODELS
    BRUCE, AD
    WALLACE, DJ
    PHYSICAL REVIEW LETTERS, 1981, 47 (24) : 1743 - 1746
  • [50] Phase Transitions in Low-Dimensional Disordered Potts Models
    Babaev, A. B.
    Murtazaev, A. K.
    PHYSICS OF THE SOLID STATE, 2020, 62 (05) : 851 - 855