Genomic data imputation with variational auto-encoders

Cited by: 44
Authors
Qiu, Yeping Lina [1, 2]
Zheng, Hong [1]
Gevaert, Olivier [1, 3]
Affiliations
[1] Stanford Univ, Stanford Ctr Biomed Informat Res, Dept Med, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Biomed Data Sci, Stanford, CA 94305 USA
Source
GIGASCIENCE | 2020 / Volume 9 / Issue 08
Funding
National Institutes of Health (USA);
Keywords
imputation; variational auto-encoder; deep learning; MISSING VALUE IMPUTATION; AUTOENCODERS; NETWORK;
DOI
10.1093/gigascience/giaa082
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: As missing values are frequently present in genomic data, practical methods to handle missing data are necessary for downstream analyses that require complete data sets. State-of-the-art imputation techniques, including methods based on singular value decomposition and K-nearest neighbors, can be computationally expensive for large data sets and it is difficult to modify these algorithms to handle certain cases not missing at random.
Results: In this work, we use a deep-learning framework based on the variational auto-encoder (VAE) for genomic missing value imputation and demonstrate its effectiveness in transcriptome and methylome data analysis. We show that in the vast majority of our testing scenarios, VAE achieves similar or better performances than the most widely used imputation standards, while having a computational advantage at evaluation time. When dealing with data missing not at random (e.g., few values are missing), we develop simple yet effective methodologies to leverage the prior knowledge about missing data. Furthermore, we investigate the effect of varying latent space regularization strength in VAE on the imputation performances and, in this context, show why VAE has a better imputation capacity compared to a regular deterministic auto-encoder.
Conclusions: We describe a deep learning imputation framework for transcriptome and methylome data using a VAE and show that it can be a preferable alternative to traditional methods for data imputation, especially in the setting of large-scale data and certain missing-not-at-random scenarios.
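The abstract mentions varying the strength of latent space regularization in a VAE. As a rough illustration only, and not the authors' implementation, the sketch below computes a generic VAE objective in which a weight `beta` scales the KL term against a standard normal prior. The function names, the squared-error reconstruction term, and the `beta` parameter name are assumptions made for this example; the record does not specify any of them.

```python
import numpy as np

def gaussian_kl(mu, log_var):
    """KL divergence from N(mu, diag(exp(log_var))) to N(0, I).

    Summed over latent dimensions; returns one value per sample.
    """
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var, axis=1)

def beta_vae_loss(x, x_recon, mu, log_var, beta=1.0):
    """Mean over samples of (squared reconstruction error + beta * KL).

    `beta` is a hypothetical name for the latent regularization strength.
    With beta = 0 the KL term is dropped and only reconstruction error remains.
    """
    recon = np.sum((x - x_recon) ** 2, axis=1)  # per-sample squared error
    kl = gaussian_kl(mu, log_var)               # per-sample KL term
    return float(np.mean(recon + beta * kl))

# Toy usage: one sample, three features, two latent dimensions.
x = np.ones((1, 3))
x_recon = np.zeros((1, 3))
mu = np.ones((1, 2))
log_var = np.zeros((1, 2))
loss = beta_vae_loss(x, x_recon, mu, log_var, beta=2.0)
```

In this toy case the reconstruction term is 3 and the KL term is 1, so the loss with `beta=2.0` is 5. Raising `beta` pulls the encoded distribution closer to the prior at the cost of reconstruction accuracy.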
Pages: 12
Related papers
50 in total
  • [41] Dynamic Feature Collaborative Variational Auto-Encoders for Academic Paper Recommendation
    Niu, Yuanhao
    Jiang, Ting
    Chen, Zhiheng
    Bai, Weichen
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1620 - 1627
  • [42] Attribute-based regularization of latent spaces for variational auto-encoders
    Pati, Ashis
    Lerch, Alexander
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (09): : 4429 - 4444
  • [43] Interpretable ECG Beat Embedding using Disentangled Variational Auto-Encoders
    Van Steenkiste, Tom
    Deschrijver, Dirk
    Dhaene, Tom
    2019 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2019, : 373 - 378
  • [44] FMCW Radar Sensing for Indoor Drones Using Variational Auto-Encoders
    Safa, Ali
    Verbelen, Tim
    Catal, Ozan
    Van de Maele, Toon
    Hartmann, Matthias
    Dhoedt, Bart
    Bourdoux, Andre
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [45] Disentangling Factors of Variation with Cycle-Consistent Variational Auto-encoders
    Jha, Ananya Harsh
    Anand, Saket
    Singh, Maneesh
    Veeravasarapu, V. S. R.
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 829 - 845
  • [46] Variational graph auto-encoders for miRNA-disease association prediction
    Ding, Yulian
    Tian, Li-Ping
    Lei, Xiujuan
    Liao, Bo
    Wu, Fang-Xiang
    METHODS, 2021, 192 : 25 - 34
  • [47] Attribute-based regularization of latent spaces for variational auto-encoders
    Pati, Ashis
    Lerch, Alexander
    Neural Computing and Applications, 2021, 33 (09) : 4429 - 4444
  • [48] Attribute-based regularization of latent spaces for variational auto-encoders
    Ashis Pati
    Alexander Lerch
    Neural Computing and Applications, 2021, 33 : 4429 - 4444
  • [49] Directed Graph Auto-Encoders
    Kollias, Georgios
    Kalantzis, Vasileios
    Ide, Tsuyoshi
    Lozano, Aurelie
    Abe, Naoki
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7211 - 7219
  • [50] Graph Attention Auto-Encoders
    Salehi, Amin
    Davulcu, Hasan
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 989 - 996