Interpretable generative deep learning: an illustration with single cell gene expression data

被引:8
作者
Treppner, Martin [1 ,2 ]
Binder, Harald [3 ]
Hess, Moritz [3 ]
机构
[1] Univ Freiburg, Fac Med, Inst Med Biometry & Stat, Stefan Meier Str 26, D-79104 Freiburg, Germany
[2] Univ Freiburg, Med Ctr, Stefan Meier Str 26, D-79104 Freiburg, Germany
[3] Univ Freiburg, Freiburg Ctr Data Anal & Modeling, D-79104 Freiburg, Germany
关键词
Explainable AI; Deep learning; Generative model; Dimension reduction; NEURAL-NETWORKS;
D O I
10.1007/s00439-021-02417-6
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Deep generative models can learn the underlying structure, such as pathways or gene programs, from omics data. We provide an introduction as well as an overview of such techniques, specifically illustrating their use with single-cell gene expression data. For example, the low dimensional latent representations offered by various approaches, such as variational auto-encoders, are useful to get a better understanding of the relations between observed gene expressions and experimental factors or phenotypes. Furthermore, by providing a generative model for the latent and observed variables, deep generative models can generate synthetic observations, which allow us to assess the uncertainty in the learned representations. While deep generative models are useful to learn the structure of high-dimensional omics data by efficiently capturing non-linear dependencies between genes, they are sometimes difficult to interpret due to their neural network building blocks. More precisely, to understand the relationship between learned latent variables and observed variables, e.g., gene transcript abundances and external phenotypes, is difficult. Therefore, we also illustrate current approaches that allow us to infer the relationship between learned latent variables and observed variables as well as external phenotypes. Thereby, we render deep learning approaches more interpretable. In an application with single-cell gene expression data, we demonstrate the utility of the discussed methods.
引用
收藏
页码:1481 / 1498
页数:18
相关论文
共 89 条
  • [21] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [22] Single-cell transcriptomic analysis of mIHC images via antigen mapping
    Govek, Kiya W.
    Troisi, Emma C.
    Miao, Zhen
    Aubin, Rachael G.
    Woodhouse, Steven
    Camara, Pablo G.
    [J]. SCIENCE ADVANCES, 2021, 7 (10)
  • [23] Grün D, 2014, NAT METHODS, V11, P637, DOI [10.1038/nmeth.2930, 10.1038/NMETH.2930]
  • [24] Gupta A, 2021, EUR PHYS J C
  • [25] Gut G., pmVAE: Learning Interpretable Single-Cell Representations with Pathway Modules, V2021, DOI DOI 10.1101/2021.01.28.428664
  • [26] Exploring generative deep learning for omics data using log-linear models
    Hess, Moritz
    Hackenberg, Maren
    Binder, Harald
    [J]. BIOINFORMATICS, 2020, 36 (20) : 5045 - 5053
  • [27] Missing data and technical variability in single-cell RNA-sequencing experiments
    Hicks, Stephanie C.
    Townes, F. William
    Teng, Mingxiang
    Irizarry, Rafael A.
    [J]. BIOSTATISTICS, 2018, 19 (04) : 562 - 578
  • [28] Higgins I., 2017, Conference Track Proceedings, V3
  • [29] Hilbe J.M., 2011, NEGATIVE BINOMIAL RE
  • [30] Reducing the dimensionality of data with neural networks
    Hinton, G. E.
    Salakhutdinov, R. R.
    [J]. SCIENCE, 2006, 313 (5786) : 504 - 507