Interpretable generative deep learning: an illustration with single cell gene expression data

被引：8

作者：

Treppner, Martin ^{[1
,2
]}

Binder, Harald ^{[3
]}

Hess, Moritz ^{[3
]}

机构：

[1] Univ Freiburg, Fac Med, Inst Med Biometry & Stat, Stefan Meier Str 26, D-79104 Freiburg, Germany

[2] Univ Freiburg, Med Ctr, Stefan Meier Str 26, D-79104 Freiburg, Germany

[3] Univ Freiburg, Freiburg Ctr Data Anal & Modeling, D-79104 Freiburg, Germany

来源：

HUMAN GENETICS | 2022年 / 141卷 / 09期

关键词：

Explainable AI; Deep learning; Generative model; Dimension reduction; NEURAL-NETWORKS;

D O I：

10.1007/s00439-021-02417-6

中图分类号：

Q3 [遗传学];

学科分类号：

071007 ; 090102 ;

摘要：

Deep generative models can learn the underlying structure, such as pathways or gene programs, from omics data. We provide an introduction as well as an overview of such techniques, specifically illustrating their use with single-cell gene expression data. For example, the low dimensional latent representations offered by various approaches, such as variational auto-encoders, are useful to get a better understanding of the relations between observed gene expressions and experimental factors or phenotypes. Furthermore, by providing a generative model for the latent and observed variables, deep generative models can generate synthetic observations, which allow us to assess the uncertainty in the learned representations. While deep generative models are useful to learn the structure of high-dimensional omics data by efficiently capturing non-linear dependencies between genes, they are sometimes difficult to interpret due to their neural network building blocks. More precisely, to understand the relationship between learned latent variables and observed variables, e.g., gene transcript abundances and external phenotypes, is difficult. Therefore, we also illustrate current approaches that allow us to infer the relationship between learned latent variables and observed variables as well as external phenotypes. Thereby, we render deep learning approaches more interpretable. In an application with single-cell gene expression data, we demonstrate the utility of the discussed methods.

引用

页码：1481 / 1498

页数：18

共 89 条

[21] Generative Adversarial Networks
Goodfellow, Ian
Pouget-Abadie, Jean
Mirza, Mehdi
Xu, Bing
Warde-Farley, David
Ozair, Sherjil
Courville, Aaron
Bengio, Yoshua
[J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
[22] Single-cell transcriptomic analysis of mIHC images via antigen mapping
Govek, Kiya W.
Troisi, Emma C.
Miao, Zhen
Aubin, Rachael G.
Woodhouse, Steven
Camara, Pablo G.
[J]. SCIENCE ADVANCES, 2021, 7 (10)
[23] Grün D, 2014, NAT METHODS, V11, P637, DOI [10.1038/nmeth.2930, 10.1038/NMETH.2930]
[24] Gupta A, 2021, EUR PHYS J C
[25] Gut G., pmVAE: Learning Interpretable Single-Cell Representations with Pathway Modules, V2021, DOI DOI 10.1101/2021.01.28.428664
[26] Exploring generative deep learning for omics data using log-linear models
Hess, Moritz
Hackenberg, Maren
Binder, Harald
[J]. BIOINFORMATICS, 2020, 36 (20) : 5045 - 5053
[27] Missing data and technical variability in single-cell RNA-sequencing experiments
Hicks, Stephanie C.
Townes, F. William
Teng, Mingxiang
Irizarry, Rafael A.
[J]. BIOSTATISTICS, 2018, 19 (04) : 562 - 578
[28] Higgins I., 2017, Conference Track Proceedings, V3
[29] Hilbe J.M., 2011, NEGATIVE BINOMIAL RE
[30] Reducing the dimensionality of data with neural networks
Hinton, G. E.
Salakhutdinov, R. R.
[J]. SCIENCE, 2006, 313 (5786) : 504 - 507

← 1 2 3 4 5 6 7 8 9 →