VASC: Dimension Reduction and Visualization of Single-cell RNA-seq Data by Deep Variational Autoencoder

Cited by: 2
Authors
Dongfang Wang [1]
Jin Gu [1]
Affiliations
[1] MOE Key Laboratory of Bioinformatics, BNRIST Bioinformatics Division & Center for Synthetic and Systems Biology, Department of Automation, Tsinghua University
Funding
National Natural Science Foundation of China
Keywords
Single cell RNA sequencing; Deep variational autoencoder; Dimension reduction; Visualization; Dropout
DOI
Not available
Chinese Library Classification number
Q811.4 [Bioinformatics]
Discipline classification codes
0711; 0831
Abstract
Single-cell RNA sequencing (scRNA-seq) is a powerful technique for analyzing transcriptomic heterogeneity at the single-cell level. An effective low-dimensional representation and visualization of the original scRNA-seq data is an important step in studying cell subpopulations and lineages. At the single-cell level, transcriptional fluctuations are much larger than in the average of a cell population, and the low amount of RNA transcripts increases the rate of technical dropout events. As a result, scRNA-seq data are much noisier than traditional bulk RNA-seq data. In this study, we propose the deep variational autoencoder for scRNA-seq data (VASC), a deep multi-layer generative model for unsupervised dimension reduction and visualization of scRNA-seq data. VASC explicitly models dropout events and finds nonlinear hierarchical feature representations of the original data. Tested on over 20 datasets, VASC shows superior performance in most cases and broader dataset compatibility than four state-of-the-art dimension reduction and visualization methods. In addition, VASC provides better representations of very rare cell populations in the 2D visualization. As a case study, VASC successfully re-establishes the cell dynamics in pre-implantation embryos and identifies several candidate marker genes associated with early embryo development. VASC also performs well on a 10x Genomics dataset with more cells and a higher dropout rate.
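The abstract names two ingredients without giving details: a variational autoencoder and an explicit model of dropout events. The NumPy sketch below illustrates the generic building blocks only, and is not the authors' implementation. The reparameterization step and the KL term are standard for any variational autoencoder; the dropout rule `p_drop = exp(-expr**2)` and all function names are assumptions chosen for illustration, picked so that weakly expressed genes are dropped more often.

```python
import numpy as np


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I) (standard VAE trick)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, I)), summed over latent dimensions, per cell."""
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=1)


def apply_dropout(expr, rng):
    """Zero out entries with probability exp(-expr^2).

    Assumed, illustrative dropout rule: the lower the expression value,
    the higher the chance that the entry is observed as zero.
    """
    p_drop = np.exp(-expr**2)
    keep = rng.random(expr.shape) >= p_drop
    return expr * keep, p_drop


rng = np.random.default_rng(0)

# Four cells embedded in a 2D latent space (as used for visualization).
mu = np.zeros((4, 2))
log_var = np.zeros((4, 2))
z = reparameterize(mu, log_var, rng)
kl = kl_to_standard_normal(mu, log_var)

# Toy non-negative "expression" matrix: four cells by six genes.
expr = np.abs(rng.standard_normal((4, 6)))
observed, p_drop = apply_dropout(expr, rng)
```

In a full model, an encoder network would produce `mu` and `log_var` from the expression matrix and a decoder would produce `expr` from `z`; both are omitted here to keep the sketch short.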
Pages: 320-331
Page count: 12