DrImpute: imputing dropout events in single cell RNA sequencing data

被引:199
作者
Gong, Wuming [1 ]
Kwak, Il-Youp [1 ]
Pota, Pruthvi [1 ]
Koyano-Nakagawa, Naoko [1 ]
Garry, Daniel J. [1 ]
机构
[1] Univ Minnesota, Lillehei Heart Inst, 2231 6th St SE,4-165 CCRB, Minneapolis, MN 55114 USA
来源
BMC BIOINFORMATICS | 2018年 / 19卷
基金
美国国家卫生研究院;
关键词
Single cell RNA sequencing data; Dropout events; Imputation; Next generation sequencing; MISSING VALUE ESTIMATION; GENE-EXPRESSION; FATE DECISIONS; IMPUTATION; HETEROGENEITY; DESIGN;
D O I
10.1186/s12859-018-2226-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The single cell RNA sequencing (scRNA-seq) technique begin a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature of the gene expression pattern, there is a high chance of missing nonzero entries as zero, which are called dropout events. Results: We develop DrImpute to impute dropout events in scRNA-seq data. We show that DrImpute has significantly better performance on the separation of the dropout zeros from true zeros than existing imputation algorithms. We also demonstrate that DrImpute can significantly improve the performance of existing tools for clustering, visualization and lineage reconstruction of nine published scRNA-seq datasets. Conclusions: DrImpute can serve as a very useful addition to the currently existing statistical tools for single cell RNA-seq analysis. .
引用
收藏
页数:10
相关论文
共 50 条
[21]   CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data [J].
Lin, Peijie ;
Troup, Michael ;
Ho, Joshua W. K. .
GENOME BIOLOGY, 2017, 18
[22]  
LOVE MI, 2014, GENOME BIOL, V15, DOI DOI 10.1186/S13059-014-0550-8
[23]   From single-cell to cell-pool transcriptomes: Stochasticity in gene expression and RNA splicing [J].
Marinov, Georgi K. ;
Williams, Brian A. ;
McCue, Ken ;
Schroth, Gary P. ;
Gertz, Jason ;
Myers, Richard M. ;
Wold, Barbara J. .
GENOME RESEARCH, 2014, 24 (03) :496-510
[24]  
Nainys J, MAGIC DIFFUSION BASE
[25]   Gaussian mixture clustering and imputation of microarray data [J].
Ouyang, M ;
Welsh, WJ ;
Georgopoulos, P .
BIOINFORMATICS, 2004, 20 (06) :917-923
[26]  
Petropoulos S, 2016, CELL, V167, P285, DOI [10.1016/j.cell.2016.08.009, 10.1016/j.cell.2016.03.023]
[27]   ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis [J].
Pierson, Emma ;
Yau, Christopher .
GENOME BIOLOGY, 2015, 16
[28]   Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex [J].
Pollen, Alex A. ;
Nowakowski, Tomasz J. ;
Shuga, Joe ;
Wang, Xiaohui ;
Leyrat, Anne A. ;
Lui, Jan H. ;
Li, Nianzhen ;
Szpankowski, Lukasz ;
Fowler, Brian ;
Chen, Peilin ;
Ramalingam, Naveen ;
Sun, Gang ;
Thu, Myo ;
Norris, Michael ;
Lebofsky, Ronald ;
Toppani, Dominique ;
Kemp, Darnell W., II ;
Wong, Michael ;
Clerkson, Barry ;
Jones, Brittnee N. ;
Wu, Shiquan ;
Knutsson, Lawrence ;
Alvarado, Beatriz ;
Wang, Jing ;
Weaver, Lesley S. ;
May, Andrew P. ;
Jones, Robert C. ;
Unger, Marc A. ;
Kriegstein, Arnold R. ;
West, Jay A. A. .
NATURE BIOTECHNOLOGY, 2014, 32 (10) :1053-+
[29]  
Prabhakaran S, 2016, DIRICHLET PROCESS MI, P1070
[30]   Pseudotime estimation: deconfounding single cell time series [J].
Reid, John E. ;
Wernisch, Lorenz .
BIOINFORMATICS, 2016, 32 (19) :2973-2980