DrImpute: imputing dropout events in single cell RNA sequencing data

被引:199
作者
Gong, Wuming [1 ]
Kwak, Il-Youp [1 ]
Pota, Pruthvi [1 ]
Koyano-Nakagawa, Naoko [1 ]
Garry, Daniel J. [1 ]
机构
[1] Univ Minnesota, Lillehei Heart Inst, 2231 6th St SE,4-165 CCRB, Minneapolis, MN 55114 USA
来源
BMC BIOINFORMATICS | 2018年 / 19卷
基金
美国国家卫生研究院;
关键词
Single cell RNA sequencing data; Dropout events; Imputation; Next generation sequencing; MISSING VALUE ESTIMATION; GENE-EXPRESSION; FATE DECISIONS; IMPUTATION; HETEROGENEITY; DESIGN;
D O I
10.1186/s12859-018-2226-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The single cell RNA sequencing (scRNA-seq) technique begin a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature of the gene expression pattern, there is a high chance of missing nonzero entries as zero, which are called dropout events. Results: We develop DrImpute to impute dropout events in scRNA-seq data. We show that DrImpute has significantly better performance on the separation of the dropout zeros from true zeros than existing imputation algorithms. We also demonstrate that DrImpute can significantly improve the performance of existing tools for clustering, visualization and lineage reconstruction of nine published scRNA-seq datasets. Conclusions: DrImpute can serve as a very useful addition to the currently existing statistical tools for single cell RNA-seq analysis. .
引用
收藏
页数:10
相关论文
共 50 条
[1]  
[Anonymous], 2016, NUCL ACIDS RES, DOI DOI 10.1093/NAR/GKW430
[2]  
[Anonymous], 2010, P 19 INT C WORLD WID, DOI [DOI 10.1145/1772690.1772862, 10.1145/1772690.1772862]
[3]  
[Anonymous], DEEP GENERATIVE MODE
[4]   Statistical Design and Analysis of RNA Sequencing Data [J].
Auer, Paul L. ;
Doerge, R. W. .
GENETICS, 2010, 185 (02) :405-U32
[5]  
Azizi Elham., 2017, Genomics and Computational Biology, V3, P46, DOI DOI 10.18547/GCB.2017.VOL3.ISS1.E46
[6]   Design and computational analysis of single-cell RNA-sequencing experiments [J].
Bacher, Rhonda ;
Kendziorski, Christina .
GENOME BIOLOGY, 2016, 17
[7]  
Blakeley P, 2015, DEVELOPMENT, V142, P3151, DOI [10.1242/dev.123547, 10.1242/dev.131235]
[8]  
Campbell K, 2017, BIORXIV
[9]   Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells [J].
Deng, Qiaolin ;
Ramskold, Daniel ;
Reinius, Bjorn ;
Sandberg, Rickard .
SCIENCE, 2014, 343 (6167) :193-196
[10]   MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data [J].
Finak, Greg ;
McDavid, Andrew ;
Yajima, Masanao ;
Deng, Jingyuan ;
Gersuk, Vivian ;
Shalek, Alex K. ;
Slichter, Chloe K. ;
Miller, Hannah W. ;
McElrath, M. Juliana ;
Prlic, Martin ;
Linsley, Peter S. ;
Gottardo, Raphael .
GENOME BIOLOGY, 2015, 16