Mclmpute: Matrix Completion Based Imputation for Single Cell RNA-seq Data

被引:52
作者
Mongia, Aanchal [1 ]
Sengupta, Debarka [1 ,2 ]
Majumdar, Angshul [3 ]
机构
[1] Indraprastha Inst Informat Technol Delhi, Dept Comp Sci & Engn, New Delhi, India
[2] Indraprastha Inst Informat Technol Delhi, Ctr Computat Biol, New Delhi, India
[3] Indraprastha Inst Informat Technol Delhi, Dept Elect & Commun Engn, New Delhi, India
关键词
scRNA-seq; dropouts; imputation; matrix completion; Nuclear norm minization; HETEROGENEITY; EMBRYOS;
D O I
10.3389/fgene.2019.00009
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Motivation: Single-cell RNA sequencing has been proved to be revolutionary for its potential of zooming into complex biological systems. Genome-wide expression analysis at single-cell resolution provides a window into dynamics of cellular phenotypes. This facilitates the characterization of transcriptional heterogeneity in normal and diseased tissues under various conditions. It also sheds light on the development or emergence of specific cell populations and phenotypes. However, owing to the paucity of input RNA, a typical single cell RNA sequencing data features a high number of dropout events where transcripts fail to get amplified. Results: We introduce mclmpute, a low-rank matrix completion based technique to impute dropouts in single cell expression data. On a number of real datasets, application of mclmpute yields significant improvements in the separation of true zeros from dropouts, cell-clustering, differential expression analysis, cell type separability, the performance of dimensionality reduction techniques for cell visualization, and gene distribution.
引用
收藏
页数:12
相关论文
共 53 条
[1]  
Abernethy Jacob., 2006, CS0611124 ARXIV
[2]  
Amit Y., 2007, P 24 INT C MACH LEAR, V24, P17
[3]  
[Anonymous], 2016, BIORXIV, DOI DOI 10.1101/049734
[4]  
[Anonymous], BIORXIV
[5]  
[Anonymous], 2015, WIDESPREAD CRITICAL
[6]  
[Anonymous], 2008, Principal Components Analysis (PCA)
[7]  
Argyriou Andreas, 2007, Advances in Neural Information Processing Systems, P41
[8]  
Bell RM., 2007, KDD CUP WORKSHOP 13, P7, DOI DOI 10.1007/S007790170019
[9]  
Bennett James, 2007, The Netflix Prize, V2007, P35
[10]   Cell fate inclination within 2-cell and 4-cell mouse embryos revealed by single-cell RNA sequencing [J].
Blase, Fernando H. ;
Cao, Xiaoyi ;
Zhong, Sheng .
GENOME RESEARCH, 2014, 24 (11) :1787-1796