共 50 条
Imputing single-cell RNA-seq data by considering cell heterogeneity and prior expression of dropouts
被引:21
|作者:
Zhang, Lihua
[1
,2
]
Zhang, Shihua
[1
,2
,3
,4
]
机构:
[1] Chinese Acad Sci, Acad Math & Syst Sci, RCSDS, NCMIS,CEMS, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Ctr Excellence Anim Evolut & Genet, Kunming 650223, Yunnan, Peoples R China
[4] Chinese Acad Sci, Univ Chinese Acad Sci, Hangzhou Inst Adv Study, Key Lab Syst Biol, Hangzhou 310024, Peoples R China
基金:
中国国家自然科学基金;
国家重点研发计划;
关键词:
single-cell RNA-seq;
dropout;
imputation;
low-rank;
systems biology;
D O I:
10.1093/jmcb/mjaa052
中图分类号:
Q2 [细胞生物学];
学科分类号:
071009 ;
090102 ;
摘要:
Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to determine expression patterns of thousands of individual cells. However, the analysis of scRNA-seq data remains a computational challenge due to the high technical noise such as the presence of dropout events that lead to a large proportion of zeros for expressed genes. Taking into account the cell heterogeneity and the relationship between dropout rate and expected expression level, we present a cell sub-population based bounded low-rank (PBLR) method to impute the dropouts of scRNA-seq data. Through application to both simulated and real scRNA-seq datasets, PBLR is shown to be effective in recovering dropout events, and it can dramatically improve the low-dimensional representation and the recovery of genegene relationships masked by dropout events compared to several state-of-the-art methods. Moreover, PBLR also detects accurate and robust cell sub-populations automatically, shedding light on its flexibility and generality for scRNA-seq data analysis.
引用
收藏
页码:29 / 40
页数:12
相关论文