EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data

被引:0
作者
Aaron T. L. Lun
Samantha Riesenfeld
Tallulah Andrews
The Phuong Dao
Tomas Gomes
John C. Marioni
机构
[1] Cancer Research UK Cambridge Institute,
[2] University of Cambridge,undefined
[3] Klarman Cell Observatory,undefined
[4] Broad Institute of MIT and Harvard,undefined
[5] Wellcome Trust Sanger Institute,undefined
[6] Wellcome Genome Campus,undefined
[7] Program for Computational and Systems Biology,undefined
[8] Sloan Kettering Institute,undefined
[9] EMBL European Bioinformatics Institute,undefined
[10] Wellcome Genome Campus,undefined
来源
Genome Biology | / 20卷
关键词
Single-cell transcriptomics; Droplet-based protocols; Empty droplets; Cell detection;
D O I
暂无
中图分类号
学科分类号
摘要
Droplet-based single-cell RNA sequencing protocols have dramatically increased the throughput of single-cell transcriptomics studies. A key computational challenge when processing these data is to distinguish libraries for real cells from empty droplets. Here, we describe a new statistical method for calling cells from droplet-based data, based on detecting significant deviations from the expression profile of the ambient solution. Using simulations, we demonstrate that EmptyDrops has greater power than existing approaches while controlling the false discovery rate among detected cells. Our method also retains distinct cell types that would have been discarded by existing methods in several real data sets.
引用
收藏
相关论文
共 239 条
[1]  
Macosko EZ(2015)Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets Cell 161 1202-14
[2]  
Basu A(2015)Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells Cell 161 1187-201
[3]  
Satija R(2017)Massively parallel digital transcriptional profiling of single cells Nat Commun 8 14049-6
[4]  
Nemesh J(2014)Quantitative single-cell RNA-seq with unique molecular identifiers Nat Methods 11 163-8
[5]  
Shekhar K(2013)Smart-seq2 for sensitive full-length transcriptome profiling in single cells Nat Methods 10 1096-8
[6]  
Goldman M(2014)Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex Nat Biotechnol 32 1053-37
[7]  
Tirosh I(2017)The Human Cell Atlas Elife 6 e27041-40
[8]  
Bialas AR(1995)Good-Turing frequency estimation without tears J Quant Linguist 2 217-300
[9]  
Kamitaki N(2010)edgeR: a Bioconductor package for differential expression analysis of digital gene expression data Bioinformatics 26 139-8
[10]  
Martersteck EM(2010)Permutation Stat Appl Genet Mol Biol 9 39-91