Bias, robustness and scalability in single-cell differential expression analysis

被引:0
作者
Soneson C. [1 ,2 ]
Robinson M.D. [1 ,2 ]
机构
[1] Institute of Molecular Life Sciences, University of Zurich, Zurich
[2] SIB Swiss Institute of Bioinformatics, Zurich
关键词
D O I
10.1038/nmeth.4612
中图分类号
学科分类号
摘要
Many methods have been used to determine differential gene expression from single-cell RNA (scRNA)-seq data. We evaluated 36 approaches using experimental and synthetic data and found considerable differences in the number and characteristics of the genes that are called differentially expressed. Prefiltering of lowly expressed genes has important effects, particularly for some of the methods developed for bulk RNA-seq data analysis. However, we found that bulk RNA-seq analysis methods do not generally perform worse than those developed specifically for scRNA-seq. We also present conquer, a repository of consistently processed, analysis-ready public scRNA-seq data sets that is aimed at simplifying method evaluation and reanalysis of published results. Each data set provides abundance estimates for both genes and transcripts, as well as quality control and exploratory analysis reports. © 2018 Nature Publishing Group. All rights reserved.
引用
收藏
页码:255 / 261
页数:6
相关论文
共 52 条
  • [1] Tang F., Et al., mRNA-Seq whole-Transcriptome analysis of a single cell, Nat. Methods, 6, pp. 377-382, (2009)
  • [2] Picelli S., Et al., Smart-seq2 for sensitive full-length transcriptome profiling in single cells, Nat. Methods, 10, pp. 1096-1098, (2013)
  • [3] Klein A.M., Et al., Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells, Cell, 161, pp. 1187-1201, (2015)
  • [4] Macosko E.Z., Et al., Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets, Cell, 161, pp. 1202-1214, (2015)
  • [5] Zheng G.X.Y., Et al., Massively parallel digital transcriptional profiling of single cells, Nat. Commun, 8, (2017)
  • [6] Love M.I., Huber W., Anders S., Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2, Genome Biol, 15, (2014)
  • [7] Robinson M.D., McCarthy D.J., Smyth G.K., EdgeR: A Bioconductor package for differential expression analysis of digital gene expression data, Bioinformatics, 26, pp. 139-140, (2010)
  • [8] Law C.W., Chen Y., Shi W., Smyth G.K., Voom: Precision weights unlock linear model analysis tools for RNA-seq read counts, Genome Biol, 15, (2014)
  • [9] Miao Z., Zhang X., Differential expression analyses for single-cell RNA-Seq: Old questions on new data, Quant. Biol, 4, pp. 243-260, (2016)
  • [10] Jaakkola M.K., Seyednasrollah F., Mehmood A., Elo L.L., Comparison of methods to detect differentially expressed genes between single-cell populations, Brief. Bioinform, 18, pp. 735-743, (2017)