Bias, robustness and scalability in single-cell differential expression analysis

Authors
Soneson C. [1, 2]
Robinson M.D. [1, 2]
Affiliations
[1] Institute of Molecular Life Sciences, University of Zurich, Zurich
[2] SIB Swiss Institute of Bioinformatics, Zurich
DOI
10.1038/nmeth.4612
Abstract
Many methods have been used to determine differential gene expression from single-cell RNA (scRNA)-seq data. We evaluated 36 approaches using experimental and synthetic data and found considerable differences in the number and characteristics of the genes that are called differentially expressed. Prefiltering of lowly expressed genes has important effects, particularly for some of the methods developed for bulk RNA-seq data analysis. However, we found that bulk RNA-seq analysis methods do not generally perform worse than those developed specifically for scRNA-seq. We also present conquer, a repository of consistently processed, analysis-ready public scRNA-seq data sets that is aimed at simplifying method evaluation and reanalysis of published results. Each data set provides abundance estimates for both genes and transcripts, as well as quality control and exploratory analysis reports. © 2018 Nature Publishing Group. All rights reserved.
Pages: 255-261
Number of pages: 6