SCOPIT: sample size calculations for single-cell sequencing experiments

被引:34
作者
Davis, Alexander [1 ,2 ]
Gao, Ruli [1 ]
Navin, Nicholas E. [1 ,3 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Genet, Houston, TX 77030 USA
[2] Univ Texas MD Anderson Canc Ctr UTHlth, Grad Sch Biomed Sci, Houston, TX USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
关键词
Single cell sequencing; Sample size; Multinomial distributions; DESIGN;
D O I
10.1186/s12859-019-3167-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background In single cell DNA and RNA sequencing experiments, the number of cells to sequence must be decided before running an experiment, and afterwards, it is necessary to decide whether sufficient cells were sampled. These questions can be addressed by calculating the probability of sampling at least a defined number of cells from each subpopulation (cell type or cancer clone). Results We developed an interactive web application called SCOPIT (Single-Cell One-sided Probability Interactive Tool), which calculates the required probabilities using a multinomial distribution (). In addition, we created an R package called pmultinom for scripting these calculations. Conclusions Our tool for fast multinomial calculations provide a simple and intuitive procedure for prospectively planning single-cell experiments or retrospectively evaluating if sufficient numbers of cells have been sequenced. The web application can be accessed at navinlab.com/SCOPIT.
引用
收藏
页数:6
相关论文
共 14 条
  • [1] [Anonymous], 2003, Probability Theory
  • [2] Experimental design for single-cell RNA sequencing
    Baran-Gale, Jeanette
    Chandra, Tamir
    Kirschner, Kristina
    [J]. BRIEFINGS IN FUNCTIONAL GENOMICS, 2018, 17 (04) : 233 - 239
  • [3] On the interpretation of x(2) from contingency tables, and the calculation of P
    Fisher, RA
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY, 1922, 85 : 87 - 94
  • [4] The design and implementation of FFTW3
    Frigo, M
    Johnson, SG
    [J]. PROCEEDINGS OF THE IEEE, 2005, 93 (02) : 216 - 231
  • [5] Gao R., 2016, NAT GENET, V48, P1
  • [6] Gotelli Nicholas J., 2011, P39
  • [7] Evaluation of algorithms for generating Dirichlet random vectors
    Hung, Y. C.
    Balakrishnan, N.
    Cheng, C. W.
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2011, 81 (04) : 445 - 459
  • [8] A REPRESENTATION FOR MULTINOMIAL CUMULATIVE DISTRIBUTION-FUNCTIONS
    LEVIN, B
    [J]. ANNALS OF STATISTICS, 1981, 9 (05) : 1123 - 1126
  • [9] The first five years of single-cell cancer genomics and beyond
    Navin, Nicholas E.
    [J]. GENOME RESEARCH, 2015, 25 (10) : 1499 - 1507
  • [10] Shen TJ, 2003, ECOLOGY, V84, P798, DOI 10.1890/0012-9658(2003)084[0798:PTNONS]2.0.CO