SERGIO: A Single-Cell Expression Simulator Guided by Gene Regulatory Networks

被引:67
作者
Dibaeinia, Payam [1 ]
Sinha, Saurabh [1 ,2 ,3 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL 61801 USA
[3] Univ Illinois, Canc Ctr Illinois, Urbana, IL 61801 USA
基金
美国国家卫生研究院;
关键词
INFERENCE; STOCHASTICITY; NOISE; FATE;
D O I
10.1016/j.cels.2020.08.003
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A common approach to benchmarking of single-cell transcriptomics tools is to generate synthetic datasets that statistically resemble experimental data. However, most existing single-cell simulators do not incorporate transcription factor-gene regulatory interactions that underlie expression dynamics. Here, we present SERGIO, a simulator of single-cell gene expression data that models the stochastic nature of transcription as well as regulation of genes by multiple transcription factors according to a user-provided gene regulatory network. SERGIO can simulate any number of cell types in steady state or cells differentiating to multiple fates. We show that datasets generated by SERGIO are statistically comparable to experimental data generated by Illumina HiSeq2000, Drop-seq, Illumina 10X chromium, and Smart-seq. We use SERGIO to benchmark several single-cell analysis tools, including GRN inference methods, and identify Tcf7, Gata3, and Bcl11b as key drivers of T cell differentiation by performing in silico knockout experiments.
引用
收藏
页码:252 / +
页数:31
相关论文
共 85 条
[1]  
Aibar S, 2017, NAT METHODS, V14, P1083, DOI [10.1038/NMETH.4463, 10.1038/nmeth.4463]
[2]   M3Drop: dropout-based feature selection for scRNASeq [J].
Andrews, Tallulah S. ;
Hemberg, Martin .
BIOINFORMATICS, 2019, 35 (16) :2865-2867
[3]   Cellular Decision Making and Biological Noise: From Microbes to Mammals [J].
Balazsi, Gabor ;
van Oudenaarden, Alexander ;
Collins, James J. .
CELL, 2011, 144 (06) :910-925
[4]   Signaling in Cell Differentiation and Morphogenesis [J].
Basson, M. Albert .
COLD SPRING HARBOR PERSPECTIVES IN BIOLOGY, 2012, 4 (06)
[5]   Dimensionality reduction for visualizing single-cell data using UMAP [J].
Becht, Etienne ;
McInnes, Leland ;
Healy, John ;
Dutertre, Charles-Antoine ;
Kwok, Immanuel W. H. ;
Ng, Lai Guan ;
Ginhoux, Florent ;
Newell, Evan W. .
NATURE BIOTECHNOLOGY, 2019, 37 (01) :38-+
[6]   NetBenchmark: a bioconductor package for reproducible benchmarks of gene regulatory network inference [J].
Bellot, Pau ;
Olsen, Catharina ;
Salembier, Philippe ;
Oliveras-Verges, Albert ;
Meyer, Patrick E. .
BMC BIOINFORMATICS, 2015, 16
[7]   The Inferelator:: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo [J].
Bonneau, Richard ;
Reiss, David J. ;
Shannon, Paul ;
Facciotti, Marc ;
Hood, Leroy ;
Baliga, Nitin S. ;
Thorsson, Vesteinn .
GENOME BIOLOGY, 2006, 7 (05)
[8]   A cellular census of human lungs identifies novel cell states in health and in asthma [J].
Braga, Felipe A. Vieira ;
Kar, Gozde ;
Berg, Marijn ;
Carpaij, Orestes A. ;
Polanski, Krzysztof ;
Simon, Lukas M. ;
Brouwer, Sharon ;
Gomes, Tomas ;
Hesse, Laura ;
Jiang, Jian ;
Fasouli, Eirini S. ;
Efremova, Mirjana ;
Vento-Tormo, Roser ;
Talavera-Lopez, Carlos ;
Jonker, Marnix R. ;
Affleck, Karen ;
Palit, Subarna ;
Strzelecka, Paulina M. ;
Firth, Helen V. ;
Mahbubani, Krishnaa T. ;
Cvejic, Ana ;
Meyer, Kerstin B. ;
Saeb-Parsy, Kourosh ;
Luinge, Marjan ;
Brandsma, Corry-Anke ;
Timens, Wim ;
Angelidis, Ilias ;
Strunz, Maximilian ;
Koppelman, Gerard H. ;
van Oosterhout, Antoon J. ;
Schiller, Herbert B. ;
Theis, Fabian J. ;
van den Berge, Maarten ;
Nawijn, Martijn C. ;
Teichmann, Sarah A. .
NATURE MEDICINE, 2019, 25 (07) :1153-+
[9]   Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells [J].
Buettner, Florian ;
Natarajan, Kedar N. ;
Casale, F. Paolo ;
Proserpio, Valentina ;
Scialdone, Antonio ;
Theis, Fabian J. ;
Teichmann, Sarah A. ;
Marioni, John C. ;
Stegie, Oliver .
NATURE BIOTECHNOLOGY, 2015, 33 (02) :155-160
[10]   Integrating single-cell transcriptomic data across different conditions, technologies, and species [J].
Butler, Andrew ;
Hoffman, Paul ;
Smibert, Peter ;
Papalexi, Efthymia ;
Satija, Rahul .
NATURE BIOTECHNOLOGY, 2018, 36 (05) :411-+