Quantifying the effect of experimental perturbations at single-cell resolution

被引:101
作者
Burkhardt, Daniel B. [1 ]
Stanley, Jay S., III [2 ]
Tong, Alexander [3 ]
Perdigoto, Ana Luisa [4 ]
Gigante, Scott A. [2 ]
Herold, Kevan C. [4 ]
Wolf, Guy [6 ,7 ]
Giraldez, Antonio J. [1 ]
van Dijk, David [5 ]
Krishnaswamy, Smita [1 ,3 ]
机构
[1] Yale Univ, Dept Genet, New Haven, CT 06510 USA
[2] Yale Univ, Computat Biol & Bioinformat Program, New Haven, CT USA
[3] Yale Univ, Dept Comp Sci, POB 2158, New Haven, CT 06520 USA
[4] Yale Univ, Dept Immunobiol, New Haven, CT USA
[5] Yale Univ, Dept Internal Med Cardiol, New Haven, CT 06510 USA
[6] Univ Montreal, Dept Math & Stat, Montreal, PQ, Canada
[7] Mila Quebec Inst, Montreal, PQ, Canada
关键词
K-NEAREST NEIGHBOR; GRAPH LAPLACIAN REGULARIZATION; IFN-GAMMA; HETEROGENEITY; GENES;
D O I
10.1038/s41587-020-00803-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Matched treatment and control single-cell RNA sequencing samples are more accurately compared at the single-cell level. Current methods for comparing single-cell RNA sequencing datasets collected in multiple conditions focus on discrete regions of the transcriptional state space, such as clusters of cells. Here we quantify the effects of perturbations at the single-cell level using a continuous measure of the effect of a perturbation across the transcriptomic space. We describe this space as a manifold and develop a relative likelihood estimate of observing each cell in each of the experimental conditions using graph signal processing. This likelihood estimate can be used to identify cell populations specifically affected by a perturbation. We also develop vertex frequency clustering to extract populations of affected cells at the level of granularity that matches the perturbation response. The accuracy of our algorithm at identifying clusters of cells that are enriched or depleted in each condition is, on average, 57% higher than the next-best-performing algorithm tested. Gene signatures derived from these clusters are more accurate than those of six alternative algorithms in ground truth comparisons.
引用
收藏
页码:619 / +
页数:20
相关论文
共 74 条
[1]   Islet T cells secreting IFN-γ in NOD mouse diabetes:: Arrest by p277 peptide treatment [J].
Ablamunits, V ;
Elias, D ;
Reshef, T ;
Cohen, IR .
JOURNAL OF AUTOIMMUNITY, 1998, 11 (01) :73-81
[2]  
Ando R. K., 2007, P ADV NEUR INF PROC, P25
[3]  
[Anonymous], 2004, ICLM WORKSH STAT REL
[4]  
[Anonymous], 2011, PROC INT C DISTRIB C, DOI DOI 10.1109/DCOSS.2011.5982158
[5]  
[Anonymous], 2005, P ANN C UNC ART INT
[6]   Identifying and removing the cellcycle effect from single-cell RNA-Sequencing data [J].
Barron, Martin ;
Li, Jun .
SCIENTIFIC REPORTS, 2016, 6
[7]   Dimensionality reduction for visualizing single-cell data using UMAP [J].
Becht, Etienne ;
McInnes, Leland ;
Healy, John ;
Dutertre, Charles-Antoine ;
Kwok, Immanuel W. H. ;
Ng, Lai Guan ;
Ginhoux, Florent ;
Newell, Evan W. .
NATURE BIOTECHNOLOGY, 2019, 37 (01) :38-+
[8]   Regularization and semi-supervised learning on large graphs [J].
Belkin, M ;
Matveeva, I ;
Niyogi, P .
LEARNING THEORY, PROCEEDINGS, 2004, 3120 :624-638
[9]  
Belkin M., 2006, P 20 INT C NEURAL IN, P129
[10]   A weighted k-nearest neighbor density estimate for geometric inference [J].
Biau, Gerard ;
Chazal, Frederic ;
Cohen-Steiner, David ;
Devroye, Luc ;
Rodriguez, Carlos .
ELECTRONIC JOURNAL OF STATISTICS, 2011, 5 :204-237