Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST

被引:100
作者
Cao, Zhi-Jie [1 ]
Wei, Lin [1 ]
Lu, Shen [1 ]
Yang, De-Chang [1 ]
Gao, Ge [1 ]
机构
[1] Peking Univ, Biomed Pioneering Innovat Ctr BIOPIC, Beijing Adv Innovat Ctr Genom ICG, Ctr Bioinformat CBI,Sch Life Sci, Beijing 100871, Peoples R China
关键词
RNA-SEQUENCING DATA; SINGLE; EXPRESSION; ATLAS;
D O I
10.1038/s41467-020-17281-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-cell RNA-seq (scRNA-seq) is being used widely to resolve cellular heterogeneity. With the rapid accumulation of public scRNA-seq data, an effective and efficient cell-querying method is critical for the utilization of the existing annotations to curate newly sequenced cells. Such a querying method should be based on an accurate cell-to-cell similarity measure, and capable of handling batch effects properly. Herein, we present Cell BLAST, an accurate and robust cell-querying method built on a neural network-based generative model and a customized cell-to-cell similarity metric. Through extensive benchmarks and case studies, we demonstrate the effectiveness of Cell BLAST in annotating discrete cell types and continuous cell differentiation potential, as well as identifying novel cell types. Powered by a well-curated reference database and a user-friendly Web server, Cell BLAST provides the one-stop solution for real-world scRNA-seq cell querying and annotation. Single-cell RNA-seq (scRNA-seq) is being widely used to resolve cellular heterogeneity. Here, the authors present a cell-querying method built on a neural network-based generative model and a customized cell-to-cell similarity metric.
引用
收藏
页数:13
相关论文
共 67 条
[41]  
Lun Aaron T L, 2016, F1000Res, V5, P2122
[42]  
Mahalanobis P., 1936, Sankhya: The Indian Journal of Statistics, Series A, V12, P49, DOI [10.1007/s13171-019-00164-5, DOI 10.1007/S13171-019-00164-5]
[43]  
Makhzani A., 2016, WORKSH TRACK 4 INT C
[44]  
Marques S, 2016, SCIENCE, V352, P1326, DOI 10.1126/science.aaf6463
[45]  
McInnes L, 2020, Arxiv, DOI [arXiv:1802.03426, DOI 10.21105/JOSS.00861, DOI 10.48550/ARXIV.1802.03426]
[46]   A revised airway epithelial hierarchy includes CFTR-expressing ionocytes [J].
Montoro, Daniel T. ;
Haber, Adam L. ;
Biton, Moshe ;
Vinarsky, Vladimir ;
Lin, Brian ;
Birket, Susan E. ;
Yuan, Feng ;
Chen, Sijia ;
Leung, Hui Min ;
Villoria, Jorge ;
Rogel, Noga ;
Burgin, Grace ;
Tsankov, Alexander M. ;
Waghray, Avinash ;
Slyper, Michal ;
Waldman, Julia ;
Nguyen, Lan ;
Dionne, Danielle ;
Rozenblatt-Rosen, Orit ;
Tata, Purushothama Rao ;
Mou, Hongmei ;
Shivaraju, Manjunatha ;
Bihler, Hermann ;
Mense, Martin ;
Tearney, Guillermo J. ;
Rowe, Steven M. ;
Engelhardt, John F. ;
Regev, Aviv ;
Rajagopal, Jayaraj .
NATURE, 2018, 560 (7718) :319-+
[47]  
Picelli S, 2013, NAT METHODS, V10, P1096, DOI [10.1038/NMETH.2639, 10.1038/nmeth.2639]
[48]   ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis [J].
Pierson, Emma ;
Yau, Christopher .
GENOME BIOLOGY, 2015, 16
[49]   A single-cell atlas of the airway epithelium reveals the CFTR-rich pulmonary ionocyte [J].
Plasschaert, Lindsey W. ;
Zilionis, Rapolas ;
Choo-Wing, Rayman ;
Savova, Virginia ;
Knehr, Judith ;
Roma, Guglielmo ;
Klein, Allon M. ;
Jaffe, Aron B. .
NATURE, 2018, 560 (7718) :377-+
[50]  
Qizhe H., 2017, ADV NEURAL INFORM PR, P585, DOI DOI 10.5555/3294771.3294827