Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST

被引:100
作者
Cao, Zhi-Jie [1 ]
Wei, Lin [1 ]
Lu, Shen [1 ]
Yang, De-Chang [1 ]
Gao, Ge [1 ]
机构
[1] Peking Univ, Biomed Pioneering Innovat Ctr BIOPIC, Beijing Adv Innovat Ctr Genom ICG, Ctr Bioinformat CBI,Sch Life Sci, Beijing 100871, Peoples R China
关键词
RNA-SEQUENCING DATA; SINGLE; EXPRESSION; ATLAS;
D O I
10.1038/s41467-020-17281-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-cell RNA-seq (scRNA-seq) is being used widely to resolve cellular heterogeneity. With the rapid accumulation of public scRNA-seq data, an effective and efficient cell-querying method is critical for the utilization of the existing annotations to curate newly sequenced cells. Such a querying method should be based on an accurate cell-to-cell similarity measure, and capable of handling batch effects properly. Herein, we present Cell BLAST, an accurate and robust cell-querying method built on a neural network-based generative model and a customized cell-to-cell similarity metric. Through extensive benchmarks and case studies, we demonstrate the effectiveness of Cell BLAST in annotating discrete cell types and continuous cell differentiation potential, as well as identifying novel cell types. Powered by a well-curated reference database and a user-friendly Web server, Cell BLAST provides the one-stop solution for real-world scRNA-seq cell querying and annotation. Single-cell RNA-seq (scRNA-seq) is being widely used to resolve cellular heterogeneity. Here, the authors present a cell-querying method built on a neural network-based generative model and a customized cell-to-cell similarity metric.
引用
收藏
页数:13
相关论文
共 67 条
[1]  
10X Genomics,, 2017, 13 MILL BRAIN CELLS
[2]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[3]   SCPortalen: human and mouse single-cell centric database [J].
Abugessaisa, Imad ;
Noguchi, Shuhei ;
Bottcher, Michael ;
Hasegawa, Akira ;
Kouno, Tsukasa ;
Kato, Sachi ;
Tada, Yuhki ;
Ura, Hiroki ;
Abe, Kuniya ;
Shin, Jay W. ;
Plessy, Charles ;
Carninci, Piero ;
Kasukawa, Takeya .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D781-D787
[4]   Psychrophilic proteases dramatically reduce single-cell RNA-seq artifacts: a molecular atlas of kidney development [J].
Adam, Mike ;
Potter, Andrew S. ;
Potter, S. Steven .
DEVELOPMENT, 2017, 144 (19) :3625-3632
[5]   A web server for comparative analysis of single-cell RNA-seq data [J].
Alavi, Amir ;
Ruffalo, Matthew ;
Parvangada, Aiyappa ;
Huang, Zhilin ;
Bar-Joseph, Ziv .
NATURE COMMUNICATIONS, 2018, 9
[6]   scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data [J].
Alquicira-Hernandez, Jose ;
Sathe, Anuja ;
Ji, Hanlee P. ;
Quan Nguyen ;
Powell, Joseph E. .
GENOME BIOLOGY, 2019, 20 (01)
[7]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[8]   Exploring single-cell data with deep multitasking neural networks [J].
Amodio, Matthew ;
van Dijk, David ;
Srinivasan, Krishnan ;
Chen, William S. ;
Mohsen, Hussein ;
Moon, Kevin R. ;
Campbell, Allison ;
Zhao, Yujiao ;
Wang, Xiaomei ;
Venkataswamy, Manjunatha ;
Desai, Anita ;
Ravi, V. ;
Kumar, Priti ;
Montgomery, Ruth ;
Wolf, Guy ;
Krishnaswamy, Smita .
NATURE METHODS, 2019, 16 (11) :1139-+
[9]   M3Drop: dropout-based feature selection for scRNASeq [J].
Andrews, Tallulah S. ;
Hemberg, Martin .
BIOINFORMATICS, 2019, 35 (16) :2865-2867
[10]  
Arjovsky M., 2017, 5 INT C LEARN REPR O