Dictionary learning for integrative, multimodal and scalable single-cell analysis

被引:617
作者
Hao, Yuhan [1 ,2 ]
Stuart, Tim [1 ,2 ]
Kowalski, Madeline H. H. [2 ,3 ]
Choudhary, Saket [1 ,2 ]
Hoffman, Paul [1 ]
Hartman, Austin [1 ]
Srivastava, Avi [1 ,2 ]
Molla, Gesmira [2 ]
Madad, Shaista [1 ,2 ]
Fernandez-Granda, Carlos [4 ,5 ]
Satija, Rahul [1 ,2 ]
机构
[1] NYU, Ctr Genom & Syst Biol, New York, NY 10012 USA
[2] New York Genome Ctr, New York, NY 10013 USA
[3] NYU Langone Med Ctr, Inst Syst Genet, New York, NY USA
[4] NYU, Ctr Data Sci, New York, NY USA
[5] NYU, Courant Inst Math Sci, New York, NY USA
关键词
RNA-SEQ DATA; CHROMATIN ACCESSIBILITY; T-CELLS; K-SVD; MILD; HETEROGENEITY; PROJECTION; ALIGNMENT; RESOLVES; SPARSE;
D O I
10.1038/s41587-023-01767-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Mapping single-cell sequencing profiles to comprehensive reference datasets provides a powerful alternative to unsupervised analysis. However, most reference datasets are constructed from single-cell RNA-sequencing data and cannot be used to annotate datasets that do not measure gene expression. Here we introduce 'bridge integration', a method to integrate single-cell datasets across modalities using a multiomic dataset as a molecular bridge. Each cell in the multiomic dataset constitutes an element in a 'dictionary', which is used to reconstruct unimodal datasets and transform them into a shared space. Our procedure accurately integrates transcriptomic data with independent single-cell measurements of chromatin accessibility, histone modifications, DNA methylation and protein levels. Moreover, we demonstrate how dictionary learning can be combined with sketching techniques to improve computational scalability and harmonize 8.6 million human immune cell profiles from sequencing and mass cytometry experiments. Our approach, implemented in version 5 of our Seurat toolkit (), broadens the utility of single-cell reference datasets and facilitates comparisons across diverse molecular modalities.
引用
收藏
页码:293 / 304
页数:22
相关论文
共 121 条
[21]   COVID-19 severity correlates with airway epithelium-immune cell interactions identified by single-cell analysis [J].
Chua, Robert Lorenz ;
Lukassen, Soeren ;
Trump, Saskia ;
Hennig, Bianca P. ;
Wendisch, Daniel ;
Pott, Fabian ;
Debnath, Olivia ;
Thuermann, Loreen ;
Kurth, Florian ;
Voelker, Maria Theresa ;
Kazmierski, Julia ;
Timmermann, Bernd ;
Twardziok, Sven ;
Schneider, Stefan ;
Machleidt, Felix ;
Mueller-Redetzky, Holger ;
Maier, Melanie ;
Krannich, Alexander ;
Schmidt, Sein ;
Balzer, Felix ;
Liebig, Johannes ;
Loske, Jennifer ;
Suttorp, Norbert ;
Eils, Juergen ;
Ishaque, Naveed ;
Liebert, Uwe Gerd ;
von Kalle, Christof ;
Hocke, Andreas ;
Witzenrath, Martin ;
Goffinet, Christine ;
Drosten, Christian ;
Laudi, Sven ;
Lehmann, Irina ;
Conrad, Christian ;
Sander, Leif-Erik ;
Eils, Roland .
NATURE BIOTECHNOLOGY, 2020, 38 (08) :970-+
[22]   Joint single-cell measurements of nuclear proteins and RNA in vivo [J].
Chung, Hattie ;
Parkhurst, Christopher N. ;
Magee, Emma M. ;
Phillips, Devan ;
Habibi, Ehsan ;
Chen, Fei ;
Yeung, Bertrand Z. ;
Waldman, Julia ;
Artis, David ;
Regev, Aviv .
NATURE METHODS, 2021, 18 (10) :1204-+
[23]   scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells [J].
Clark, Stephen J. ;
Argelaguet, Ricard ;
Kapourani, Chantriolnt-Andreas ;
Stubbs, Thomas M. ;
Lee, Heather J. ;
Alda-Catalinas, Celia ;
Krueger, Felix ;
Sanguinetti, Guido ;
Kelsey, Gavin ;
Marioni, John C. ;
Stegle, Oliver ;
Reik, Wolf .
NATURE COMMUNICATIONS, 2018, 9
[24]   Genome-wide base-resolution mapping of DNA methylation in single cells using single-cell bisulfite sequencing( scBS-seq) [J].
Clark, Stephen J. ;
Smallwood, Sebastien A. ;
Lee, Heather J. ;
Krueger, Felix ;
Reik, Wolf ;
Kelsey, Gavin .
NATURE PROTOCOLS, 2017, 12 (03) :534-U159
[25]   Low-Rank Approximation and Regression in Input Sparsity Time [J].
Clarkson, Kenneth L. ;
Woodruff, David P. .
JOURNAL OF THE ACM, 2017, 63 (06)
[26]  
Combes Alexis J, 2020, bioRxiv, DOI [10.1038/s41586-021-03234-7, 10.1101/2020.10.28.359935]
[27]   Cross-tissue immune cell analysis reveals tissue-specific features in humans [J].
Conde, C. Dominguez ;
Xu, C. ;
Jarvis, L. B. ;
Rainbow, D. B. ;
Wells, S. B. ;
Gomes, T. ;
Howlett, S. K. ;
Suchanek, O. ;
Polanski, K. ;
King, H. W. ;
Mamanova, L. ;
Huang, N. ;
Szabo, P. A. ;
Richardson, L. ;
Bolt, L. ;
Fasouli, E. S. ;
Mahbubani, K. T. ;
Prete, M. ;
Tuck, L. ;
Richoz, N. ;
Tuong, Z. K. ;
Campos, L. ;
Mousa, H. S. ;
Needham, E. J. ;
Pritchard, S. ;
Li, T. ;
Elmentaite, R. ;
Park, J. ;
Rahmani, E. ;
Chen, D. ;
Menon, D. K. ;
Bayraktar, O. A. ;
James, L. K. ;
Meyer, K. B. ;
Yosef, N. ;
Clatworthy, M. R. ;
Sims, P. A. ;
Farber, D. L. ;
Saeb-Parsy, K. ;
Jones, J. L. ;
Teichmann, S. A. .
SCIENCE, 2022, 376 (6594) :713-+
[28]   Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing [J].
Cusanovich, Darren A. ;
Daza, Riza ;
Adey, Andrew ;
Pliner, Hannah A. ;
Christiansen, Lena ;
Gunderson, Kevin L. ;
Steemers, Frank J. ;
Trapnell, Cole ;
Shendure, Jay .
SCIENCE, 2015, 348 (6237) :910-914
[29]   Ultra-high-throughput single-cell RNA sequencing and perturbation screening with combinatorial fluidic indexing [J].
Datlinger, Paul ;
Rendeiro, Andre F. ;
Boenke, Thorina ;
Senekowitsch, Martin ;
Krausgruber, Thomas ;
Barreca, Daniele ;
Bock, Christoph .
NATURE METHODS, 2021, 18 (06) :635-+
[30]   COVID-19 tissue atlases reveal SARS-CoV-2 pathology and cellular targets [J].
Delorey, Toni M. ;
Ziegler, Carly G. K. ;
Heimberg, Graham ;
Normand, Rachelly ;
Yang, Yiming ;
Segerstolpe, Asa ;
Abbondanza, Domenic ;
Fleming, Stephen J. ;
Subramanian, Ayshwarya ;
Montoro, Daniel T. ;
Jagadeesh, Karthik A. ;
Dey, Kushal K. ;
Sen, Pritha ;
Slyper, Michal ;
Pita-Juarez, Yered H. ;
Phillips, Devan ;
Biermann, Jana ;
Bloom-Ackermann, Zohar ;
Barkas, Nikolaos ;
Ganna, Andrea ;
Gomez, James ;
Melms, Johannes C. ;
Katsyv, Igor ;
Normandin, Erica ;
Naderi, Pourya ;
Popov, Yury V. ;
Raju, Siddharth S. ;
Niezen, Sebastian ;
Tsai, Linus T. -Y. ;
Siddle, Katherine J. ;
Sud, Malika ;
Tran, Victoria M. ;
Vellarikkal, Shamsudheen K. ;
Wang, Yiping ;
Amir-Zilberstein, Liat ;
Atri, Deepak S. ;
Beechem, Joseph ;
Brook, Olga R. ;
Chen, Jonathan ;
Divakar, Prajan ;
Dorceus, Phylicia ;
Engreitz, Jesse M. ;
Essene, Adam ;
Fitzgerald, Donna M. ;
Fropf, Robin ;
Gazal, Steven ;
Gould, Joshua ;
Grzyb, John ;
Harvey, Tyler ;
Hecht, Jonathan .
NATURE, 2021, 595 (7865) :107-+