Efficient and precise single-cell reference atlas mapping with Symphony

被引:106
作者
Kang, Joyce B. [1 ,2 ,3 ,4 ,5 ,6 ]
Nathan, Aparna [1 ,2 ,3 ,4 ,5 ,6 ]
Weinand, Kathryn [1 ,2 ,3 ,4 ,5 ,6 ]
Zhang, Fan [1 ,2 ,3 ,4 ,5 ,6 ]
Millard, Nghia [1 ,2 ,3 ,4 ,5 ,6 ]
Rumker, Laurie [1 ,2 ,3 ,4 ,5 ,6 ]
Moody, D. Branch [3 ,4 ]
Korsunsky, Ilya [1 ,2 ,3 ,4 ,5 ,6 ]
Raychaudhuri, Soumya [1 ,2 ,3 ,4 ,5 ,6 ,7 ]
机构
[1] Brigham & Womens Hosp, Ctr Data Sci, 75 Francis St, Boston, MA 02115 USA
[2] Brigham & Womens Hosp, Dept Med, Div Genet, 75 Francis St, Boston, MA 02115 USA
[3] Harvard Med Sch, Boston, MA 02115 USA
[4] Brigham & Womens Hosp, Dept Med, 3 Div Rheumatol Inflammat & Immun, 75 Francis St, Boston, MA 02115 USA
[5] Harvard Med Sch, Dept Biomed Informat, Boston, MA 02115 USA
[6] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[7] Univ Manchester, Manchester Acad Hlth Sci Ctr, Ctr Musculoskeletal Res, Versus Arthrit Ctr Genet & Genom, Manchester, Lancs, England
基金
美国国家卫生研究院;
关键词
D O I
10.1038/s41467-021-25957-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advances in single-cell technologies and integration algorithms make it possible to construct comprehensive reference atlases encompassing many donors, studies, disease states, and sequencing platforms. Much like mapping sequencing reads to a reference genome, it is essential to be able to map query cells onto complex, multimillion-cell reference atlases to rapidly identify relevant cell states and phenotypes. We present Symphony (), an algorithm for building large-scale, integrated reference atlases in a convenient, portable format that enables efficient query mapping within seconds. Symphony localizes query cells within a stable low-dimensional reference embedding, facilitating reproducible downstream transfer of reference-defined annotations to the query. We demonstrate the power of Symphony in multiple real-world datasets, including (1) mapping a multi-donor, multi-species query to predict pancreatic cell types, (2) localizing query cells along a developmental trajectory of fetal liver hematopoiesis, and (3) inferring surface protein expression with a multimodal CITE-seq atlas of memory T cells. The number of single-cell RNA-seq datasets generated is increasing rapidly, making methods that map cell types to well-curated references increasingly important. Here, the authors propose an accurate method for mapping single cells onto a reference atlas in seconds.
引用
收藏
页数:21
相关论文
共 70 条
[1]   A comparison of automatic cell identification methods for single-cell RNA sequencing data [J].
Abdelaal, Tamim ;
Michielsen, Lieke ;
Cats, Davy ;
Hoogduin, Dylan ;
Mei, Hailiang ;
Reinders, Marcel J. T. ;
Mahfouz, Ahmed .
GENOME BIOLOGY, 2019, 20 (01)
[2]   A single-cell transcriptomic atlas characterizes ageing tissues in the mouse [J].
Almanzar, Nicole ;
Antony, Jane ;
Baghel, Ankit S. ;
Bakerman, Isaac ;
Bansal, Ishita ;
Barres, Ben A. ;
Beachy, Philip A. ;
Berdnik, Daniela ;
Bilen, Biter ;
Brownfield, Douglas ;
Cain, Corey ;
Chan, Charles K. F. ;
Chen, Michelle B. ;
Clarke, Michael F. ;
Conley, Stephanie D. ;
Darmanis, Spyros ;
Demers, Aaron ;
Demir, Kubilay ;
De Morree, Antoine ;
du Bois, Tessa Divita Haley ;
Ebadi, Hamid ;
Espinoza, F. Hernan ;
Fish, Matt ;
Gan, Qiang ;
George, Benson M. ;
Gillich, Astrid ;
Gomez-Sjoberg, Rafael ;
Green, Foad ;
Genetiano, Geraldine ;
Gu, Xueying ;
Gulati, Gunsagar S. ;
Hahn, Oliver ;
Haney, Michael Seamus ;
Hang, Yan ;
Harris, Lincoln ;
He, Mu ;
Hosseinzadeh, Shayan ;
Huang, Albin ;
Huang, Kerwyn Casey ;
Iram, Tal ;
Isobe, Taichi ;
Ives, Feather ;
Jones, Robert C. ;
Kao, Kevin S. ;
Karkanias, Jim ;
Karnam, Guruswamy ;
Keller, Andreas ;
Kershner, Aaron M. ;
Khoury, Nathalie ;
Kim, Seung K. .
NATURE, 2020, 583 (7817) :590-+
[3]   scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data [J].
Alquicira-Hernandez, Jose ;
Sathe, Anuja ;
Ji, Hanlee P. ;
Quan Nguyen ;
Powell, Joseph E. .
GENOME BIOLOGY, 2019, 20 (01)
[4]   Interpretation of T cell states from single-cell transcriptomics data using reference atlases [J].
Andreatta, Massimo ;
Corria-Osorio, Jesus ;
Muller, Soren ;
Cubas, Rafael ;
Coukos, George ;
Carmona, Santiago J. .
NATURE COMMUNICATIONS, 2021, 12 (01)
[5]   Augmented implicitly restarted Lanczos bidiagonalization methods [J].
Baglama, J ;
Reichel, L .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2005, 27 (01) :19-42
[6]   A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure [J].
Baron, Maayan ;
Veres, Adrian ;
Wolock, Samuel L. ;
Faust, Aubrey L. ;
Gaujoux, Renaud ;
Vetere, Amedeo ;
Ryu, Jennifer Hyoje ;
Wagner, Bridget K. ;
Shen-Orr, Shai S. ;
Klein, Allon M. ;
Melton, Douglas A. ;
Yanai, Itai .
CELL SYSTEMS, 2016, 3 (04) :346-+
[7]   Emerging technologies towards enhancing privacy in genomic data sharing [J].
Berger, Bonnie ;
Cho, Hyunghoon .
GENOME BIOLOGY, 2019, 20 (1)
[8]   Tumor and immune reprogramming during immunotherapy in advanced renal cell carcinoma [J].
Bi, Kevin ;
He, Meng Xiao ;
Bakouny, Ziad ;
Kanodia, Abhay ;
Napolitano, Sara ;
Wu, Jingyi ;
Grimaldi, Grace ;
Braun, David A. ;
Cuoco, Michael S. ;
Mayorga, Angie ;
DelloStritto, Laura ;
Bouchard, Gabrielle ;
Steinharter, John ;
Tewari, Alok K. ;
Vokes, Natalie, I ;
Shannon, Erin ;
Sun, Maxine ;
Park, Jihye ;
Chang, Steven L. ;
McGregor, Bradley A. ;
Haq, Rizwan ;
Denize, Thomas ;
Signoretti, Sabina ;
Guerriero, Jennifer L. ;
Vigneau, Sebastien ;
Rozenblatt-Rosen, Orit ;
Rotem, Asaf ;
Regev, Aviv ;
Choueiri, Toni K. ;
Van Allen, Eliezer M. .
CANCER CELL, 2021, 39 (05) :649-+
[9]   Integrated Single-Cell Analysis Maps the Continuous Regulatory Landscape of Human Hematopoietic Differentiation [J].
Buenrostro, Jason D. ;
Corces, M. Ryan ;
Lareau, Caleb A. ;
Wu, Beijing ;
Schep, Alicia N. ;
Aryee, Martin J. ;
Majeti, Ravindra ;
Chang, Howard Y. ;
Greenleaf, William J. .
CELL, 2018, 173 (06) :1535-+
[10]   Renal cell carcinoma [J].
Cairns, Paul .
CANCER BIOMARKERS, 2011, 9 (1-6) :461-473