scMatch: a single-cell gene expression profile annotation tool using reference datasets

Cited by: 71
Authors
Hou, Rui [1, 2]
Denisenko, Elena [1, 2]
Forrest, Alistair R. R. [1]
Affiliations
[1] Univ Western Australia, QEII Med Ctr, Harry Perkins Inst Med Res, Perth, WA 6009, Australia
[2] Univ Western Australia, Ctr Med Res, Perth, WA 6009, Australia
Funding
UK Medical Research Council;
Keywords
MESSENGER-RNA; SEQ; HETEROGENEITY; DYNAMICS;
DOI
10.1093/bioinformatics/btz292
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: Single-cell RNA sequencing (scRNA-seq) measures gene expression at the resolution of individual cells. Massively multiplexed single-cell profiling has enabled large-scale transcriptional analyses of thousands of cells in complex tissues. In most cases, the true identity of individual cells is unknown and needs to be inferred from the transcriptomic data. Existing methods typically cluster (group) cells based on similarities of their gene expression profiles and assign the same identity to all cells within each cluster using the averaged expression levels. However, scRNA-seq experiments typically produce low-coverage sequencing data for each cell, which hinders the clustering process.
Results: We introduce scMatch, which directly annotates single cells by identifying their closest match in large reference datasets. We used this strategy to annotate various single-cell datasets and evaluated the impacts of sequencing depth, similarity metric and reference datasets. We found that scMatch can rapidly and robustly annotate single cells with comparable accuracy to another recent cell annotation tool (SingleR), but that it is quicker and can handle larger reference datasets. We demonstrate how scMatch can handle large customized reference gene expression profiles that combine data from multiple sources, thus empowering researchers to identify cell populations in any complex tissue with the desired precision.
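The closest-match idea described in the abstract can be illustrated with a minimal sketch. It is not the scMatch implementation: the choice of Spearman rank correlation as the similarity metric, the function names (`average_ranks`, `spearman`, `annotate_cell`), and the toy expression values are all assumptions made for illustration only.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either vector is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def annotate_cell(cell, references):
    """Return (label, score) of the reference profile most similar to the cell.

    `references` maps a cell-type label to an expression vector over the
    same genes, in the same order, as `cell` (hypothetical data layout).
    """
    best_label, best_score = None, float("-inf")
    for label, profile in references.items():
        score = spearman(cell, profile)
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


# Toy reference profiles over five genes (made-up numbers).
references = {
    "T cell": [9.0, 8.0, 0.5, 0.1, 1.0],
    "B cell": [0.5, 1.0, 9.0, 8.0, 1.5],
    "Macrophage": [0.2, 0.3, 0.4, 1.0, 9.0],
}
cell = [3, 2, 0, 0, 1]  # sparse, low-coverage counts for one cell

label, score = annotate_cell(cell, references)
print(label, round(score, 3))
```

A rank-based metric is used in this sketch because it depends only on the ordering of expression values, which makes a sparse single-cell count vector directly comparable to a reference profile measured on a different scale.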
Pages: 4688-4695
Number of pages: 8