Cluster stability in the analysis of mass cytometry data

被引:19
|
作者
Melchiotti, Rossella [1 ,2 ]
Gracio, Filipe [1 ,2 ]
Kordasti, Shahram [3 ]
Todd, Alan K. [1 ,2 ]
de Rinaldis, Emanuele [1 ,2 ]
机构
[1] Guys & St Thomas NHS Fdn Trust, 16th Floor,Tower Wing Guys Hosp, London SE1 9RT, England
[2] Kings Coll London, Translat Bioinformat Platform, R&D Dept, Biomed Res Ctr, London SE1 9RT, England
[3] Kings Coll London, Rayne Inst, Canc Studies Div, Dept Haematol Med, London SE5 9NU, England
关键词
CyTOF; data analysis; automated clustering; cluster stability; SPADE; PhenoGraph; FLOCK; AUTOMATED IDENTIFICATION; CELL SUBSETS; IMMUNE;
D O I
10.1002/cyto.a.23001
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Manual gating has been traditionally applied to cytometry data sets to identify cells based on protein expression. The advent of mass cytometry allows for a higher number of proteins to be simultaneously measured on cells, therefore providing a means to define cell clusters in a high dimensional expression space. This enhancement, whilst opening unprecedented opportunities for single cell-level analyses, makes the incremental replacement of manual gating with automated clustering a compelling need. To this aim many methods have been implemented and their successful applications demonstrated in different settings. However, the reproducibility of automatically generated clusters is proving challenging and an analytical framework to distinguish spurious clusters from more stable entities, and presumably more biologically relevant ones, is still missing. One way to estimate cell clusters' stability is the evaluation of their consistent re-occurrence within- and between-algorithms, a metric that is commonly used to evaluate results from gene expression. Herein we report the usage and importance of cluster stability evaluations, when applied to results generated from three popular clustering algorithms - SPADE, FLOCK and PhenoGraph - run on four different data sets. These algorithms were shown to generate clusters with various degrees of statistical stability, many of them being unstable. By comparing the results of automated clustering with manually gated populations, we illustrate how information on cluster stability can assist towards a more rigorous and informed interpretation of clustering results. We also explore the relationships between statistical stability and other properties such as clusters' compactness and isolation, demonstrating that whilst cluster stability is linked to other properties it cannot be reliably predicted by any of them. Our study proposes the introduction of cluster stability as a necessary checkpoint for cluster interpretation and contributes to the construction of a more systematic and standardized analytical framework for the assessment of cytometry clustering results. (c) 2016 International Society for Advancement of Cytometry
引用
收藏
页码:73 / 84
页数:12
相关论文
共 50 条
  • [1] Compositional Data Analysis using Kernels in mass cytometry data
    Rudra, Pratyaydipta
    Baxter, Ryan
    Hsieh, Elena W. Y.
    Ghosh, Debashis
    BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [2] The end of gating? An introduction to automated analysis of high dimensional cytometry data
    Mair, Florian
    Hartmann, Felix J.
    Mrdjen, Dunja
    Tosevski, Vinko
    Krieg, Carsten
    Becher, Burkhard
    EUROPEAN JOURNAL OF IMMUNOLOGY, 2016, 46 (01) : 34 - 43
  • [3] Insights into highly multiplexed tissue images: A primer for Mass Cytometry Imaging data analysis
    Zhao, Jing
    Liu, Yaquan
    Wang, Meng
    Ma, Junjie
    Yang, Pu
    Wang, Shunhao
    Wu, Qi
    Gao, Jie
    Chen, Mingli
    Qu, Guangbo
    Wang, Jianhua
    Jiang, Guibin
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2022, 157
  • [4] High-Dimensional Data Analysis Algorithms Yield Comparable Results for Mass Cytometry and Spectral Flow Cytometry Data
    Ferrer-Font, Laura
    Mayer, Johannes U.
    Old, Samuel
    Hermans, Ian F.
    Irish, Jonathan
    Price, Kylie M.
    CYTOMETRY PART A, 2020, 97 (08) : 824 - 831
  • [5] ImmCellTyper facilitates systematic mass cytometry data analysis for deep immune profiling
    Sun, Jing
    Choy, Desmond
    Sompairac, Nicolas
    Jamshidi, Shirin
    Mishto, Michele
    Kordasti, Shahram
    ELIFE, 2024, 13
  • [6] Comparison of Clustering Methods for High-Dimensional Single-Cell Flow and Mass Cytometry Data
    Weber, Lukas M.
    Robinson, Mark D.
    CYTOMETRY PART A, 2016, 89A (12) : 1084 - 1096
  • [7] HiPPO and PANDA: Two Bioinformatics Tools to Support Analysis of Mass Cytometry Data
    Pirro, Stefano
    Spada, Filomena
    Gadaleta, Emanuela
    Ferrentino, Federica
    Thorn, Graeme J.
    Cesareni, Gianni
    Chelala, Claude
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (08) : 1283 - 1294
  • [8] Normalization of mass cytometry data with bead standards
    Finck, Rachel
    Simonds, Erin F.
    Jager, Astraea
    Krishnaswamy, Smita
    Sachs, Karen
    Fantl, Wendy
    Pe'er, Dana
    Nolan, Garry P.
    Bendall, Sean C.
    CYTOMETRY PART A, 2013, 83A (05) : 483 - +
  • [9] Gating mass cytometry data by deep learning
    Li, Huamin
    Shaham, Uri
    Stanton, Kelly P.
    Yao, Yi
    Montgomery, Ruth R.
    Kluger, Yuval
    BIOINFORMATICS, 2017, 33 (21) : 3423 - 3430
  • [10] Testing for differential abundance in mass cytometry data
    Lun, Aaron T. L.
    Richard, Arianne C.
    Marioni, John C.
    NATURE METHODS, 2017, 14 (07) : 707 - +