Automated Gating of flow cytometry data via robust model-based clustering

被引:177
作者
Lo, Kenneth [1 ]
Brinkman, Ryan Remy [2 ]
Gottardo, Raphael [1 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z2, Canada
[2] British Columbia Canc Res Ctr, Terry Fox Lab, Vancouver, BC V5Z 1L3, Canada
关键词
Box-Cox transformation; EM algorithm; mixture model; outliers; statistics; t distribution; flow cytometry; gating; clustering;
D O I
10.1002/cyto.a.20531
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The capability of flow cytometry to offer rapid quantification of multidimensional characteristics for millions of cells has made this technology indispensable for health research, medical diagnosis, and treatment. However, the lack of statistical and bioinformatics tools to parallel recent high-throughput technological advancements has hindered this technology from reaching its full potential. We propose a flexible statistical model-based clustering approach for identifying cell populations in flow cytometry data based on t-mixture models with a Box-Cox transformation. This approach generalizes the popular Gaussian mixture models to account for outliers and allow for nonelliptical clusters. We describe an Expectation-Maximization (EM) algorithm to simultaneously handle parameter estimation and transformation selection. Using two publicly available datasets, we demonstrate that our proposed methodology provides enough flexibility and robustness to mimic manual gating results performed by an expert researcher. In addition, we present results from a simulation study, which show that this new clustering framework gives better results in terms of robustness to model misspecification and estimation of the number of clusters, compared to the popular mixture models. The proposed clustering methodology is well adapted to automated analysis of flow cytometry data. It tends to give more reproducible results, and helps reduce the significant subjectivity and human time cost encountered in manual gating analysis. (C) 2008 International Society for Analytical Cytology.
引用
收藏
页码:321 / 332
页数:12
相关论文
共 66 条
  • [1] High content screening applied to large-scale cell biology
    Abraham, VC
    Taylor, DL
    Haskins, JR
    [J]. TRENDS IN BIOTECHNOLOGY, 2004, 22 (01) : 15 - 22
  • [2] TRANSFORMATIONS UNMASKED
    ATKINSON, AC
    [J]. TECHNOMETRICS, 1988, 30 (03) : 311 - 318
  • [3] DNA histogram analysis for node-negative breast cancer
    Bagwell, CB
    [J]. CYTOMETRY PART A, 2004, 58A (01): : 76 - 78
  • [4] MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING
    BANFIELD, JD
    RAFTERY, AE
    [J]. BIOMETRICS, 1993, 49 (03) : 803 - 821
  • [5] CLASSIFICATION AND REGRESSION TREES FOR BONE-MARROW IMMUNOPHENOTYPING
    BECKMAN, RJ
    SALZMAN, GC
    STEWART, CC
    [J]. CYTOMETRY, 1995, 20 (03): : 210 - 217
  • [6] AN ANALYSIS OF TRANSFORMATIONS REVISITED
    BICKEL, PJ
    DOKSUM, KA
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (374) : 296 - 311
  • [7] Assessing a mixture model for clustering with the integrated completed likelihood
    Biernacki, C
    Celeux, G
    Govaert, G
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) : 719 - 725
  • [8] Choosing models in model-based clustering and discriminant analysis
    Biernacki, C
    Govaert, G
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1999, 64 (01) : 49 - 71
  • [9] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [10] Identification of 72 phytoplankton species by radial basis function neural network analysis of flow cytometric data
    Boddy, L
    Morris, CW
    Wilkins, MF
    Al-Haddad, L
    Tarran, GA
    Jonker, RR
    Burkill, PH
    [J]. MARINE ECOLOGY PROGRESS SERIES, 2000, 195 : 47 - 59