SWIFT: SCALABLE WEIGHTED ITERATIVE SAMPLING FOR FLOW CYTOMETRY CLUSTERING

被引:16
作者
Naim, Iftekhar [1 ]
Datta, Suprakash [4 ]
Sharma, Gaurav [1 ,2 ]
Cavenaugh, James S. [3 ]
Mosmann, Tim R. [3 ]
机构
[1] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14627 USA
[2] Univ Rochester, Dept Biostatist & Computat Biol, Rochester, NY 14627 USA
[3] Univ Rochester, Ctr Vaccine Biol & Immunol, Rochester, NY 14627 USA
[4] York Univ, Dept Comp Sci & Engn, Toronto, ON M3J 2R7, Canada
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
基金
加拿大自然科学与工程研究理事会;
关键词
Flow cytometry; clustering; Gaussian mixture model; sampling; expectation-maximization; DATASETS;
D O I
10.1109/ICASSP.2010.5495653
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Flow cytometry (FC) is a powerful technology for rapid multivariate analysis and functional discrimination of cells. Current FC platforms generate large, high-dimensional datasets which pose a significant challenge for traditional manual bivariate analysis. Automated multivariate clustering, though highly desirable, is also stymied by the critical requirement of identifying rare populations that form rather small clusters, in addition to the computational challenges posed by the large size and dimensionality of the datasets. In this paper, we address these twin challenges by developing a two-stage scalable multivariate parametric clustering algorithm. In the first stage, we model the data as a mixture of Gaussians and use an iterative weighted sampling technique to estimate the mixture components successively in order of decreasing size. In the second stage, we apply a graphbased hierarchical merging technique to combine Gaussian components with significant overlaps into the final number of desired clusters. The resulting algorithm offers a reduction in complexity over conventional mixture modeling while simultaneously allowing for better detection of small populations. We demonstrate the effectiveness of our method both on simulated data and actual flow cytometry datasets.
引用
收藏
页码:509 / 512
页数:4
相关论文
共 50 条
  • [21] Unbiased segmentation of diffusion-weighted magnetic resonance images of the brain using iterative clustering
    Hadjiprocopis, A
    Rashid, W
    Tofts, PS
    MAGNETIC RESONANCE IMAGING, 2005, 23 (08) : 877 - 885
  • [22] A Two-Stage Clustering Technique for Automatic Biaxial Gating of Flow Cytometry Data
    Pouyan, M. Baran
    Jindal, V.
    Birjandtalab, J.
    Nourani, M.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 511 - 516
  • [23] Bayesian clustering of flow cytometry data for the diagnosis of B-Chronic Lymphocytic Leukemia
    Lakoumentas, John
    Drakos, John
    Karakantza, Marina
    Nikiforidis, George C.
    Sakellaropoulos, George C.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (02) : 251 - 261
  • [24] Under-sampling Algorithm with Weighted Distance Based on Adaptive K-Means Clustering
    Qian Z.
    Zhen Y.
    Bo S.
    Data Analysis and Knowledge Discovery, 2022, 6 (05) : 127 - 136
  • [25] Combination of automated high throughput platforms, flow cytometry, and hierarchical clustering to detect cell state
    Kitsos, Christine M.
    Bhamidipati, Phani
    Melnikova, Irena
    Cash, Ethan P.
    McNulty, Chris
    Furman, Julia
    Cima, Michael J.
    Levinson, Douglas
    CYTOMETRY PART A, 2007, 71A (01) : 16 - 27
  • [26] Adaptive weighted over-sampling for imbalanced datasets based on density peaks clustering with heuristic filtering
    Tao, Xinmin
    Li, Qing
    Guo, Wenjie
    Ren, Chao
    He, Qing
    Liu, Rui
    Zou, JunRong
    INFORMATION SCIENCES, 2020, 519 : 43 - 73
  • [27] Single point iterative weighted fuzzy C-means clustering algorithm for remote sensing image segmentation
    Fan, Jianchao
    Han, Min
    Wang, Jun
    PATTERN RECOGNITION, 2009, 42 (11) : 2527 - 2540
  • [28] Model-based cell clustering and population tracking for time-series flow cytometry data
    Kodai Minoura
    Ko Abe
    Yuka Maeda
    Hiroyoshi Nishikawa
    Teppei Shimamura
    BMC Bioinformatics, 20
  • [29] Auto clustering method study of flow cytometry data based on skew t-mixture models
    Wang, Xian-Wen
    Chen, Feng
    Cheng, Zhi
    Du, Yao-Hua
    Bao, Hong-Tao
    Wu, Tai-Hu
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (12): : 2527 - 2535
  • [30] Model-based cell clustering and population tracking for time-series flow cytometry data
    Minoura, Kodai
    Abe, Ko
    Maeda, Yuka
    Nishikawa, Hiroyoshi
    Shimamura, Teppei
    BMC BIOINFORMATICS, 2019, 20 (01)