SWIFT: SCALABLE WEIGHTED ITERATIVE SAMPLING FOR FLOW CYTOMETRY CLUSTERING

被引:16
作者
Naim, Iftekhar [1 ]
Datta, Suprakash [4 ]
Sharma, Gaurav [1 ,2 ]
Cavenaugh, James S. [3 ]
Mosmann, Tim R. [3 ]
机构
[1] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14627 USA
[2] Univ Rochester, Dept Biostatist & Computat Biol, Rochester, NY 14627 USA
[3] Univ Rochester, Ctr Vaccine Biol & Immunol, Rochester, NY 14627 USA
[4] York Univ, Dept Comp Sci & Engn, Toronto, ON M3J 2R7, Canada
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
基金
加拿大自然科学与工程研究理事会;
关键词
Flow cytometry; clustering; Gaussian mixture model; sampling; expectation-maximization; DATASETS;
D O I
10.1109/ICASSP.2010.5495653
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Flow cytometry (FC) is a powerful technology for rapid multivariate analysis and functional discrimination of cells. Current FC platforms generate large, high-dimensional datasets which pose a significant challenge for traditional manual bivariate analysis. Automated multivariate clustering, though highly desirable, is also stymied by the critical requirement of identifying rare populations that form rather small clusters, in addition to the computational challenges posed by the large size and dimensionality of the datasets. In this paper, we address these twin challenges by developing a two-stage scalable multivariate parametric clustering algorithm. In the first stage, we model the data as a mixture of Gaussians and use an iterative weighted sampling technique to estimate the mixture components successively in order of decreasing size. In the second stage, we apply a graphbased hierarchical merging technique to combine Gaussian components with significant overlaps into the final number of desired clusters. The resulting algorithm offers a reduction in complexity over conventional mixture modeling while simultaneously allowing for better detection of small populations. We demonstrate the effectiveness of our method both on simulated data and actual flow cytometry datasets.
引用
收藏
页码:509 / 512
页数:4
相关论文
共 50 条
  • [1] Scalable Clustering with Adaptive Instance Sampling
    Yang, JaeKyung
    Yu, ByoungJin
    Choi, MyoungJin
    2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013, : 1309 - 1313
  • [2] Clustering of cell populations in flow cytometry data using a combination of Gaussian mixtures
    Reiter, Michael
    Rota, Paolo
    Kleber, Florian
    Diem, Markus
    Groeneveld-Krentz, Stefanie
    Dworzak, Michael
    PATTERN RECOGNITION, 2016, 60 : 1029 - 1040
  • [3] The Challange of Clustering Flow Cytometry Data from Phytoplankton in Lakes
    Gluege, Stefan
    Pomati, Francesco
    Albert, Carlo
    Kauf, Peter
    Ott, Thomas
    NONLINEAR DYNAMICS OF ELECTRONIC SYSTEMS, 2014, 438 : 379 - 386
  • [4] A Scalable Pipeline for High-Throughput Flow Cytometry
    Wilson, Aaron C.
    Moutsatsos, Ioannis K.
    Yu, Gary
    Pineda, Javier J.
    Feng, Yan
    Auld, Douglas S.
    SLAS DISCOVERY, 2018, 23 (07) : 708 - 718
  • [5] Color-scalable flow cytometry with Raman tags
    Nishiyama, Ryo
    Hiramatsu, Kotaro
    Kawamura, Shintaro
    Dodo, Kosuke
    Furuyaa, Kei
    de Pablo, Julia Gala
    Takizawa, Shigekazu
    Min, Wei
    Sodeoka, Mikiko
    Goda, Keisuke
    PNAS NEXUS, 2023, 2 (02):
  • [6] Stochastic Load Flow Calculation Method Based on Clustering and Sampling
    Xie H.
    Ren C.
    Guo Z.
    Zhang P.
    Guo B.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2020, 35 (23): : 4940 - 4948
  • [7] Clustering and Kernel Density Estimation for Assessment of Measurable Residual Disease by Flow Cytometry
    Jacqmin, Hugues
    Chatelain, Bernard
    Louveaux, Quentin
    Jacqmin, Philippe
    Dogne, Jean-Michel
    Graux, Carlos
    Mullier, Francois
    DIAGNOSTICS, 2020, 10 (05)
  • [8] Ultrafast clustering of single-cell flow cytometry data using FlowGrid
    Ye, Xiaoxin
    Ho, Joshua W. K.
    BMC SYSTEMS BIOLOGY, 2019, 13
  • [9] Automated clustering of heterotrophic bacterioplankton in flow cytometry data
    Garcia, Francisca C.
    Lopez-Urrutia, Angel
    Moran, Xose Anxelu G.
    AQUATIC MICROBIAL ECOLOGY, 2014, 72 (02) : 175 - 185
  • [10] Feature-guided clustering of multi-dimensional flow cytometry datasets
    Zeng, Qing T.
    Pratt, Juan Pablo
    Pak, Jane
    Ravnic, Dino
    Huss, Harold
    Mentzer, Steven J.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (03) : 325 - 331