Auto clustering method study of flow cytometry data based on skew t-mixture models

被引:0
作者
Wang, Xian-Wen [1 ]
Chen, Feng [1 ]
Cheng, Zhi [1 ]
Du, Yao-Hua [1 ]
Bao, Hong-Tao [1 ]
Wu, Tai-Hu [1 ]
机构
[1] Institute of Medical Equipment, Academy of Military Medical Sciences, Tianjin
来源
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2014年 / 42卷 / 12期
关键词
EM algorithm; Flow cytometry; Mixture models; Skew t-distribution;
D O I
10.3969/j.issn.0372-2112.2014.12.028
中图分类号
学科分类号
摘要
A major component of flow cytometry data analysis involves gating, which is the process of identifying homogeneous groups of cells. As manual gating is error-prone, non-reproducible, nonstandardized, and time-consuming, we propose a flexible statistical model-based clustering approach to identifying cell populations in flow cytometry data based on skew t-mixture models. This approach, which employs a finite mixture model with the density function of skew t-distribution, estimates parameters via an expectation maximization algorithm. Data analysis from two different experiments prove that the model-based clustering methods give better results in terms of robustness against outliers than non model-based clustering methods. Compared to the Gaussian mixture models, skew normal mixture models and t-mixture models, the skew t-mixture models have more flexibility in clustering symmetric data and leads to lower misclassification rates when handling highly asymmetric data. ©, 2014, Chinese Institute of Electronics. All right reserved.
引用
收藏
页码:2527 / 2535
页数:8
相关论文
共 29 条
  • [1] Satoh C., Dan K., Yamashita T., Et al., Flow cytometric parameters with little interexaminer variability for diagnosing low-grade myelodysplastic syndromes, Leukemia Research, 32, 5, pp. 699-707, (2008)
  • [2] Gratama J.W., Kraan J., Keeney M., Et al., Reduction of variation in T-cell subset enumeration among 55 laboratories using single-platform, three or four-color flow cytometry based on CD45 and SSC-based gating of lymphocytes, Cytometry, 50, 2, pp. 92-101, (2002)
  • [3] Van Blerk M., Bernier M., Bossuyt X., Et al., National external quality assessment scheme for lymphocyte immunophenotyping in Belgium, Clinical Chemistry and Laboratory Medicine, 41, 3, pp. 323-330, (2003)
  • [4] Hahne F., Khodabakhshi A.H., Bashashati A., Et al., Per-channel basis normalization methods for flow cytometry data, Cytometry Part A, 77, 2, pp. 121-131, (2010)
  • [5] Bashashati A., Brinkman R.R., A survey of flow cytometry data analysis methods, Advance in Bioinformatics, 2009, (2009)
  • [6] Demers S., Kim J., Legendre P., Et al., Analyzing multivariate flow cytometric data in aquatic sciences, Cytometry, 13, 3, pp. 291-298, (1992)
  • [7] Wilkins M.F., Hardy S.A., Boddy L., Et al., Comparison of five clustering algorithms to classify phytoplankton from flow cytometry data, Cytometry, 44, 3, pp. 210-217, (2001)
  • [8] Rousseeuw P.J., Kaufman L., Trauwaert E., Fuzzy clustering using scatter matrices, Comput Statist Data Anal, 23, 1, pp. 135-151, (1996)
  • [9] Aghaeepour N., Nikolic R., Hoos H.H., Et al., Rapid cell population identification in flow cytometry data, Cytometry Part A, 79, 1, pp. 6-13, (2011)
  • [10] Zare H., Shooshtari P., Gupta A., Et al., Data reduction for spectral clustering to analyze high throughput flow cytometry data[J/OL], BMC Bioinformatics, 11, (2010)