flowClean: Automated Identification and Removal of Fluorescence Anomalies in Flow Cytometry Data

被引:40
作者
Fletez-Brant, Kipper [1 ,2 ,3 ]
Spidlen, Josef [4 ]
Brinkman, Ryan R. [4 ]
Roederer, Mario [3 ]
Chattopadhyay, Pratip K. [3 ]
机构
[1] Johns Hopkins Univ, McKusick Nathans Inst Genet Med, Baltimore, MD USA
[2] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[3] NIAID, Vaccine Res Ctr, NIH, Baltimore, MD USA
[4] British Columbia Canc Agcy, Terry Fox Lab, Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
high throughput; quality control; flow cytometry; automated; BIOCONDUCTOR;
D O I
10.1002/cyto.a.22837
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Modern flow cytometry systems can be coupled to plate readers for high-throughput acquisition. These systems allow hundreds of samples to be analyzed in a single day. Quality control of the data remains challenging, however, and is further complicated when a large number of parameters is measured in an experiment. Our examination of 29,228 publicly available FCS files from laboratories worldwide indicates 13.7% have a fluorescence anomaly. In particular, fluorescence measurements for a sample over the collection time may not remain stable due to fluctuations in fluid dynamics; the impact of instabilities may differ between samples and among parameters. Therefore, we hypothesized that tracking cell populations (which represent a summary of all parameters) in centered log ratio space would provide a sensitive and consistent method of quality control. Here, we present flowClean, an algorithm to track subset frequency changes within a sample during acquisition, and flag time periods with fluorescence perturbations leading to the emergence of false populations. Aberrant time periods are reported as a new parameter and added to a revised data file, allowing users to easily review and exclude those events from further analysis. We apply this method to proof-of-concept datasets and also to a subset of data from a recent vaccine trial. The algorithm flags events that are suspicious by visual inspection, as well as those showing more subtle effects that might not be consistently flagged by investigators reviewing the data manually, and out-performs the current state-of-the-art. flowClean is available as an R package on Bioconductor, as a module on the free-to-use GenePattern web server, and as a plugin for FlowJo X. (C) 2016 International Society for Advancement of Cytometry
引用
收藏
页码:461 / 471
页数:11
相关论文
共 13 条
  • [1] Early immunologic correlates of HIV protection can be identified from computational analysis of complex multivariate T-cell flow cytometry assays*
    Aghaeepour, Nima
    Chattopadhyay, Pratip K.
    Ganesan, Anuradha
    O'Neill, Kieran
    Zare, Habil
    Jalali, Adrin
    Hoos, Holger H.
    Roederer, Mario
    Brinkman, Ryan R.
    [J]. BIOINFORMATICS, 2012, 28 (07) : 1009 - 1016
  • [2] Anderson T.W., 1986, STAT ANAL DATA, V2nd, DOI DOI 10.1007/978-94-009-4109-0
  • [3] Quantum dot semiconductor nanocrystals for immunophenotyping by polychromatic flow cytometry
    Chattopadhyay, Pratip K.
    Price, David A.
    Harper, Theresa F.
    Betts, Michael R.
    Yu, Joanne
    Gostick, Emma
    Perfetto, Stephen P.
    Goepfert, Paul
    Koup, Richard A.
    De Rosa, Stephen C.
    Bruchez, Marcel P.
    Roederer, Mario
    [J]. NATURE MEDICINE, 2006, 12 (08) : 972 - 977
  • [4] High-Throughput Flow Cytometry Data Normalization for Clinical Trials
    Finak, Greg
    Jiang, Wenxin
    Krouse, Kevin
    Wei, Chungwen
    Sanz, Ignacio
    Phippard, Deborah
    Asare, Adam
    De Rosa, Stephen C.
    Self, Steve
    Gottardo, Raphael
    [J]. CYTOMETRY PART A, 2014, 85 (03) : 277 - 286
  • [5] Fry JM, 1996, COMPOSITIONAL DATA A
  • [6] Gentleman R, FLOWQ QUALITY CONTRO
  • [7] Bioconductor: open software development for computational biology and bioinformatics
    Gentleman, RC
    Carey, VJ
    Bates, DM
    Bolstad, B
    Dettling, M
    Dudoit, S
    Ellis, B
    Gautier, L
    Ge, YC
    Gentry, J
    Hornik, K
    Hothorn, T
    Huber, W
    Iacus, S
    Irizarry, R
    Leisch, F
    Li, C
    Maechler, M
    Rossini, AJ
    Sawitzki, G
    Smith, C
    Smyth, G
    Tierney, L
    Yang, JYH
    Zhang, JH
    [J]. GENOME BIOLOGY, 2004, 5 (10)
  • [8] Huber W, 2015, NAT METHODS, V12, P115, DOI [10.1038/nmeth.3252, 10.1038/NMETH.3252]
  • [9] Optimal Detection of Changepoints With a Linear Computational Cost
    Killick, R.
    Fearnhead, P.
    Eckley, I. A.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) : 1590 - 1598
  • [10] Data quality assessment of ungated flow cytometry data in high throughput experiments
    Le Meur, Nolwenn
    Rossini, Anthony
    Gasparetto, Maura
    Smith, Clay
    Brinkman, Ryan R.
    Gentleman, Robert
    [J]. CYTOMETRY PART A, 2007, 71A (06) : 393 - 403