Flow Cytometry Bioinformatics

被引:72
作者
O'Neill, Kieran [1 ,2 ]
Aghaeepour, Nima [1 ,2 ]
Spidlen, Josef [1 ]
Brinkman, Ryan [1 ,3 ]
机构
[1] BC Canc Agcy, Terry Fox Lab, Vancouver, BC, Canada
[2] Univ British Columbia, Bioinformat Grad Program, Vancouver, BC V5Z 1M9, Canada
[3] Univ British Columbia, Dept Med Genet, Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
CELL MASS CYTOMETRY; DATA FILE STANDARD; BIOCONDUCTOR PACKAGE; HIERARCHY; SUBSETS; FUTURE;
D O I
10.1371/journal.pcbi.1003365
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Flow cytometry bioinformatics is the application of bioinformatics to flow cytometry data, which involves storing, retrieving, organizing, and analyzing flow cytometry data using extensive computational resources and tools. Flow cytometry bioinformatics requires extensive use of and contributes to the development of techniques from computational statistics and machine learning. Flow cytometry and related methods allow the quantification of multiple independent biomarkers on large numbers of single cells. The rapid growth in the multidimensionality and throughput of flow cytometry data, particularly in the 2000s, has led to the creation of a variety of computational analysis methods, data standards, and public databases for the sharing of results. Computational methods exist to assist in the preprocessing of flow cytometry data, identifying cell populations within it, matching those cell populations across samples, and performing diagnosis and discovery using the results of previous steps. For preprocessing, this includes compensating for spectral overlap, transforming data onto scales conducive to visualization and analysis, assessing data for quality, and normalizing data across samples and experiments. For population identification, tools are available to aid traditional manual identification of populations in two-dimensional scatter plots (gating), to use dimensionality reduction to aid gating, and to find populations automatically in higher dimensional space in a variety of ways. It is also possible to characterize data in more comprehensive ways, such as the density-guided binary space partitioning technique known as probability binning, or by combinatorial gating. Finally, diagnosis using flow cytometry data can be aided by supervised learning techniques, and discovery of new cell types of biological importance by high-throughput statistical methods, as part of pipelines incorporating all of the aforementioned methods. Open standards, data, and software are also key parts of flow cytometry bioinformatics. Data standards include the widely adopted Flow Cytometry Standard (FCS) defining how data from cytometers should be stored, but also several new standards under development by the International Society for Advancement of Cytometry (ISAC) to aid in storing more detailed information about experimental design and analytical steps. Open data is slowly growing with the opening of the CytoBank database in 2010 and FlowRepository in 2012, both of which allow users to freely distribute their data, and the latter of which has been recommended as the preferred repository for MIFlowCyt-compliant data by ISAC. Open software is most widely available in the form of a suite of Bioconductor packages, but is also available for web execution on the GenePattern platform.
引用
收藏
页数:10
相关论文
共 81 条
  • [31] Multiplexed mass cytometry profiling of cellular states perturbed by small-molecule regulators
    Bodenmiller, Bernd
    Zunder, Eli R.
    Finck, Rachel
    Chen, Tiffany J.
    Savig, Erica S.
    Bruggner, Robert V.
    Simonds, Erin F.
    Bendall, Sean C.
    Sachs, Karen
    Krutzik, Peter O.
    Nolan, Garry P.
    [J]. NATURE BIOTECHNOLOGY, 2012, 30 (09) : 858 - U89
  • [32] Brando B, 2000, CYTOMETRY, V42, P327, DOI 10.1002/1097-0320(20001215)42:6<327::AID-CYTO1000>3.0.CO
  • [33] 2-F
  • [34] A chromatic explosion: the development and future of multiparameter flow cytometry
    Chattopadhyay, Pratip K.
    Hogerkorp, Carl-Magnus
    Roederer, Mario
    [J]. IMMUNOLOGY, 2008, 125 (04) : 441 - 449
  • [35] Automated pattern-guided principal component analysis vs expert-based immunophenotypic classification of B-cell chronic lymphoproliferative disorders: a step forward in the standardization of clinical immunophenotyping
    Costa, E. S.
    Pedreira, C. E.
    Barrena, S.
    Lecrevisse, Q.
    Flores, J.
    Quijano, S.
    Almeida, J.
    del Carmen Garcia-Macias, M.
    Bottcher, S.
    Van Dongen, J. J. M.
    Orfao, A.
    [J]. LEUKEMIA, 2010, 24 (11) : 1927 - 1933
  • [36] INTRODUCTION TO FLOW-CYTOMETRY DATA FILE STANDARD
    DEAN, PN
    BAGWELL, CB
    LINDMO, T
    MURPHY, RF
    SALZMAN, GC
    [J]. CYTOMETRY, 1990, 11 (03): : 321 - 322
  • [37] Contribution of Multiparameter Flow Cytometry Immunophenotyping to the Diagnostic Screening and Classification of Pediatric Cancer
    Ferreira-Facio, Cristiane S.
    Milito, Cristiane
    Botafogo, Vitor
    Fontana, Marcela
    Thiago, Leandro S.
    Oliveira, Elen
    da Rocha-Filho, Ariovaldo S.
    Werneck, Fernando
    Forny, Danielle N.
    Dekermacher, Samuel
    de Azambuja, Ana Paula
    Ferman, Sima Esther
    Silvestre de Faria, Paulo Antonio
    Land, Marcelo G. P.
    Orfao, Alberto
    Costa, Elaine S.
    [J]. PLOS ONE, 2013, 8 (03):
  • [38] Optimizing transformations for automated, high throughput analysis of flow cytometry data
    Finak, Greg
    Perez, Juan-Manuel
    Weng, Andrew
    Gottardo, Raphael
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [39] flowPeaks: a fast unsupervised clustering for flow cytometry data via K-means and density peak finding
    Ge, Yongchao
    Sealfon, Stuart C.
    [J]. BIOINFORMATICS, 2012, 28 (15) : 2052 - 2058
  • [40] Bioconductor: open software development for computational biology and bioinformatics
    Gentleman, RC
    Carey, VJ
    Bates, DM
    Bolstad, B
    Dettling, M
    Dudoit, S
    Ellis, B
    Gautier, L
    Ge, YC
    Gentry, J
    Hornik, K
    Hothorn, T
    Huber, W
    Iacus, S
    Irizarry, R
    Leisch, F
    Li, C
    Maechler, M
    Rossini, AJ
    Sawitzki, G
    Smith, C
    Smyth, G
    Tierney, L
    Yang, JYH
    Zhang, JH
    [J]. GENOME BIOLOGY, 2004, 5 (10)