A robust and interpretable end-to-end deep learning model for cytometry data

Cited by: 31
Authors
Hu, Zicheng [1]
Tang, Alice [1]
Singh, Jaiveer [1]
Bhattacharya, Sanchita [1]
Butte, Atul J. [1]
Affiliations
[1] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA 94158 USA
Keywords
CyTOF; flow cytometry; deep learning; cytomegalovirus; model interpretation; FLOW-CYTOMETRY; AUTOMATED IDENTIFICATION; EXPRESSION; INFECTION; DIAGNOSIS; MAPS; MASS
DOI
10.1073/pnas.2003026117
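Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Subject classification codes
07; 0710; 09
Abstract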
Cytometry technologies are essential tools for immunology research, providing high-throughput measurements of immune cells at the single-cell level. Existing approaches to interpreting and using cytometry measurements rely on manual or automated gating to identify cell subsets from the cytometry data. Gating provides highly intuitive results but may lead to significant information loss, in that additional details in measured or correlated cell signals might be missed. In this study, we propose and test a deep convolutional neural network for analyzing cytometry data in an end-to-end fashion, allowing a direct association between raw cytometry data and the clinical outcome of interest. Using nine large cytometry by time-of-flight mass spectrometry, or mass cytometry (CyTOF), studies from the open-access ImmPort database, we demonstrated that the deep convolutional neural network model can accurately diagnose latent cytomegalovirus (CMV) infection in healthy individuals, even when using highly heterogeneous data from different studies. In addition, we developed a permutation-based method for interpreting the deep convolutional neural network model. We were able to identify a CD27- CD94+ CD8+ T cell population significantly associated with latent CMV infection, confirming the findings of previous studies. Finally, we provide a tutorial for creating, training, and interpreting the tailored deep learning model for cytometry data using Keras and TensorFlow (https://github.com/hzc36/DeepLearningCyTOF).
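To make the idea of permutation-based interpretation concrete, the following is a minimal sketch in plain Python. It is not the authors' method or code: `toy_model` is a hypothetical stand-in for a trained network, the three-marker sample is synthetic, and the function names are chosen only for illustration. The sketch shuffles one marker at a time across the cells of a sample and records how much the model output changes; a marker the model ignores should produce no change.

```python
import random


def toy_model(sample):
    """Hypothetical stand-in for a trained network: the fraction of cells
    that are high for marker 0 and low for marker 1."""
    hits = sum(1 for cell in sample if cell[0] > 0.5 and cell[1] < 0.5)
    return hits / len(sample)


def permute_marker(sample, marker, rng):
    """Return a copy of the sample with one marker shuffled across cells."""
    column = [cell[marker] for cell in sample]
    rng.shuffle(column)
    return [
        cell[:marker] + [value] + cell[marker + 1:]
        for cell, value in zip(sample, column)
    ]


def marker_importance(score_fn, sample, n_markers, n_repeats=20, seed=0):
    """Mean absolute change in the score when each marker is permuted."""
    rng = random.Random(seed)
    baseline = score_fn(sample)
    importance = []
    for marker in range(n_markers):
        changes = [
            abs(score_fn(permute_marker(sample, marker, rng)) - baseline)
            for _ in range(n_repeats)
        ]
        importance.append(sum(changes) / n_repeats)
    return importance


# Synthetic sample (an assumption for illustration): 200 cells, 3 markers.
# Markers 0 and 1 are anticorrelated; marker 2 is noise the model ignores.
data_rng = random.Random(1)
sample = []
for i in range(200):
    noise = data_rng.random()
    sample.append([1.0, 0.0, noise] if i % 2 == 0 else [0.0, 1.0, noise])

importance = marker_importance(toy_model, sample, n_markers=3)
print(importance)
```

In this toy setting, shuffling marker 0 or marker 1 breaks the joint pattern the stand-in model relies on, so both receive a nonzero importance, while marker 2 receives exactly zero. With a real trained model, `toy_model` would be replaced by a call to the model's prediction function.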
Pages: 21373-21380
Number of pages: 8