Hematologist-Level Classification of Mature B-Cell Neoplasm Using Deep Learning on Multiparameter Flow Cytometry Data

被引:37
作者
Zhao, Max [1 ,2 ]
Mallesh, Nanditha [1 ]
Hoellein, Alexander [3 ,4 ]
Schabath, Richard [3 ,5 ]
Haferlach, Claudia [3 ]
Haferlach, Torsten [3 ]
Elsner, Franz [6 ]
Lueling, Hannes [6 ]
Krawitz, Peter [1 ]
Kern, Wolfgang [3 ]
机构
[1] Univ Bonn, Inst Genom Stat & Bioinformat, Bonn, Germany
[2] Charite, Inst Human Genet & Med Genet, Berlin, Germany
[3] MLL Munich Leukemia Lab, Munich, Germany
[4] Red Cross Hosp Munich, Munich, Germany
[5] Onkol Praxis Berlin Mitte, Berlin, Germany
[6] Res Mech GmbH, Munich, Germany
关键词
deep learning; self-organizing maps; non-Hodgkin lymphoma; SELF-ORGANIZING MAPS;
D O I
10.1002/cyto.a.24159
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The wealth of information captured by multiparameter flow cytometry (MFC) can be analyzed by recent methods of computer vision when represented as a single image file. We therefore transformed MFC raw data into a multicolor 2D image by a self-organizing map and classified this representation using a convolutional neural network. By this means, we built an artificial intelligence that is not only able to distinguish diseased from healthy samples, but it can also differentiate seven subtypes of mature B-cell neoplasm. We trained our model with 18,274 cases including chronic lymphocytic leukemia and its precursor monoclonal B-cell lymphocytosis, marginal zone lymphoma, mantle cell lymphoma, prolymphocytic leukemia, follicular lymphoma, hairy cell leukemia, lymphoplasmacytic lymphoma and achieved a weighted F1 score of 0.94 on a separate test set of 2,348 cases. Furthermore, we estimated the trustworthiness of a classification and could classify 70% of all cases with a confidence of 0.95 and higher. Our performance analyses indicate that particularly for rare subtypes further improvement can be expected when even more samples are available for training. (c) 2020 The Authors. Cytometry Part A published by Wiley Periodicals, Inc. on behalf of International Society for Advancement of Cytometry.
引用
收藏
页码:1073 / 1080
页数:8
相关论文
共 23 条
[1]  
Aghaeepour N, 2013, NAT METHODS, V10, P228, DOI [10.1038/NMETH.2365, 10.1038/nmeth.2365]
[2]  
[Anonymous], 2014, WORKSHOP INT C LEARN, DOI DOI 10.48550/ARXIV.1312.6034
[3]  
[Anonymous], 2015, Tech. Rep.
[4]   Lymphoplasmacytic Lymphoma and Marginal Zone Lymphoma in the Bone Marrow Paratrabecular Involvement as an Important Distinguishing Feature [J].
Bassarova, Assia ;
Troen, Gunhild ;
Spetalen, Signe ;
Micci, Francesca ;
Tierens, Anne ;
Delabie, Jan .
AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2015, 143 (06) :797-806
[5]   INTRODUCTION TO FLOW-CYTOMETRY DATA FILE STANDARD [J].
DEAN, PN ;
BAGWELL, CB ;
LINDMO, T ;
MURPHY, RF ;
SALZMAN, GC .
CYTOMETRY, 1990, 11 (03) :321-322
[6]   A guide to deep learning in healthcare [J].
Esteva, Andre ;
Robicquet, Alexandre ;
Ramsundar, Bharath ;
Kuleshov, Volodymyr ;
DePristo, Mark ;
Chou, Katherine ;
Cui, Claire ;
Corrado, Greg ;
Thrun, Sebastian ;
Dean, Jeff .
NATURE MEDICINE, 2019, 25 (01) :24-29
[7]   Stochastic on-line algorithm versus batch algorithm for quantization and self organizing maps [J].
Fort, JC ;
Cottrell, M ;
Letremy, P .
NEURAL NETWORKS FOR SIGNAL PROCESSING XI, 2001, :43-52
[8]  
GOODFELLOW I, 2016, DEEP LEARNING, V801
[9]  
GORMAN C, 2019, MULTIGPU IMPLEMENTAT
[10]   Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New Technique [J].
Greenspan, Hayit ;
van Ginneken, Bram ;
Summers, Ronald M. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1153-1159