Chronic Lymphocytic Leukemia Progression Diagnosis with Intrinsic Cellular Patterns via Unsupervised Clustering

被引:10
作者
Chen, Pingjun [1 ]
El Hussein, Siba [2 ,3 ]
Xing, Fuyong [4 ]
Aminu, Muhammad [1 ]
Kannapiran, Aparajith [5 ]
Hazle, John D. [1 ]
Medeiros, L. Jeffrey [3 ]
Wistuba, Ignacio I. [6 ]
Jaffray, David [1 ,7 ]
Khoury, Joseph D. [3 ]
Wu, Jia [1 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Imaging Phys, Houston, TX 77030 USA
[2] Univ Rochester, Dept Pathol, Med Ctr, Rochester, NY 14642 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Hematopathol, Houston, TX 77030 USA
[4] Univ Colorado, Dept Biostat & Informat, Anschutz Med Campus, Aurora, CO 80045 USA
[5] Univ Texas Austin, Dept Biomed Engn, Austin, TX 78705 USA
[6] Univ Texas MD Anderson Canc Ctr, Dept Translat Mol Pathol, Houston, TX 77030 USA
[7] Univ Texas MD Anderson Canc Ctr, Dept Technol & Digital Off, Houston, TX 77030 USA
关键词
chronic lymphocytic leukemia (CLL); accelerated CLL; Richter transformation (RT); large cell transformation; disease progression; cellular feature engineering; unsupervised clustering; feature fusion; feature selection; DIAGNOSTICALLY RELEVANT REGIONS; BREAST-CANCER; WHOLE; SEGMENTATION; LOCALIZATION; LYMPHOMA;
D O I
10.3390/cancers14102398
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Simple Summary Distinguishing between chronic lymphocytic leukemia (CLL), accelerated CLL (aCLL), and full-blown transformation to diffuse large B-cell lymphoma (Richter transformation; RT) has significant clinical implications. Identifying cellular phenotypes via unsupervised clustering provides the most robust analytic performance in analyzing digitized pathology slides. This study serves as a proof of concept that using an unsupervised machine learning scheme can enhance diagnostic accuracy. Identifying the progression of chronic lymphocytic leukemia (CLL) to accelerated CLL (aCLL) or transformation to diffuse large B-cell lymphoma (Richter transformation; RT) has significant clinical implications as it prompts a major change in patient management. However, the differentiation between these disease phases may be challenging in routine practice. Unsupervised learning has gained increased attention because of its substantial potential in data intrinsic pattern discovery. Here, we demonstrate that cellular feature engineering, identifying cellular phenotypes via unsupervised clustering, provides the most robust analytic performance in analyzing digitized pathology slides (accuracy = 0.925, AUC = 0.978) when compared to alternative approaches, such as mixed features, supervised features, unsupervised/mixed/supervised feature fusion and selection, as well as patch-based convolutional neural network (CNN) feature extraction. We further validate the reproducibility and robustness of unsupervised feature extraction via stability and repeated splitting analysis, supporting its utility as a diagnostic aid in identifying CLL patients with histologic evidence of disease progression. The outcome of this study serves as proof of principle using an unsupervised machine learning scheme to enhance the diagnostic accuracy of the heterogeneous histology patterns that pathologists might not easily see.
引用
收藏
页数:16
相关论文
共 63 条
[1]   Computational pathology definitions, best practices, and recommendations for regulatory guidance: a white paper from the Digital Pathology Association [J].
Abels, Esther ;
Pantanowitz, Liron ;
Aeffner, Famke ;
Zarella, Mark D. ;
van der Laak, Jeroen ;
Bui, Marilyn M. ;
Vemuri, Venkata N. P. ;
Parwani, Anil V. ;
Gibbs, Jeff ;
Agosto-Arroyo, Emmanuel ;
Beck, Andrew H. ;
Kozlowski, Cleopatra .
JOURNAL OF PATHOLOGY, 2019, 249 (03) :286-294
[2]   Histologic transformation of chronic lymphocytic leukemia/small lymphocytic lymphoma [J].
Agbay, Rose Lou Marie C. ;
Jain, Nitin ;
Loghavi, Sanam ;
Medeiros, L. Jeffrey ;
Khoury, Joseph D. .
AMERICAN JOURNAL OF HEMATOLOGY, 2016, 91 (10) :1036-1043
[3]  
Austin PC, 2008, STAT MED, V27, P2037, DOI 10.1002/sim.3150
[4]  
Bradski G, 2000, DR DOBBS J, V25, P120
[5]   Data-driven approach for creating synthetic electronic medical records [J].
Buczak, Anna L. ;
Babin, Steven ;
Moniz, Linda .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2010, 10
[6]   Automatic whole slide pathology image diagnosis framework via unit stochastic selection and attention fusion [J].
Chen, Pingjun ;
Liang, Yun ;
Shi, Xiaoshuang ;
Yang, Lin ;
Gader, Paul .
NEUROCOMPUTING, 2021, 453 :312-325
[7]   Interactive thyroid whole slide image diagnostic system using deep representation [J].
Chen, Pingjun ;
Shi, Xiaoshuang ;
Liang, Yun ;
Li, Yuan ;
Yang, Lin ;
Gader, Paul D. .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 195
[8]  
Chen PJ, 2015, I S BIOMED IMAGING, P633, DOI 10.1109/ISBI.2015.7163953
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]   Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning [J].
Coudray, Nicolas ;
Ocampo, Paolo Santiago ;
Sakellaropoulos, Theodore ;
Narula, Navneet ;
Snuderl, Matija ;
Fenyo, David ;
Moreira, Andre L. ;
Razavian, Narges ;
Tsirigos, Aristotelis .
NATURE MEDICINE, 2018, 24 (10) :1559-+