Unsupervised machine learning for identifying important visual features through bag-of-words using histopathology data from chronic kidney disease

被引:21
作者
Lee, Joonsang [1 ]
Warner, Elisa [1 ]
Shaikhouni, Salma [3 ]
Bitzer, Markus [3 ]
Kretzler, Matthias [3 ]
Gipson, Debbie [4 ]
Pennathur, Subramaniam [3 ]
Bellovich, Keith [5 ]
Bhat, Zeenat [6 ]
Gadegbeku, Crystal [7 ]
Massengill, Susan [8 ]
Perumal, Kalyani [9 ]
Saha, Jharna [2 ]
Yang, Yingbao [2 ]
Luo, Jinghui [2 ]
Zhang, Xin [1 ]
Mariani, Laura [3 ]
Hodgin, Jeffrey B. [2 ]
Rao, Arvind [1 ,10 ,11 ,12 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Internal Med Nephrol, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Dept Pediat, Pediat Nephrol, Ann Arbor, MI 48109 USA
[5] St Clair Nephrol Res, Dept Internal Med, Nephrol, Detroit, MI USA
[6] Wayne State Univ, Dept Internal Med Nephrol, Detroit, MI USA
[7] Cleveland Clin, Dept Internal Med Nephrol, Cleveland, OH 44106 USA
[8] Levine Childrens Hosp, Dept Pediat, Pediat Nephrol, Charlotte, NC USA
[9] Dept JH Stroger Hosp, Dept Internal Med, Nephrol, Chicago, IL USA
[10] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[11] Univ Michigan, Dept Radiat Oncol, Ann Arbor, MI 48109 USA
[12] Univ Michigan, Dept Biomed Engn, Ann Arbor, MI 48109 USA
关键词
DIGITAL PATHOLOGY; IMAGE; SEGMENTATION; PROGRESSION; EQUATION; GFR;
D O I
10.1038/s41598-022-08974-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pathologists use visual classification to assess patient kidney biopsy samples when diagnosing the underlying cause of kidney disease. However, the assessment is qualitative, or semi-quantitative at best, and reproducibility is challenging. To discover previously unknown features which predict patient outcomes and overcome substantial interobserver variability, we developed an unsupervised bag-of-words model. Our study applied to the C-PROBE cohort of patients with chronic kidney disease (CKD). 107,471 histopathology images were obtained from 161 biopsy cores and identified important morphological features in biopsy tissue that are highly predictive of the presence of CKD both at the time of biopsy and in one year. To evaluate the performance of our model, we estimated the AUC and its 95% confidence interval. We show that this method is reliable and reproducible and can achieve 0.93 AUC at predicting glomerular filtration rate at the time of biopsy as well as predicting a loss of function at one year. Additionally, with this method, we ranked the identified morphological features according to their importance as diagnostic markers for chronic kidney disease. In this study, we have demonstrated the feasibility of using an unsupervised machine learning method without human input in order to predict the level of kidney function in CKD. The results from our study indicate that the visual dictionary, or visual image pattern, obtained from unsupervised machine learning can predict outcomes using machine-derived values that correspond to both known and unknown clinically relevant features.
引用
收藏
页数:13
相关论文
共 48 条
[1]   QuPath: Open source software for digital pathology image analysis [J].
Bankhead, Peter ;
Loughrey, Maurice B. ;
Fernandez, Jose A. ;
Dombrowski, Yvonne ;
Mcart, Darragh G. ;
Dunne, Philip D. ;
McQuaid, Stephen ;
Gray, Ronan T. ;
Murray, Liam J. ;
Coleman, Helen G. ;
James, Jacqueline A. ;
Salto-Tellez, Manuel ;
Hamilton, Peter W. .
SCIENTIFIC REPORTS, 2017, 7
[2]   Digital pathology in nephrology clinical trials, research, and pathology practice [J].
Barisoni, Laura ;
Hodgin, Jeffrey B. .
CURRENT OPINION IN NEPHROLOGY AND HYPERTENSION, 2017, 26 (06) :450-459
[3]   Digital pathology imaging as a novel platform for standardization and globalization of quantitative nephropathology [J].
Barisoni, Laura ;
Gimpel, Charlotte ;
Kain, Renate ;
Laurinavicius, Arvydas ;
Bueno, Gloria ;
Zeng, Caihong ;
Liu, Zhihong ;
Schaefer, Franz ;
Kretzler, Matthias ;
Holzman, Lawrence B. ;
Hewitt, Stephen M. .
CLINICAL KIDNEY JOURNAL, 2017, 10 (02) :176-187
[4]   Emerging Themes in Image Informatics and Molecular Analysis for Digital Pathology [J].
Bhargava, Rohit ;
Madabhushi, Anant .
ANNUAL REVIEW OF BIOMEDICAL ENGINEERING, VOL 18, 2016, 18 :387-412
[5]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[6]   Glomerulosclerosis identification in whole slide images using semantic segmentation [J].
Bueno, Gloria ;
Milagro Fernandez-Carrobles, M. ;
Gonzalez-Lopez, Lucia ;
Deniz, Oscar .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 184
[7]  
Centers for Disease Control and Prevention, CHRON KIDN DIS SURV
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   Performance of Deep Learning Architectures and Transfer Learning for Detecting Glaucomatous Optic Neuropathy in Fundus Photographs [J].
Christopher, Mark ;
Beighith, Akram ;
Bowd, Christopher ;
Proudfoot, James A. ;
Goldbaum, Michael H. ;
Weinreb, Robert N. ;
Girkin, Christopher A. ;
Liebmann, Jeffrey M. ;
Zangwill, Linda M. .
SCIENTIFIC REPORTS, 2018, 8
[10]   Using machine learning models to predict the initiation of renal replacement therapy among chronic kidney disease patients [J].
Dovgan, Erik ;
Gradisek, Anton ;
Lustrek, Mitja ;
Uddin, Mohy ;
Nursetyo, Aldilas Achmad ;
Annavarajula, Sashi Kiran ;
Li, Yu-Chuan ;
Syed-Abdul, Shabbir .
PLOS ONE, 2020, 15 (06)