Weakly supervised instance learning for thyroid malignancy prediction from whole slide cytopathology images

被引:53
作者
Dov, David [1 ]
Kovalsky, Shahar Z. [2 ]
Assaad, Serge [1 ]
Cohen, Jonathan [3 ]
Range, Danielle Elliott [4 ]
Pendse, Avani A. [4 ]
Henao, Ricardo [1 ]
Carin, Lawrence [1 ]
机构
[1] Duke Univ, Dept Elect & Comp Engn, Durham, NC 27708 USA
[2] Duke Univ, Dept Math, Durham, NC 27708 USA
[3] Duke Univ, Med Ctr, Dept Surg, Durham, NC 27710 USA
[4] Duke Univ, Med Ctr, Dept Pathol, Durham, NC 27710 USA
关键词
Medical image analysis; Multiple instance learning; AI; Deep learning; Healthcare; Pathology; Human level; Thyroid; ARTIFICIAL-INTELLIGENCE; FOLLICULAR LESIONS; NEURAL-NETWORKS; DIAGNOSIS; SYSTEM; CLASSIFICATION; NODULES; BENIGN;
D O I
10.1016/j.media.2020.101814
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider machine-learning-based thyroid-malignancy prediction from cytopathology whole-slide images (WSI). Multiple instance learning (MIL) approaches, typically used for the analysis of WSIs, divide the image (bag) into patches (instances), which are used to predict a single bag-level label. These approaches perform poorly in cytopathology slides due to a unique bag structure: sparsely located informative instances with varying characteristics of abnormality. We address these challenges by considering multiple types of labels: bag-level malignancy and ordered diagnostic scores, as well as instance-level informativeness and abnormality labels. We study their contribution beyond the MIL setting by proposing a maximum likelihood estimation (MLE) framework, from which we derive a two-stage deep-learning-based algorithm. The algorithm identifies informative instances and assigns them local malignancy scores that are incorporated into a global malignancy prediction. We derive a lower bound of the MLE, leading to an improved training strategy based on weak supervision, that we motivate through statistical analysis. The lower bound further allows us to extend the proposed algorithm to simultaneously predict multiple bag and instance-level labels from a single output of a neural network. Experimental results demonstrate that the proposed algorithm provides competitive performance compared to several competing methods, achieves (expert) human-level performance, and allows augmentation of human decisions. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 43 条
[1]  
Agresti A., 2003, CATEGORICAL DATA ANA, P482
[2]   Single- vs. multiple-instance classification [J].
Alpaydin, Ethem ;
Cheplygina, Veronika ;
Loog, Marco ;
Tax, David M. J. .
PATTERN RECOGNITION, 2015, 48 (09) :2831-2838
[3]  
[Anonymous], 2017, Deep learning is robust to massive label noise
[4]  
[Anonymous], PROC CVPR IEEE
[5]  
[Anonymous], 2002, STAT INFERENCE
[6]   Ordinal Regression Methods: Survey and Experimental Study [J].
Antonio Gutierrez, Pedro ;
Perez-Ortiz, Maria ;
Sanchez-Monedero, Javier ;
Fernandez-Navarro, Francisco ;
Hervas-Martinez, Cesar .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) :127-146
[7]  
Aschebrook-Kilfoy B., 2013, CANC EPIDEMIOL PREVE
[8]   Clinical-grade computational pathology using weakly supervised deep learning on whole slide images [J].
Campanella, Gabriele ;
Hanna, Matthew G. ;
Geneslaw, Luke ;
Miraflor, Allen ;
Silva, Vitor Werneck Krauss ;
Busam, Klaus J. ;
Brogi, Edi ;
Reuter, Victor E. ;
Klimstra, David S. ;
Fuchs, Thomas J. .
NATURE MEDICINE, 2019, 25 (08) :1301-+
[9]   Not-so-supervised: A survey of semi-supervised, multi-instance, and transfer learning in medical image analysis [J].
Cheplygina, Veronika ;
de Bruijne, Marleen ;
Pluim, Josien P. W. .
MEDICAL IMAGE ANALYSIS, 2019, 54 :280-296
[10]   The Bethesda System for Reporting Thyroid Cytopathology [J].
Cibas, Edmund S. ;
Ali, Syed Z. .
AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2009, 132 (05) :658-665