Semi-supervised NPC segmentation with uncertainty and attention guided consistency

被引:52
作者
Hu, Lin [1 ]
Li, Jiaxin [2 ]
Peng, Xingchen [4 ]
Xiao, Jianghong [3 ]
Zhan, Bo [1 ]
Zu, Chen [5 ]
Wu, Xi [6 ]
Zhou, Jiliu [1 ,6 ]
Wang, Yan [1 ]
机构
[1] Sichuan Univ, Sch Comp Sci, Chengdu, Peoples R China
[2] Sichuan Univ, West China Hosp, Dept Liver Surg, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp, Canc Ctr, Dept Radiat Oncol, Chengdu, Peoples R China
[4] Sichuan Univ, West China Hosp, Canc Ctr, Dept Biotherapy, Chengdu, Peoples R China
[5] JD Com, Dept Risk Controlling Res, Beijing, Peoples R China
[6] Chengdu Univ Informat Technol, Sch Comp Sci, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised segmentation; Nasopharyngeal carcinoma (NPC); Attention guided consistency; Uncertainty guided consistency; NASOPHARYNGEAL CARCINOMA; CLASSIFICATION; RADIOTHERAPY; IMAGES;
D O I
10.1016/j.knosys.2021.108021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Segmentation of nasopharyngeal carcinoma (NPC) from computed tomography (CT) image is conducive to the clinical healthcare. Nevertheless, due to the large shape variations, boundary ambiguity, as well as the limited available annotations, NPC segmentation remains to be a challenging task. In this paper, we propose a two-stage semi-supervised segmentation framework for automatic NPC segmentation, which includes a region of interest (ROI) cropping stage and a semi-supervised segmentation stage. Specifically, considering the large individual variability of NPC tumors, we first employ a coarse-ResUnet (CRU) to extract the rough target areas from the CT images and thus obtain the cropped ROI images. Then, both the entire CT images and the corresponding ROI images are input to a self-attention embedded semi-supervised mean teacher (SSMT) model to generate the ROI-focused segmentation results. Here, to relieve the potential misdirection from the teacher model caused by annotation scarcity, we introduce the uncertainty estimation to guide the student model to gradually learn the reliable predictions from the teacher model. Meanwhile, to fully explore the inherent semantic information of unlabeled data, we also encourage the attention maps from these two models to be consistent at feature level. Furthermore, we design a refinement procedure and reuse the ROI attention maps generated by the well-trained SSMT to retrain the first stage, improving the quality of ROI images. The updated ROI images are further leveraged to refine SSMT to predict the final segmentation results. Note that the uncertainty estimation and the attention maps reveal the confidence and attention of the model for the intermediate features respectively, which can provide explainable evaluation to the quality of segmentation results. Experimental results on an in-house NPC dataset and a public 2017 ACDC dataset demonstrate that our method is superior to other semi-supervised segmentation methods and also has good generalization ability.(C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 80 条
[11]   Semi-supervised Brain Lesion Segmentation with an Adapted Mean Teacher Model [J].
Cui, Wenhui ;
Liu, Yanlin ;
Li, Yuxing ;
Guo, Menghao ;
Li, Yiming ;
Li, Xiuli ;
Wang, Tianle ;
Zeng, Xiangzhu ;
Ye, Chuyang .
INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2019, 2019, 11492 :554-565
[12]   Semi-supervised classification via simultaneous label and discriminant embedding estimation [J].
Dornaika, F. ;
Baradaaji, A. ;
El Traboulsi, Y. .
INFORMATION SCIENCES, 2021, 546 (546) :146-165
[13]   Random walks in directed hypergraphs and application to semi-supervised image segmentation [J].
Ducournau, Aurelien ;
Bretto, Alain .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 120 :91-102
[14]   Semi-automatic delitneation using weighted CT-MRI registered images for radiotherapy of nasopharyngeal cancer [J].
Fitton, I. ;
Cornelissen, S. A. P. ;
Duppen, J. C. ;
Steenbakkers, R. J. H. M. ;
Peeters, S. T. H. ;
Hoebers, F. J. P. ;
Kaanders, J. H. A. M. ;
Nowak, P. J. C. M. ;
Rasch, C. R. N. ;
van Herk, M. .
MEDICAL PHYSICS, 2011, 38 (08) :4662-4666
[15]   Semi-supervised classification by graph p-Laplacian convolutional networks [J].
Fu, Sichao ;
Liu, Weifeng ;
Zhang, Kai ;
Zhou, Yicong ;
Tao, Dapeng .
INFORMATION SCIENCES, 2021, 560 :92-106
[16]   HesGCN: Hessian graph convolutional networks for semi-supervised classification [J].
Fu, Sichao ;
Liu, Weifeng ;
Tao, Dapeng ;
Zhou, Yicong ;
Nie, Liqiang .
INFORMATION SCIENCES, 2020, 514 :484-498
[17]   More Unlabelled Data or Label More Data? A Study on Semi-supervised Laparoscopic Image Segmentation [J].
Fu, Yunguan ;
Robu, Maria R. ;
Koo, Bongjin ;
Schneider, Crispin ;
van Laarhoven, Stijn ;
Stoyanov, Danail ;
Davidson, Brian ;
Clarkson, Matthew J. ;
Hu, Yipeng .
DOMAIN ADAPTATION AND REPRESENTATION TRANSFER AND MEDICAL IMAGE LEARNING WITH LESS LABELS AND IMPERFECT DATA, DART 2019, MIL3ID 2019, 2019, 11795 :173-180
[18]   Multiscale fused network with additive channel-spatial attention for image segmentation [J].
Gao, Chengling ;
Ye, Hailiang ;
Cao, Feilong ;
Wen, Chenglin ;
Zhang, Qinghua ;
Zhang, Feng .
KNOWLEDGE-BASED SYSTEMS, 2021, 214
[19]  
Gao Yan-Yu, 2010, Acta Automatica Sinica, V36, P960, DOI 10.3724/SP.J.1004.2010.00960
[20]  
Grandvalet Y., 2005, CAP