Focus U-Net: A novel dual attention-gated CNN for polyp segmentation during colonoscopy

被引:100
作者
Yeung, Michael [1 ,2 ]
Sala, Evis [1 ,3 ]
Schonlieb, Carola-Bibiane [4 ]
Rundo, Leonardo [1 ,3 ]
机构
[1] Univ Cambridge, Dept Radiol, Box 218,Cambridge Biomed Campus, Cambridge CB2 0QQ, England
[2] Univ Cambridge, Sch Clin Med, Cambridge CB2 0SP, England
[3] Univ Cambridge, Canc Res UK Cambridge Ctr, Cambridge CB2 0RE, England
[4] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB3 0WA, England
基金
英国工程与自然科学研究理事会; 英国惠康基金; 欧盟地平线“2020”; 英国科学技术设施理事会;
关键词
Polyp segmentation; Colorectal cancer; Colonoscopy; Computer-aided diagnosis; Focus U-Net; Attention mechanisms; Loss function; COLORECTAL-CANCER; MISS RATE; NETWORKS; RISK;
D O I
10.1016/j.compbiomed.2021.104815
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Colonoscopy remains the gold-standard screening for colorectal cancer. However, significant miss rates for polyps have been reported, particularly when there are multiple small adenomas. This presents an opportunity to leverage computer-aided systems to support clinicians and reduce the number of polyps missed. Method: In this work we introduce the Focus U-Net, a novel dual attention-gated deep neural network, which combines efficient spatial and channel-based attention into a single Focus Gate module to encourage selective learning of polyp features. The Focus U-Net incorporates several further architectural modifications, including the addition of short-range skip connections and deep supervision. Furthermore, we introduce the Hybrid Focal loss, a new compound loss function based on the Focal loss and Focal Tversky loss, designed to handle classimbalanced image segmentation. For our experiments, we selected five public datasets containing images of polyps obtained during optical colonoscopy: CVC-ClinicDB, Kvasir-SEG, CVC-ColonDB, ETIS-Larib PolypDB and EndoScene test set. We first perform a series of ablation studies and then evaluate the Focus U-Net on the CVCClinicDB and Kvasir-SEG datasets separately, and on a combined dataset of all five public datasets. To evaluate model performance, we use the Dice similarity coefficient (DSC) and Intersection over Union (IoU) metrics. Results: Our model achieves state-of-the-art results for both CVC-ClinicDB and Kvasir-SEG, with a mean DSC of 0.941 and 0.910, respectively. When evaluated on a combination of five public polyp datasets, our model similarly achieves state-of-the-art results with a mean DSC of 0.878 and mean IoU of 0.809, a 14% and 15% improvement over the previous state-of-the-art results of 0.768 and 0.702, respectively. Conclusions: This study shows the potential for deep learning to provide fast and accurate polyp segmentation results for use during colonoscopy. The Focus U-Net may be adapted for future use in newer non-invasive colorectal cancer screening and more broadly to other biomedical image segmentation tasks similarly involving class imbalance and requiring efficiency.
引用
收藏
页数:11
相关论文
共 83 条
[31]   A Comprehensive Study on Colorectal Polyp Segmentation With ResUNet plus plus , Conditional Random Field and Test-Time Augmentation [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Johansen, Dag ;
de Lange, Thomas ;
Johansen, Havard D. ;
Halvorsen, Pal ;
Riegler, Michael A. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (06) :2029-2040
[32]   DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation [J].
Jha, Debesh ;
Riegler, Michael A. ;
Johansen, Dag ;
Halvorsen, Pal ;
Johansen, Havard D. .
2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, :558-564
[33]   Kvasir-SEG: A Segmented Polyp Dataset [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Riegler, Michael A. ;
Halvorsen, Pal ;
de Lange, Thomas ;
Johansen, Dag ;
Johansen, Havard D. .
MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 :451-462
[34]   ResUNet plus plus : An Advanced Architecture for Medical Image Segmentation [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Riegler, Michael A. ;
Johansen, Dag ;
de Lange, Thomas ;
Halvorsen, Pal ;
Johansen, Havard D. .
2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019), 2019, :225-230
[35]  
Ji G.-P., ARXIV PREPRINT ARXIV
[36]   Computer-aided tumor detection in endoscopic video using color wavelet features [J].
Karkanis, SA ;
Iakovidis, DK ;
Maroulis, DE ;
Karras, DA ;
Tzivras, M .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2003, 7 (03) :141-152
[37]  
Kim NH, 2017, INTEST RES, V15, P411, DOI 10.5217/ir.2017.15.3.411
[38]  
Krishnan SM, 1998, P ANN INT IEEE EMBS, V20, P895, DOI 10.1109/IEMBS.1998.745583
[39]   A Systematic Comparison of Microsimulation Models of Colorectal Cancer: The Role of Assumptions about Adenoma Progression [J].
Kuntz, Karen M. ;
Lansdorp-Vogelaar, Iris ;
Rutter, Carolyn M. ;
Knudsen, Amy B. ;
van Ballegooijen, Marjolein ;
Savarino, James E. ;
Feuer, Eric J. ;
Zauber, Ann G. .
MEDICAL DECISION MAKING, 2011, 31 (04) :530-539
[40]   Application of Artificial Intelligence to Gastroenterology and Hepatology [J].
Le Berre, Catherine ;
Sandborn, William J. ;
Aridhi, Sabeur ;
Devignes, Marie-Dominique ;
Fournier, Laure ;
Smail-Tabbone, Malika ;
Danese, Silvio ;
Peyrin-Biroulet, Laurent .
GASTROENTEROLOGY, 2020, 158 (01) :76-+