Using Guided Self-Attention with Local Information for Polyp Segmentation

被引:19
作者
Cai, Linghan [1 ]
Wu, Meijing [2 ]
Chen, Lijiang [1 ]
Bai, Wenpei [2 ]
Yang, Min [2 ]
Lyu, Shuchang [1 ]
Zhao, Qi [1 ]
机构
[1] Beihang Univ, Inst Elect Informat Engn, Beijing, Peoples R China
[2] Capital Med Univ, Beijing Shijitan Hosp, Dept Gynecol & Obstet, Beijing, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV | 2022年 / 13434卷
基金
中国国家自然科学基金;
关键词
Colorectal cancer; Polyp segmentation; Transformer; Local-to-Global mechanism; PP-guided self-attention;
D O I
10.1007/978-3-031-16440-8_60
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatic and precise polyp segmentation is crucial for the early diagnosis of colorectal cancer. Existing polyp segmentation methods are mostly based on convolutional neural networks (CNNs), which usually utilize the global features to enhance local features through well-designed modules, thereby dealing with the diversity of polyps. Although CNN-based methods achieve impressive results, they are powerless to model explicit long-range relations, which limits their performance. Different from CNN, Transformer has a strong capability of modeling long-range relations owing to self-attention. However, self-attention always spreads attention to unexpected regions and the Transformer's ability of local feature extraction is insufficient, resulting in inaccurate localization and fuzzy boundary. To address these issues, we propose PPFormer for accurate polyp segmentation. Specifically, we first adopt a shallow CNN encoder and a deep Transformer encoder to extract rich features. In the decoder, we present the PP-guided self-attention that uses prediction maps to guide self-attention to focus on the hard regions so as to enhance the model's perception of polyp boundary. Meanwhile, the Local-to-Global mechanism is designed to encourage the Transformer to capture more information in the local-window for better polyp localization. Extensive experiments on five challenging datasets show that PPFormer outperforms other advanced methods and achieves state-of-the-art results with six metrics, i.e. mean Dice and mean IoU.
引用
收藏
页码:629 / 638
页数:10
相关论文
共 25 条
[1]  
Akbari M, 2018, IEEE ENG MED BIO, P69, DOI 10.1109/EMBC.2018.8512197
[2]   WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians [J].
Bernal, Jorge ;
Javier Sanchez, F. ;
Fernandez-Esparrach, Gloria ;
Gil, Debora ;
Rodriguez, Cristina ;
Vilarino, Fernando .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 :99-111
[3]  
Chen J., 2021, arXiv, DOI 10.48550/arXiv:2102.04306
[4]  
Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
[5]  
Dosovitskiy A., 2021, P ICLR 2021, P1, DOI [10.48550/arXiv.2010.11929, DOI 10.48550/ARXIV.2010.11929]
[6]   Structure-measure: A New Way to Evaluate Foreground Maps [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Yun ;
Li, Tao ;
Borji, Ali .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567
[7]  
Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698
[8]   A Survey on Vision Transformer [J].
Han, Kai ;
Wang, Yunhe ;
Chen, Hanting ;
Chen, Xinghao ;
Guo, Jianyuan ;
Liu, Zhenhua ;
Tang, Yehui ;
Xiao, An ;
Xu, Chunjing ;
Xu, Yixing ;
Yang, Zhaohui ;
Zhang, Yiman ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110
[9]   A Comprehensive Study on Colorectal Polyp Segmentation With ResUNet plus plus , Conditional Random Field and Test-Time Augmentation [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Johansen, Dag ;
de Lange, Thomas ;
Johansen, Havard D. ;
Halvorsen, Pal ;
Riegler, Michael A. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (06) :2029-2040
[10]   Kvasir-SEG: A Segmented Polyp Dataset [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Riegler, Michael A. ;
Halvorsen, Pal ;
de Lange, Thomas ;
Johansen, Dag ;
Johansen, Havard D. .
MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 :451-462