CrossFormer: Multi-scale cross-attention for polyp segmentation

Citations: 4
Authors
Chen, Lifang [1]
Ge, Hongze [2]
Li, Jiawei [3]
Affiliations
[1] JiangNan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Jiangsu, Peoples R China
[2] JiangNan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Jiangsu, Peoples R China
[3] JiangNan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Jiangsu, Peoples R China
Keywords
channel enhancement; colorectal cancer; cross-attention; multi-scale; polyp segmentation; validation
DOI
10.1049/ipr2.12875
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Colonoscopy is a common method for the early detection of colorectal cancer (CRC), and segmenting colonoscopy images is valuable for examining lesions. However, colonic polyps vary in size and shape, and their morphological characteristics resemble those of mucosa, which makes accurate segmentation difficult. To address this, a novel neural network architecture called CrossFormer is proposed. CrossFormer combines cross-attention with multi-scale methods to achieve high-precision automatic segmentation of polyps. A multi-scale cross-attention module is proposed to enhance the ability to extract context information and learn different features. In addition, a novel channel enhancement module is used to focus on useful channel information. The model is trained and tested on the Kvasir and CVC-ClinicDB datasets. Experimental results show that the proposed model outperforms most existing polyp segmentation methods.
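The abstract names two building blocks, cross-attention across scales and channel enhancement, without giving their details. The sketch below is only a generic illustration of those two ideas, not the authors' CrossFormer modules: it assumes standard scaled dot-product attention in which tokens from a fine-scale feature map query tokens from a coarse-scale map, followed by a squeeze-and-excitation-style channel gate with no learned weights. The function names `cross_attention` and `channel_gate` and the toy inputs are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention where queries and keys/values
    come from different token sets (here: different scales)."""
    d = len(queries[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[c] for w, v in zip(weights, values))
                    for c in range(len(values[0]))])
    return out


def channel_gate(tokens):
    """Assumed simplification of channel enhancement: average each
    channel over all tokens, squash with a sigmoid, rescale the channel."""
    n, channels = len(tokens), len(tokens[0])
    means = [sum(t[c] for t in tokens) / n for c in range(channels)]
    gates = [1.0 / (1.0 + math.exp(-m)) for m in means]
    return [[t[c] * gates[c] for c in range(channels)] for t in tokens]


# Toy data: 4 tokens from a fine-scale map, 2 tokens from a coarse-scale map.
fine = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
coarse = [[1.0, 0.0], [0.0, 1.0]]

fused = cross_attention(fine, coarse, coarse)  # fine tokens attend to coarse tokens
enhanced = channel_gate(fused)                 # per-channel reweighting
```

The output keeps one token per fine-scale query, so the result stays at the fine resolution while mixing in coarse-scale context. A real model would add learned projections for queries, keys, and values, and learned weights in the channel gate.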
Pages: 3441-3452
Page count: 12