Progressively Normalized Self-Attention Network for Video Polyp Segmentation

Cited by: 116
Authors
Ji, Ge-Peng [1, 2]
Chou, Yu-Cheng [2 ]
Fan, Deng-Ping [1 ]
Chen, Geng [1 ]
Fu, Huazhu [1 ]
Jha, Debesh [3 ]
Shao, Ling [1 ]
Affiliations
[1] Inception Institute of Artificial Intelligence (IIAI), Abu Dhabi, United Arab Emirates
[2] Wuhan University, Wuhan, People's Republic of China
[3] SimulaMet, Oslo, Norway
Source
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I | 2021 / Vol. 12901
Keywords
Normalized self-attention; Polyp segmentation; Colonoscopy;
DOI
10.1007/978-3-030-87193-2_14
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Existing video polyp segmentation (VPS) models typically employ convolutional neural networks (CNNs) to extract features. However, due to their limited receptive fields, CNNs cannot fully exploit the global temporal and spatial information in successive video frames, resulting in false positive segmentation results. In this paper, we propose the novel PNS-Net (Progressively Normalized Self-attention Network), which can efficiently learn representations from polyp videos with real-time speed (~140 fps) on a single RTX 2080 GPU and no post-processing. Our PNS-Net is based solely on a basic normalized self-attention block, equipping with recurrence and CNNs entirely. Experiments on challenging VPS datasets demonstrate that the proposed PNS-Net achieves state-of-the-art performance. We also conduct extensive experiments to study the effectiveness of the channel split, soft-attention, and progressive learning strategy. We find that our PNS-Net works well under different settings, making it a promising solution to the VPS task.
Pages: 142-152
Page count: 11