Recalibrating Fully Convolutional Networks With Spatial and Channel "Squeeze and Excitation" Blocks

被引:381
作者
Roy, Abhijit Guha [1 ,2 ]
Navab, Nassir [2 ,3 ]
Wachinger, Christian [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Child & Adolescent Psychiat, Lab Artificial Intelligence Med Imaging, D-80337 Munich, Germany
[2] Tech Univ Munich, Chair Comp Aided Med Procedures, D-85748 Munich, Germany
[3] Johns Hopkins Univ, CAMP, Baltimore, MD 21218 USA
关键词
Fully convolutional networks; image segmentation; squeeze & excitation; SEGMENTATION;
D O I
10.1109/TMI.2018.2867261
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In a wide range of semantic segmentation tasks, fully convolutional neural networks (F-CNNs) have been successfully leveraged to achieve the state-of-the-art performance. Architectural innovations of F-CNNs have mainly been on improving spatial encoding or network connectivity to aid gradient flow. In this paper, we aim toward an alternate direction of recalibrating the learned feature maps adaptively, boosting meaningful features while suppressing weak ones. The recalibration is achieved by simple computational blocks that can be easily integrated in F-CNNs architectures. We draw our inspiration from the recently proposed "squeeze and excitation" (SE) modules for channel recalibration for image classification. Toward this end, we introduce three variants of SE modules for segmentation: 1) squeezing spatially and exciting channel wise; 2) squeezing channel wise and exciting spatially; and 3) joint spatial and channel SE. We effectively incorporate the proposed SE blocks in three state-of-the-art F-CNNs and demonstrate a consistent improvement of segmentation accuracy on three challenging benchmark datasets. Importantly, SE blocks only lead to a minimal increase in model complexity of about 1.5%, while the Dice score increases by 4%-9% in the case of U-Net. Hence, we believe that SE blocks can be an integral part of future F-CNN architectures.
引用
收藏
页码:540 / 549
页数:10
相关论文
共 21 条
[1]  
[Anonymous], PROC CVPR IEEE
[2]  
[Anonymous], 2018, P INT C MED IM COMP
[3]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[4]  
[Anonymous], 2015, PROC CVPR IEEE
[5]  
[Anonymous], 2017, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2016.2644615
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   Kernel regression based segmentation of optical coherence tomography images with diabetic macular edema [J].
Chiu, Stephanie J. ;
Allingham, Michael J. ;
Mettu, Priyatham S. ;
Cousins, Scott W. ;
Izatt, Joseph A. ;
Farsiu, Sina .
BIOMEDICAL OPTICS EXPRESS, 2015, 6 (04) :1172-1194
[8]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[9]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[10]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269