SRM : A Style-based Recalibration Module for Convolutional Neural Networks

被引：263

作者：

Lee, HyunJae ^{[1
]}

Kim, Hyo-Eun ^{[1
]}

Nam, Hyeonseob ^{[1
]}

机构：

[1] Lunit Inc, Seoul, South Korea

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年

关键词：

D O I：

10.1109/ICCV.2019.00194

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Following the advance of style transfer with Convolutional Neural Networks (CNNs), the role of styles in CNNs has drawn growing attention from a broader perspective. In this paper, we aim to fully leverage the potential of styles to improve the performance of CNNs in general vision tasks. We propose a Style-based Recalibration Module (SRM), a simple yet effective architectural unit, which adaptively recalibrates intermediate feature maps by exploiting their styles. SRM first extracts the style information from each channel of the feature maps by style pooling, then estimates per-channel recalibration weight via channel-independent style integration. By incorporating the relative importance of individual styles into feature maps, SRM effectively enhances the representational ability of a CNN. The proposed module is directly fed into existing CNN architectures with negligible overhead. We conduct comprehensive experiments on general image recognition as well as tasks related to styles, which verify the benefit of SRM over recent approaches such as Squeeze-and-Excitation (SE). To explain the inherent difference between SRM and SE, we provide an in-depth comparison of their representational properties.

引用

页码：1854 / 1862

页数：9

共 36 条

[1]

[Anonymous], 2015, Arxiv.Org, DOI DOI 10.3389/FPSYG.2013.00124

[2]

[Anonymous], 2018, PROC CVPR IEEE, DOI [DOI 10.1109/CVPR.2018.00745, DOI 10.1109/TPAMI.2019.2913372]

[3]

[Anonymous], 2016, COMPUTER VISIONECCV, DOI DOI 10.1007/978-3-319-46448-0_2

[4]

Brendel M., 2019, P INT C LEARN REPR

[5] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[6] Deep Filter Banks for Texture Recognition, Description, and Segmentation [J].

Cimpoi, Mircea ;

Maji, Subhransu ;

Kokkinos, Iasonas ;

Vedaldi, Andrea .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 118 (01) :65-94

[7] Describing Textures in the Wild [J].

Cimpoi, Mircea ;

Maji, Subhransu ;

Kokkinos, Iasonas ;

Mohamed, Sammy ;

Vedaldi, Andrea .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3606-3613

[8] Control of goal-directed and stimulus-driven attention in the brain [J].

Corbetta, M ;

Shulman, GL .

NATURE REVIEWS NEUROSCIENCE, 2002, 3 (03) :201-215

[9]

Gatys L., 2015, Texture Synthesis Using Convolutional Neural NetworksOpen Source Implementation on GitHub, P262

[10] Image Style Transfer Using Convolutional Neural Networks [J].

Gatys, Leon A. ;

Ecker, Alexander S. ;

Bethge, Matthias .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2414-2423

← 1 2 3 4 →