Gabor Convolutional Networks

被引：248

作者：

Luan, Shangzhen ^{[1
]}

Chen, Chen ^{[2
]}

Zhang, Baochang ^{[1
,3
]}

Han, Jungong ^{[4
]}

Liu, Jianzhuang ^{[5
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[2] Univ N Carolina, Dept Elect & Comp Engn, Charlotte, NC 28223 USA

[3] Shenzhen Acad Aerosp Technol, Shenzhen 518057, Peoples R China

[4] Univ Lancaster, Sch Comp & Commun, Lancaster LA1 4YW, England

[5] Huawei Technol Co Ltd, Noahs Ark Lab, Shenzhen 518129, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2018年 / 27卷 / 09期

关键词：

Gabor CNNs; Gabor filters; convolutional neural networks; orientation; kernel modulation; SALIENCY DETECTION; RECOGNITION;

D O I：

10.1109/TIP.2018.2835143

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In steerable filters, a filter of arbitrary orientation can be generated by a linear combination of a set of "basis filters." Steerable properties dominate the design of the traditional filters, e.g., Gabor filters and endow features the capability of handling spatial transformations. However, such properties have not yet been well explored in the deep convolutional neural networks (DCNNs). In this paper, we develop a new deep model, namely, Gabor convolutional networks (GCNs or Gabor CNNs), with Gabor filters incorporated into DCNNs such that the robustness of learned features against the orientation and scale changes can be reinforced. By manipulating the basic element of DCNNs, i.e., the convolution operator, based on Gabor filters, GCNs can be easily implemented and are readily compatible with any popular deep learning architecture. We carry out extensive experiments to demonstrate the promising performance of our GCNs framework, and the results show its superiority in recognizing objects, especially when the scale and rotation changes take place frequently. Moreover, the proposed GCNs have much fewer network parameters to be learned and can effectively reduce the training complexity of the network, leading to a more compact deep learning model while still maintaining a high feature representation capacity.

引用

页码：4357 / 4366

页数：10

共 42 条

[1] [Anonymous], 2016, LOCAL BINARY CONVOLU
[2] [Anonymous], 2012, CoRR
[3] [Anonymous], 2012, Technical Report
[4] Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[5] Boureau Y, 2010, P 27 INT C MACH LEAR, P111, DOI DOI 10.5555/3104322.3104338
[6] Invariant Scattering Convolution Networks
Bruna, Joan
Mallat, Stephane
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1872 - 1886
[7] Chang SY, 2014, INTERSPEECH, P905
[8] Kernel Pooling for Convolutional Neural Networks
Cui, Yin
Zhou, Feng
Wang, Jiang
Liu, Xiao
Lin, Yuanqing
Belongie, Serge
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3049 - 3058
[9] Dai J., 2017, Deformable convolutional networks
[10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 →