Gabor Convolutional Networks

被引:248
作者
Luan, Shangzhen [1 ]
Chen, Chen [2 ]
Zhang, Baochang [1 ,3 ]
Han, Jungong [4 ]
Liu, Jianzhuang [5 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Univ N Carolina, Dept Elect & Comp Engn, Charlotte, NC 28223 USA
[3] Shenzhen Acad Aerosp Technol, Shenzhen 518057, Peoples R China
[4] Univ Lancaster, Sch Comp & Commun, Lancaster LA1 4YW, England
[5] Huawei Technol Co Ltd, Noahs Ark Lab, Shenzhen 518129, Peoples R China
关键词
Gabor CNNs; Gabor filters; convolutional neural networks; orientation; kernel modulation; SALIENCY DETECTION; RECOGNITION;
D O I
10.1109/TIP.2018.2835143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In steerable filters, a filter of arbitrary orientation can be generated by a linear combination of a set of "basis filters." Steerable properties dominate the design of the traditional filters, e.g., Gabor filters and endow features the capability of handling spatial transformations. However, such properties have not yet been well explored in the deep convolutional neural networks (DCNNs). In this paper, we develop a new deep model, namely, Gabor convolutional networks (GCNs or Gabor CNNs), with Gabor filters incorporated into DCNNs such that the robustness of learned features against the orientation and scale changes can be reinforced. By manipulating the basic element of DCNNs, i.e., the convolution operator, based on Gabor filters, GCNs can be easily implemented and are readily compatible with any popular deep learning architecture. We carry out extensive experiments to demonstrate the promising performance of our GCNs framework, and the results show its superiority in recognizing objects, especially when the scale and rotation changes take place frequently. Moreover, the proposed GCNs have much fewer network parameters to be learned and can effectively reduce the training complexity of the network, leading to a more compact deep learning model while still maintaining a high feature representation capacity.
引用
收藏
页码:4357 / 4366
页数:10
相关论文
共 42 条
  • [1] [Anonymous], 2016, LOCAL BINARY CONVOLU
  • [2] [Anonymous], 2012, CoRR
  • [3] [Anonymous], 2012, Technical Report
  • [4] Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
  • [5] Boureau Y, 2010, P 27 INT C MACH LEAR, P111, DOI DOI 10.5555/3104322.3104338
  • [6] Invariant Scattering Convolution Networks
    Bruna, Joan
    Mallat, Stephane
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1872 - 1886
  • [7] Chang SY, 2014, INTERSPEECH, P905
  • [8] Kernel Pooling for Convolutional Neural Networks
    Cui, Yin
    Zhou, Feng
    Wang, Jiang
    Liu, Xiao
    Lin, Yuanqing
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3049 - 3058
  • [9] Dai J., 2017, Deformable convolutional networks
  • [10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848