Towards Learning Affine-Invariant Representations via Data-Efficient CNNs

被引:0
作者
Xu, Wenju [1 ,4 ]
Wang, Guanghui [1 ,4 ]
Sullivan, Alan [2 ]
Zhang, Ziming [3 ]
机构
[1] Univ Kansas, Lawrence, KS 66045 USA
[2] Mitsubishi Elect Res Labs MERL, Cambridge, MA USA
[3] Worcester Polytech Inst, Worcester, MA 01609 USA
[4] MERL, Cambridge, MA 02139 USA
来源
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2020年
关键词
RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose integrating a priori knowledge into both design and training of convolutional neural networks (CNNs) to learn object representations that are invariant to affine transformations (i.e. translation, scale, rotation). Accordingly we propose a novel multi-scale maxout CNN and train it end-to-end with a novel rotation-invariant regularizer. This regularizer aims to enforce the weights in each 2D spatial filter to approximate circular patterns. In this way, we manage to handle affine transformations in training using convolution, multi-scale maxout, and circular filters. Empirically we demonstrate that such knowledge can significantly improve the data-efficiency as well as generalization and robustness of learned models. For instance, on the Traffic Sign data set and trained with only 10 images per class, our method can achieve 84.15% that outperforms the state-of-the-art by 29.80% in terms of test accuracy.
引用
收藏
页码:893 / 902
页数:10
相关论文
共 51 条
  • [41] Learning Steerable Filters for Rotation Equivariant CNNs
    Weiler, Maurice
    Hamprecht, Fred A.
    Storath, Martin
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 849 - 858
  • [42] A Discriminative Feature Learning Approach for Deep Face Recognition
    Wen, Yandong
    Zhang, Kaipeng
    Li, Zhifeng
    Qiao, Yu
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 499 - 515
  • [43] Worrall DE., 2017, CVPR, V2
  • [44] Xu Yichong, 2015, ARXIV14116369
  • [45] Yu F., 2015, arXiv preprint arXiv: 1506. 03365
  • [46] Zagoruyko S., 2016, BRIT MACHINE VISION, V87, P1
  • [47] Interpretable Convolutional Neural Networks
    Zhang, Quanshi
    Wu, Ying Nian
    Zhu, Song-Chun
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8827 - 8836
  • [48] Zhang TZ, 2017, PROC CVPR IEEE, P4819, DOI [10.1109/CVPR.2017.512, 10.1109/ICCV.2017.469]
  • [49] Rotation Invariant Local Binary Convolution Neural Networks
    Zhang, Xin
    Liu, Li
    Xie, Yuxiang
    Chen, Jie
    Wu, Lingda
    Pietikainen, Matti
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1210 - 1219
  • [50] Zhang Z., 2017, ADV NEURAL INFORM PR, P1721