Inverting Visual Representations with Convolutional Networks

被引:330
作者
Dosovitskiy, Alexey [1 ]
Brox, Thomas [1 ]
机构
[1] Univ Freiburg, Freiburg, Germany
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
基金
欧洲研究理事会;
关键词
D O I
10.1109/CVPR.2016.522
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature representations, both hand-designed and learned ones, are often hard to analyze and interpret, even when they are extracted from visual data. We propose a new approach to study image representations by inverting them with an up-convolutional neural network. We apply the method to shallow representations ( HOG, SIFT, LBP), as well as to deep networks. For shallow representations our approach provides significantly better reconstructions than existing methods, revealing that there is surprisingly rich information contained in these features. Inverting a deep network trained on ImageNet provides several insights into the properties of the feature representation learned by the network. Most strikingly, the colors and the rough contours of an image can be reconstructed from activations in higher network layers and even from the predicted class probabilities.
引用
收藏
页码:4829 / 4837
页数:9
相关论文
共 28 条
  • [1] Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
  • [2] [Anonymous], 2011, CVPR
  • [3] [Anonymous], 2010, P 18 ACM INT C MULT, DOI [10.1145/1873951.1874249, 10.1145/1873951.1874249.2]
  • [4] [Anonymous], P INT C NEUR NETW
  • [5] Bishop CM, 1995, Neural Networks for Pattern Recognition
  • [6] From Bits to Images: Inversion of Local Binary Descriptors
    d'Angelo, Emmanuel
    Jacques, Laurent
    Alahi, Alexandre
    Vandergheynst, Pierre
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) : 874 - 887
  • [7] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [8] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [9] Dosovitskiy A, 2015, PROC CVPR IEEE, P1538, DOI 10.1109/CVPR.2015.7298761
  • [10] Felzenszwalb P. F., 2010, IEEE transactions on pattern analysis and machine intelligence, V32, P1627, DOI [DOI 10.1109/TPAMI.2009.167, 10.1109/TPAMI.2009.167]