Self-Taught Object Localization with Deep Networks

被引:44
作者
Bazzani, Loris [1 ]
Bergamo, Alessandro [1 ]
Anguelov, Dragomir [2 ]
Torresani, Lorenzo [1 ]
机构
[1] Dartmouth Coll, Dept Comp Sci, Hanover, NH 03755 USA
[2] Google Inc, Mountain View, CA USA
来源
2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016) | 2016年
基金
美国国家科学基金会;
关键词
D O I
10.1109/wacv.2016.7477688
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using any ground-truth bounding boxes for training. The key idea is to analyze the change in the recognition scores when artificially masking out different regions of the image. The masking out of a region that includes the object typically causes a significant drop in recognition score. This idea is embedded into an agglomerative clustering technique that generates self-taught localization hypotheses. Our object localization scheme outperforms existing proposal methods in both precision and recall for small number of subwindow proposals (e.g., on ILSVRC-2012 it produces a relative gain of 23.4% over the state-of-the-art for top-1 hypothesis). Furthermore, our experiments show that the annotations automatically-generated by our method can be used to train object detectors yielding recognition results remarkably close to those obtained by training on manually-annotated bounding boxes.
引用
收藏
页数:9
相关论文
共 36 条
  • [1] Measuring the Objectness of Image Windows
    Alexe, Bogdan
    Deselaers, Thomas
    Ferrari, Vittorio
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
  • [2] [Anonymous], 2014, BMVC
  • [3] [Anonymous], 2015, ICLR
  • [4] [Anonymous], JMLR
  • [5] [Anonymous], 2014, WORKSH INT C LEARN R
  • [6] [Anonymous], 2012, ICML
  • [7] [Anonymous], 2 INT C LEARN REPR
  • [8] [Anonymous], 2015, NIPS
  • [9] [Anonymous], IEEE CVPR
  • [10] [Anonymous], 2014, IEEE CVPR