Weakly Supervised Localization and Learning with Generic Knowledge

被引:2
作者
Thomas Deselaers
Bogdan Alexe
Vittorio Ferrari
机构
[1] ETH Zurich,Computer Vision Laboratory
[2] Google,undefined
来源
International Journal of Computer Vision | 2012年 / 100卷
关键词
Object detection; Weakly supervised learning; Transfer learning; Conditional random fields;
D O I
暂无
中图分类号
学科分类号
摘要
Learning a new object class from cluttered training images is very challenging when the location of object instances is unknown, i.e. in a weakly supervised setting. Many previous works require objects covering a large portion of the images. We present a novel approach that can cope with extensive clutter as well as large scale and appearance variations between object instances. To make this possible we exploit generic knowledge learned beforehand from images of other classes for which location annotation is available. Generic knowledge facilitates learning any new class from weakly supervised images, because it reduces the uncertainty in the location of its object instances. We propose a conditional random field that starts from generic knowledge and then progressively adapts to the new class. Our approach simultaneously localizes object instances while learning an appearance model specific for the class. We demonstrate this on several datasets, including the very challenging Pascal VOC 2007. Furthermore, our method allows training any state-of-the-art object detector in a weakly supervised fashion, although it would normally require object location annotations.
引用
收藏
页码:275 / 293
页数:18
相关论文
共 29 条
[1]  
Chen Y.(2006)MILES: multiple-instance learning via embedded instance selection IEEE Transactions on Pattern Analysis and Machine Intelligence 28 1931-1947
[2]  
Bi J.(2010)Object detection with discriminatively trained part based models IEEE Transactions on Pattern Analysis and Machine Intelligence 32 1627-1645
[3]  
Wang J. Z.(2006)Convergent tree-reweighted message passing for energy minimization IEEE Transactions on Pattern Analysis and Machine Intelligence 28 1568-1583
[4]  
Felzenszwalb P.(2006)Convergent tree-reweighted message passing for energy minimization IEEE Transactions on Pattern Analysis and Machine Intelligence 28 1568-1583
[5]  
Girshick R.(2009)Efficient subwindow search: A branch and bound framework for object localization IEEE Transactions on Pattern Analysis and Machine Intelligence 31 2129-2142
[6]  
McAllester D.(2009)Foreground focus: unsupervised learning from partially matching images International Journal of Computer Vision 85 143-166
[7]  
Ramanan D.(2001)Modeling the shape of the scene: a holistic representation of the spatial envelope International Journal of Computer Vision 42 145-175
[8]  
Kolmogorov V.(2004)Grabcut: interactive foreground extraction using iterated graph cuts Computer Graphics 23 309-314
[9]  
Kolmogorov V.(2008)LabelMe: a database and web-based tool for image annotation International Journal of Computer Vision 77 157-173
[10]  
Lampert C. H.(2005)Large margin methods for structured and interdependent output variables Journal of Machine Learning Research 6 1453-1484