What Makes a Chair a Chair?

被引:0
作者
Grabner, Helmut [1 ]
Gall, Juergen [1 ]
Van Gool, Luc [1 ]
机构
[1] ETH, Comp Vis Lab, Zurich, Switzerland
来源
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2011年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many object classes are primarily defined by their functions. However, this fact has been left largely unexploited by visual object categorization or detection systems. We propose a method to learn an affordance detector. It identifies locations in the 3d space which "support" the particular function. Our novel approach "imagines" an actor performing an action typical for the target object class, instead of relying purely on the visual object appearance. So, function is handled as a cue complementary to appearance, rather than being a consideration after appearance-based detection. Experimental results are given for the functional category "sitting". Such affordance is tested on a 3d representation of the scene, as can be realistically obtained through SfM or depth cameras. In contrast to appearance-based object detectors, affordance detection requires only very few training examples and generalizes very well to other sittable objects like benches or sofas when trained on a few chairs.
引用
收藏
页码:1529 / 1536
页数:8
相关论文
共 26 条
[1]  
Aksoy E., 2010, P INT C ROB AUT
[2]  
[Anonymous], 2010, P CVPR
[3]  
[Anonymous], 2010, CVIU
[4]  
[Anonymous], P ICCV
[5]  
[Anonymous], P INT C COMP VIS SYS
[6]   Object names and object functions serve as cues to categories for infants [J].
Booth, AE ;
Waxman, S .
DEVELOPMENTAL PSYCHOLOGY, 2002, 38 (06) :948-957
[7]  
Bulthoff I., 2003, ANAL HOLISTIC PROCES, P146
[8]   What effects on "where": Functional influences on spatial relations [J].
Carlson-Radvansky, LA ;
Covey, ES ;
Lattanzi, KM .
PSYCHOLOGICAL SCIENCE, 1999, 10 (06) :516-521
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338