Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition

Cited by: 336
Authors
Gupta, Abhinav [1 ]
Kembhavi, Aniruddha [2 ]
Davis, Larry S. [1 ]
Affiliations
[1] Univ Maryland, Dept Comp Sci, AV Williams Bldg, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
Keywords
Action recognition; object recognition; functional recognition; representation
DOI
10.1109/TPAMI.2009.83
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Interpretation of images and videos containing humans interacting with different objects is a daunting task. It involves understanding the scene or event, analyzing human movements, recognizing manipulable objects, and observing the effect of the human movement on those objects. While each of these perceptual tasks can be conducted independently, the recognition rate improves when interactions between them are considered. Motivated by psychological studies of human perception, we present a Bayesian approach that integrates the various perceptual tasks involved in understanding human-object interactions. Previous approaches to object and action recognition rely on static shape/appearance feature matching and motion analysis, respectively. Our approach goes beyond these traditional approaches and applies spatial and functional constraints on each of the perceptual elements for coherent semantic interpretation. Such constraints allow us to recognize objects and actions when the appearances are not discriminative enough. We also demonstrate the use of such constraints in the recognition of actions from static images without using any motion information.
Pages: 1775-1789
Number of pages: 15