Pairwise Body-Part Attention for Recognizing Human-Object Interactions

被引:91
作者
Fang, Hao-Shu [1 ]
Cao, Jinkun [1 ]
Tai, Yu-Wing [2 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Tencent YouTu Lab, Shanghai, Peoples R China
来源
COMPUTER VISION - ECCV 2018, PT X | 2018年 / 11214卷
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Human-object interactions; Body-part correlations; Attention model; ACTION RECOGNITION;
D O I
10.1007/978-3-030-01249-6_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In human-object interactions (HOI) recognition, conventional methods consider the human body as a whole and pay a uniform attention to the entire body region. They ignore the fact that normally, human interacts with an object by using some parts of the body. In this paper, we argue that different body parts should be paid with different attention in HOI recognition, and the correlations between different body parts should be further considered. This is because our body parts always work collaboratively. We propose a new pairwise body-part attention model which can learn to focus on crucial parts, and their correlations for HOI recognition. A novel attention based feature selection method and a feature representation scheme that can capture pairwise correlations between body parts are introduced in the model. Our proposed approach achieved 10% relative improvement (36.1mAP -> 39.9mAP) over the state-of-the-art results in HOI recognition on the HICO dataset. We will make our model and source codes publicly available.
引用
收藏
页码:52 / 68
页数:17
相关论文
共 56 条
[41]   Attentional biases for faces and body parts [J].
Ro, Tony ;
Friggel, Ashley ;
Lavie, Nilli .
VISUAL COGNITION, 2007, 15 (03) :322-348
[42]   Expanded Parts Model for Human Attribute and Action Recognition in Still Images [J].
Sharma, Gaurav ;
Jurie, Frederic ;
Schmid, Cordelia .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :652-659
[43]   Where To Look: Focus Regions for Visual Question Answering [J].
Shih, Kevin J. ;
Singh, Saurabh ;
Hoiem, Derek .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4613-4621
[44]  
Song SJ, 2017, AAAI CONF ARTIF INTE, P4263
[45]   Action Recognition with Improved Trajectories [J].
Wang, Heng ;
Schmid, Cordelia .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3551-3558
[46]   A Simple Ontology of Manipulation Actions Based on Hand-Object Relations [J].
Woergoetter, Florentin ;
Aksoy, Eren Erdal ;
Krueger, Norbert ;
Piater, Justus ;
Ude, Ales ;
Tamosiunaite, Minija .
IEEE TRANSACTIONS ON AUTONOMOUS MENTAL DEVELOPMENT, 2013, 5 (02) :117-134
[47]  
Xiao TJ, 2015, PROC CVPR IEEE, P842, DOI 10.1109/CVPR.2015.7298685
[48]  
Xu K, 2015, PR MACH LEARN RES, V37, P2048
[49]   Recognizing Human Actions from Still Images with Latent Poses [J].
Yang, Weilong ;
Wang, Yang ;
Mori, Greg .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2030-2037
[50]  
Yang Yezhou., 2013, CVPR