Contextual Heterogeneous Graph Network for Human-Object Interaction Detection

被引:43
作者
Hai Wang [1 ,2 ]
Zheng, Wei-shi [1 ,2 ]
Ling Yingbiao [1 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
[2] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518005, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT XVII | 2020年 / 12362卷
关键词
Human-object interaction; Heterogeneous graph; Neural network;
D O I
10.1007/978-3-030-58520-4_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-object interaction (HOI) detection is an important task for understanding human activity. Graph structure is appropriate to denote the HOIs in the scene. Since there is an subordination between human and object-human play subjective role and object play objective role in HOI, the relations between homogeneous entities and heterogeneous entities in the scene should also not be equally the same. However, previous graph models regard human and object as the same kind of nodes and do not consider that the messages are not equally the same between different entities. In this work, we address such a problem for HOI task by proposing a heterogeneous graph network that models humans and objects as different kinds of nodes and incorporates intra-class messages between homogeneous nodes and inter-class messages between heterogeneous nodes. In addition, a graph attention mechanism based on the intra-class context and inter-class context is exploited to improve the learning. Extensive experiments on the benchmark datasets V-COCO and HICO-DET verify the effectiveness of our method and demonstrate the importance to extract intra-class and interclass messages which are not equally the same in HOI detection.
引用
收藏
页码:248 / 264
页数:17
相关论文
共 39 条
[11]   Detecting and Recognizing Human-Object Interactions [J].
Gkioxari, Georgia ;
Girshick, Ross ;
Dollar, Piotr ;
He, Kaiming .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8359-8367
[12]   Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition [J].
Gupta, Abhinav ;
Kembhavi, Aniruddha ;
Davis, Larry S. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (10) :1775-1789
[13]  
Gupta S, 2015, Arxiv, DOI arXiv:1505.04474
[14]   No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques [J].
Gupta, Tanmay ;
Schwing, Alexander ;
Hoiem, Derek .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9676-9684
[15]  
Hamilton WL, 2017, ADV NEUR IN, V30
[16]   Recognising Human-Object Interaction via Exemplar based Modelling [J].
Hu, Jian-Fang ;
Zheng, Wei-Shi ;
Lai, Jianhuang ;
Gong, Shaogang ;
Xiang, Tao .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3144-3151
[17]   Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations [J].
Krishna, Ranjay ;
Zhu, Yuke ;
Groth, Oliver ;
Johnson, Justin ;
Hata, Kenji ;
Kravitz, Joshua ;
Chen, Stephanie ;
Kalantidis, Yannis ;
Li, Li-Jia ;
Shamma, David A. ;
Bernstein, Michael S. ;
Li Fei-Fei .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (01) :32-73
[18]   Multi-Label Zero-Shot Learning with Structured Knowledge Graphs [J].
Lee, Chung-Wei ;
Fang, Wei ;
Yeh, Chih-Kuan ;
Wang, Yu-Chiang Frank .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1576-1585
[19]   Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition [J].
Li, Maosen ;
Chen, Siheng ;
Chen, Xu ;
Zhang, Ya ;
Wang, Yanfeng ;
Tian, Qi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3590-3598
[20]  
Li YL, 2019, Arxiv, DOI arXiv:1811.08264