Hierarchical Reasoning Network for Human-Object Interaction Detection

被引:12
作者
Gao, Yiming [1 ]
Kuang, Zhanghui [2 ]
Li, Guanbin [1 ]
Zhang, Wayne [2 ]
Lin, Liang [1 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Cognition; Correlation; Benchmark testing; Task analysis; Sports; Periodic structures; Human-object interaction; hierarchical reasoning network; graph neural network; REPRESENTATION; CNNS;
D O I
10.1109/TIP.2021.3093784
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-object interaction detection that aims at detecting <human, verb, object> triplets is critical for the holistic human-centric scene understanding. Existing approaches ignore the modeling of correlations among hierarchical human parts and objects. In this work, we introduce a Hierarchical Reasoning Network (HRNet) to capture relations among human parts at multiple scales (including the holistic human, human region, and human keypoint levels) and objects via a unified graph. In particular, HRNet first constructs one multi-level human parts graph, each level of which consists of human parts at one specific scale, objects, and the unions of human part-object pairs as nodes, and their mutual visual and spatial layout relations as intra-level reasoning. To also capture the relations across scales, we further introduce inter-level reasoning between the nodes of two consecutive levels based on the prior of human body structure. The representations of graph nodes are propagated along intra-level and inter-level reasoning in turn during reasoning. Extensive experiments demonstrate our HRNet obtains new state-of-the-art results on three challenging HICO-DET, V-COCO and HOI-A benchmarks, validating the compelling effectiveness of the proposed method.
引用
收藏
页码:8306 / 8317
页数:12
相关论文
共 50 条
  • [41] Human-object interaction detection based on cascade multi-scale transformer
    Limin Xia
    Xiaoyue Ding
    Applied Intelligence, 2024, 54 : 2831 - 2850
  • [42] Human-object interaction detection via recycling of ground-truth annotations
    Lin, Xue
    Zou, Qi
    Xu, Xixia
    PATTERN RECOGNITION, 2025, 157
  • [43] Human-object interaction detection based on cascade multi-scale transformer
    Xia, Limin
    Ding, Xiaoyue
    APPLIED INTELLIGENCE, 2024, 54 (03) : 2831 - 2850
  • [44] HOI as Embeddings: Advancements of Model Representation Capability in Human-Object Interaction Detection
    Chen, Junwen
    Wang, Yingcheng
    Yanai, Keiji
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, : 116 - 122
  • [45] SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection
    Zhu, Lijing
    Lan, Qizhen
    Velasquez, Alvaro
    Song, Houbing
    Kamal, Acharya
    Tian, Qing
    Niu, Shuteng
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 1186 - 1193
  • [46] Cognition Guided Human-Object Relationship Detection
    Zeng, Zhitao
    Dai, Pengwen
    Zhang, Xuan
    Zhang, Lei
    Cao, Xiaochun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2468 - 2480
  • [47] An Optimization Model For Human-Object Interaction Detection Inspired By Multi-Features
    Kuang, Hailan
    Dong, Jian
    Liu, Xinhua
    Ma, Xiaolin
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 733 - 737
  • [48] Human–Object Interaction Detection: An Overview
    Wang, Jia
    Shuai, Hong-Han
    Li, Yung-Hui
    Cheng, Wen-Huang
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2024, 13 (06) : 56 - 72
  • [49] Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection
    Li, Gongyang
    Liu, Zhi
    Chen, Minyu
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3528 - 3542
  • [50] Spatial-Aware Multi-Level Parsing Network for Human-Object Interaction
    Su, Zhan
    Yu, Ruiyun
    Zou, Shihao
    Guo, Bingyang
    Cheng, Li
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, : 39 - 48