Semantic Inference Network for Human-Object Interaction Detection

Authors
Liu, Hongyi [1 ]
Mo, Lisha [1 ]
Ma, Huimin [1 ]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
Source
IMAGE AND GRAPHICS, ICIG 2019, PT I | 2019 / Vol. 11901
Funding
National Natural Science Foundation of China;
Keywords
Human-object interaction; Visual relationship detection; Word embedding;
DOI
10.1007/978-3-030-34120-6_42
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Subject classification code
081202 ;
Abstract
Recently, many efforts have been made to understand the scenes in images. The interactions between humans and objects are usually of great significance to scene understanding. In this paper, we focus on the task of detecting human-object interactions (HOI), which is to detect <human, verb, object> triplets in challenging daily images. We propose a novel model which introduces a semantic stream and a new form of loss function. Our intuition is that the semantic information of object classes is beneficial to HOI detection. Semantic information is extracted by embedding the category information of objects with a pre-trained BERT model. In addition, we find that the HOI task suffers severely from extreme imbalance between positive and negative samples. We propose a weighted focal loss (WFL) to tackle this problem. The results show that our method achieves a gain of 5% compared with our baseline.
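The abstract does not give the formula for the weighted focal loss, so the following is only a minimal sketch of the general idea, assuming the standard binary focal loss with one extra weight that separates positive from negative samples. The function name `weighted_focal_loss` and the parameters `gamma` and `pos_weight` are hypothetical and are not taken from the paper.

```python
import math


def weighted_focal_loss(probs, labels, gamma=2.0, pos_weight=0.75, eps=1e-7):
    """Mean binary focal loss with separate weights for positives and negatives.

    This is an illustrative assumption, not the paper's WFL definition.
    probs  : predicted probabilities in [0, 1], one per (human, verb, object) candidate
    labels : 1 for a positive sample, 0 for a negative sample
    """
    if len(probs) == 0 or len(probs) != len(labels):
        raise ValueError("probs and labels must be non-empty and of equal length")

    total = 0.0
    for p, y in zip(probs, labels):
        # Clamp to avoid log(0).
        p = min(max(p, eps), 1.0 - eps)
        # p_t is the probability assigned to the true class.
        p_t = p if y == 1 else 1.0 - p
        # Class weight: pos_weight for positives, the remainder for negatives.
        w = pos_weight if y == 1 else 1.0 - pos_weight
        # The factor (1 - p_t)^gamma shrinks the loss of well-classified samples.
        total += -w * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(probs)


# One positive among many easy negatives, mimicking the imbalance described above.
probs = [0.6] + [0.05] * 9
labels = [1] + [0] * 9
print(weighted_focal_loss(probs, labels))
```

With `gamma=0` and `pos_weight=0.5` the sketch reduces to half of the ordinary binary cross-entropy, which makes the effect of the two extra terms easy to compare.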
Pages: 518 - 529
Page count: 12