Zero-Shot Human-Object Interaction Detection via Similarity Propagation

Cited by: 4
Authors
Zong, Daoming [1]
Sun, Shiliang [1, 2, 3]
Affiliations
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci, Minist Educ, Shanghai 200062, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Human-object interaction (HOI) detection; object detection; zero-shot learning (ZSL)
DOI
10.1109/TNNLS.2023.3309104
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Human-object interaction (HOI) detection involves identifying interactions represented as <human, action, object> triplets, which requires localizing human-object pairs and classifying their interactions within an image. This work focuses on the challenge of detecting HOIs with unseen objects using the prevalent Transformer architecture. Our empirical analysis reveals that the performance degradation on novel HOI instances primarily arises from misclassifying unseen objects as confusable seen objects. To address this issue, we propose a similarity propagation (SP) scheme that leverages cosine similarity to regulate the prediction margin between seen and unseen objects. In addition, we introduce pseudo-supervision for unseen objects based on class semantic similarities during training. Furthermore, we incorporate semantic-aware instance-level and interaction-level contrastive losses into the Transformer to enhance intraclass compactness and interclass separability, resulting in improved visual representations. Extensive experiments on two challenging benchmarks, V-COCO and HICO-DET, demonstrate the effectiveness of our model, which outperforms current state-of-the-art methods under various zero-shot settings.
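The idea of pseudo-supervision from class semantic similarities can be illustrated with a small sketch. This is not the paper's formulation, which the abstract does not spell out. It only assumes that each class has a semantic embedding, and that a training target for a seen class hands a small share of probability mass to unseen classes in proportion to cosine similarity. The function names and the `unseen_mass` and `temperature` parameters are hypothetical and chosen for illustration.

```python
import numpy as np


def cosine_similarity_matrix(a, b):
    """Pairwise cosine similarity between the rows of a and the rows of b."""
    a_unit = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_unit = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_unit @ b_unit.T


def pseudo_soft_labels(seen_emb, unseen_emb, seen_label,
                       unseen_mass=0.2, temperature=0.1):
    """Soft target over [seen classes, unseen classes] for one seen label.

    Assumption (illustrative only): the true seen class keeps
    1 - unseen_mass, and unseen_mass is spread over the unseen classes
    with a softmax over cosine similarity / temperature.
    """
    # Similarity of the labelled seen class to every unseen class.
    sim = cosine_similarity_matrix(
        seen_emb[seen_label:seen_label + 1], unseen_emb)[0]
    logits = sim / temperature
    logits = logits - logits.max()  # subtract the max for numerical stability
    weights = np.exp(logits)
    weights = weights / weights.sum()

    target = np.zeros(len(seen_emb) + len(unseen_emb))
    target[seen_label] = 1.0 - unseen_mass
    target[len(seen_emb):] = unseen_mass * weights
    return target


# Toy 2-D embeddings: unseen class 0 is close to seen class 0.
seen = np.array([[1.0, 0.0], [0.0, 1.0]])
unseen = np.array([[0.9, 0.1], [0.1, 0.9]])
print(pseudo_soft_labels(seen, unseen, seen_label=0))
```

In this toy setup the unseen class whose embedding is closest to the labelled seen class receives most of the transferred mass, which is the behaviour a similarity-based pseudo-label is meant to have.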
Pages: 17805-17816
Number of pages: 12