Zero-Shot Human-Object Interaction Detection via Similarity Propagation

被引:4
|
作者
Zong, Daoming [1 ]
Sun, Shiliang [1 ,2 ,3 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci, Minist Educ, Shanghai 200062, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Human-object interaction (HOI) detection; object detection; zero-shot learning (ZSL);
D O I
10.1109/TNNLS.2023.3309104
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-object interaction (HOI) detection involves identifying interactions represented as < human, action, object >, requiring the localization of human-object pairs and interaction classification within an image. This work focuses on the challenge of detecting HOIs with unseen objects using the prevalent Transformer architecture. Our empirical analysis reveals that the performance degradation of novel HOI instances primarily arises from misclassifying unseen objects as confusable seen objects. To address this issue, we propose a similarity propagation (SP) scheme that leverages cosine similarity distance to regulate the prediction margin between seen and unseen objects. In addition, we introduce pseudo-supervision for unseen objects based on class semantic similarities during training. Furthermore, we incorporate semantic-aware instance-level and interaction-level contrastive losses with Transformer to enhance intraclass compactness and interclass separability, resulting in improved visual representations. Extensive experiments on two challenging benchmarks, V-COCO and HICO-DET, demonstrate the effectiveness of our model, outperforming current state-of-the-art methods under various zero-shot settings.
引用
收藏
页码:17805 / 17816
页数:12
相关论文
共 50 条
  • [31] Robust Region Feature Synthesizer for Zero-Shot Object Detection
    Huang, Peiliang
    Han, Junwei
    Cheng, De
    Zhang, Dingwen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7612 - 7621
  • [32] Zero-shot object detection with contrastive semantic association network
    Haohe Li
    Chong Wang
    Weijie Liu
    Yilin Gong
    Xinmiao Dai
    Applied Intelligence, 2023, 53 : 30056 - 30068
  • [33] A Multi-Space Approach to Zero-Shot Object Detection
    Gupta, Dikshant
    Anantharaman, Aditya
    Mamgain, Nehal
    Kamath, Sowmya S.
    Balasubramanian, Vineeth N.
    Jawahar, C., V
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1198 - 1206
  • [34] A dynamic semantic knowledge graph for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    VISUAL COMPUTER, 2023, 39 (10): : 4513 - 4527
  • [35] Zero-shot object detection with contrastive semantic association network
    Li, Haohe
    Wang, Chong
    Liu, Weijie
    Gong, Yilin
    Dai, Xinmiao
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30056 - 30068
  • [36] Zero-Shot Aerial Object Detection with Visual Description Regularization
    Zang, Zhengqing
    Lin, Chenyu
    Tang, Chenwei
    Wang, Tao
    Lv, Jiancheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6926 - 6934
  • [37] A dynamic semantic knowledge graph for zero-shot object detection
    Wen Lv
    Hongbo Shi
    Shuai Tan
    Bing Song
    Yang Tao
    The Visual Computer, 2023, 39 : 4513 - 4527
  • [38] Learning Latent Semantic Attributes for Zero-Shot Object Detection
    Wang, Kang
    Zhang, Lu
    Tan, Yifan
    Zhao, Jiajia
    Zhou, Shuigeng
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
  • [39] Adaptive adjustment with semantic embedding for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [40] GTNet: Generative Transfer Network for Zero-Shot Object Detection
    Zhao, Shizhen
    Gao, Changxin
    Shao, Yuanjie
    Li, Lerenhan
    Yu, Changqian
    Ji, Zhong
    Sang, Nang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12967 - 12974