Scaling Human-Object Interaction Recognition through Zero-Shot Learning

被引:95
|
作者
Shen, Liyue [1 ]
Yeung, Serena [1 ]
Hoffman, Judy [2 ]
Mori, Greg [3 ]
Li Fei-Fei [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
[3] Simon Fraser Univ, Burnaby, BC, Canada
关键词
D O I
10.1109/WACV.2018.00181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing human object interactions (HOI) is an important part of distinguishing the rich variety of human action in the visual world. While recent progress has been made in improving HOI recognition in the fully supervised setting, the space of possible human-object interactions is large and it is impractical to obtain labeled training data for all interactions of interest. In this work, we tackle the challenge of scaling HOI recognition to the long tail of categories through a zero-shot learning approach. We introduce a factorized model for HOI detection that disentangles reasoning on verbs and objects, and at test-time can therefore produce detections for novel verb-object pairs. We present experiments on the recently introduced large-scale HICO-DET dataset, and show that our model is able to both perform comparably to state-of-the-art in fully-supervised HOI detection, while simultaneously achieving effective zero-shot detection of new HOI categories.
引用
收藏
页码:1568 / 1576
页数:9
相关论文
共 50 条
  • [1] Scaling Human-Object Interaction Recognition in the Video through Zero-Shot Learning
    Maraghi, Vali Ollah
    Faez, Karim
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [2] Zero-Shot Learning on Human-Object Interaction Recognition in video
    Maraghi, Vali Ollah
    Faez, Karim
    2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [3] ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection
    Liu, Ye
    Yuan, Junsong
    Chen, Chang Wen
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4235 - 4243
  • [4] Zero-Shot Human-Object Interaction Detection via Similarity Propagation
    Zong, Daoming
    Sun, Shiliang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 17805 - 17816
  • [5] Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations
    Huynh, Dat
    Elhamifar, Ehsan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8452 - 8463
  • [6] Towards zero-shot human-object interaction detection via vision-language integration
    Xue, Weiying
    Liu, Qi
    Wang, Yuxiao
    Wei, Zhenao
    Xing, Xiaofen
    Xu, Xiangmin
    NEURAL NETWORKS, 2025, 187
  • [7] Context-Aware Zero-Shot Learning for Object Recognition
    Zablocki, Eloi
    Bordes, Patrick
    Piwowarski, Benjamin
    Soulier, Laure
    Gallinari, Patrick
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [8] ZERO-SHOT HUMAN-OBJECT INTERACTION (HOI) CLASSIFICATION BY BRIDGING GENERATIVE AND CONTRASTIVE IMAGE-LANGUAGE MODELS
    Jin, Ying
    Chen, Yinpeng
    Wang, Jianfeng
    Wang, Lijuan
    Hwang, Jenq-Neng
    Liu, Zicheng
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1970 - 1974
  • [9] Human Motion Recognition Using Zero-Shot Learning
    Mohammadi, Farid Ghareh
    Imteaj, Ahmed
    Amini, M. Hadi
    Arabnia, Hamid R.
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND APPLIED COGNITIVE COMPUTING, 2021, : 171 - 181
  • [10] Learning temporal information and object relation for zero-shot action recognition
    Qi, Qiuping
    Wang, Hanli
    Su, Taiyi
    Liu, Xianhui
    DISPLAYS, 2022, 73