Alternative Semantic Representations for Zero-Shot Human Action Recognition

被引:42
|
作者
Wang, Qian [1 ]
Chen, Ke [1 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I | 2017年 / 10534卷
关键词
Zero-shot learning; Semantic representation; Human action recognition; Image deep representation; Textual description representation; Fisher Vector;
D O I
10.1007/978-3-319-71249-9_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost, which paves a new way in gaining side information for large-scale zero-shot human action recognition. We investigate different encoding methods to generate semantic representations for human actions from such side information. Based on our zero-shot visual recognition method, we conducted experiments on UCF101 and HMDB51 to evaluate two proposed semantic representations. The results suggest that our proposed text- and image-based semantic representations outperform traditional attributes and word vectors considerably for zero-shot human action recognition. In particular, the image-based semantic representations yield the favourable performance even though the representation is extracted from a small number of images per class. Code related to this chapter is available at: http://staffcs.manchester.ac.uk/similar to kechen/BiDi LEL/ Data related to this chapter are available at: http://staff.cs.manchester.ac.uk/similar to kechen/ASRHAR/
引用
收藏
页码:87 / 102
页数:16
相关论文
共 50 条
  • [1] Global Semantic Descriptors for Zero-Shot Action Recognition
    Estevam, Valter
    Laroca, Rayson
    Pedrini, Helio
    Menotti, David
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1843 - 1847
  • [2] SEMANTIC EMBEDDING SPACE FOR ZERO-SHOT ACTION RECOGNITION
    Xu, Xun
    Hospedales, Timothy
    Gong, Shaogang
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 63 - 67
  • [3] Semantic matters: A constrained approach for zero-shot video action recognition
    Quan, Zhenzhen
    Chen, Jialei
    Deguchi, Daisuke
    Sun, Jie
    Zhang, Chenkai
    Li, Yujun
    Murase, Hiroshi
    PATTERN RECOGNITION, 2025, 162
  • [4] Spatiotemporal visual-semantic embedding network for zero-shot action recognition
    An, Rongqiao
    Miao, Zhenjiang
    Li, Qingyu
    Xu, Wanru
    Zhang, Qiang
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
  • [5] Zero-shot action recognition in videos: A survey
    Estevam, Valter
    Pedrini, Helio
    Menotti, David
    NEUROCOMPUTING, 2021, 439 : 159 - 175
  • [6] Elaborative Rehearsal for Zero-shot Action Recognition
    Chen, Shizhe
    Huang, Dong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13618 - 13627
  • [7] Zero-Shot Object Recognition by Semantic Manifold Distance
    Fu, Zhenyong
    Xiang, Tao
    Kodirov, Elyor
    Gong, Shaogang
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2635 - 2644
  • [8] Learning complementary semantic information for zero-shot recognition
    Hu, Xiaoming
    Wang, Zilei
    Li, Junjie
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
  • [9] Exploring Semantic Inter-Class Relationships (SIR) for Zero-Shot Action Recognition
    Gan, Chuang
    Lin, Ming
    Yang, Yi
    Zhuang, Yueting
    Hauptmann, Alexander G.
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3769 - 3775
  • [10] A ZERO-SHOT ARCHITECTURE FOR ACTION RECOGNITION IN STILL IMAGES
    Safaei, Marjaneh
    Foroosh, Hassan
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 460 - 464