Alternative Semantic Representations for Zero-Shot Human Action Recognition

Cited by: 42

Authors:

Wang, Qian [1]

Chen, Ke [1]

Affiliations:

[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I | 2017 / Vol. 10534

Keywords:

Zero-shot learning; Semantic representation; Human action recognition; Image deep representation; Textual description representation; Fisher Vector;

DOI:

10.1007/978-3-319-71249-9_6

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations specifically for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information is accessible on the Web at little cost, which opens a new way of obtaining side information for large-scale zero-shot human action recognition. We investigate different encoding methods to generate semantic representations for human actions from such side information. Based on our zero-shot visual recognition method, we conducted experiments on UCF101 and HMDB51 to evaluate the two proposed semantic representations. The results suggest that our proposed text- and image-based semantic representations outperform traditional attributes and word vectors considerably for zero-shot human action recognition. In particular, the image-based semantic representations yield favourable performance even though the representation is extracted from a small number of images per class. Code related to this chapter is available at: http://staff.cs.manchester.ac.uk/~kechen/BiDiLEL/ Data related to this chapter are available at: http://staff.cs.manchester.ac.uk/~kechen/ASRHAR/

Pages: 87-102

Page count: 16
