Semantic-Guided Relation Propagation Network for Few-shot Action Recognition

被引：21

作者：

Wang, Xiao ^{[1
]}

Ye, Weirong ^{[1
]}

Qi, Zhongang ^{[2
]}

Zhao, Xun ^{[2
]}

Wang, Guangge ^{[1
]}

Shan, Ying ^{[2
]}

Wang, Hanzi ^{[1
]}

机构：

[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China

[2] Tencent PCG, Appl Res Ctr ARC, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

关键词：

Few-shot action recognition; Semantic information; Supervisory; signal; Spatial-temporal difference;

D O I：

10.1145/3474085.3475253

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot action recognition has drawn growing attention as it can recognize novel action classes by using only a few labeled samples. In this paper, we propose a novel semantic-guided relation propagation network (SRPN), which leverages semantic information together with visual information for few-shot action recognition. Different from most previous works that neglect semantic information in the labeled data, our SRPN directly utilizes the semantic label as an additional supervisory signal to improve the generalization ability of the network. Besides, we treat the relation of each visual-semantic pair as a relational node, and we use a graph convolutional network to model and propagate such sample relations across visual-semantic pairs, including both intra-class commonality and inter-class uniqueness, to guide the relation propagation in the graph. However, since videos contain crucial sequences and ordering information, we propose a novel spatial-temporal difference module, which can facilitate the network to enhance the visual feature learning ability at both feature level and granular level for videos. Extensive experiments conducted on several challenging benchmarks demonstrate that our SRPN outperforms several state-of-the-art methods with a significant margin.

引用

页码：816 / 825

页数：10

共 50 条

[1] Semantic-guided spatio-temporal attention for few-shot action recognition
Jianyu Wang
Baolin Liu
Applied Intelligence, 2024, 54 : 2458 - 2471
[2] Semantic-guided spatio-temporal attention for few-shot action recognition
Wang, Jianyu
Liu, Baolin
APPLIED INTELLIGENCE, 2024, 54 (03) : 2458 - 2471
[3] Few-Shot Human-Object Interaction Recognition With Semantic-Guided Attentive Prototypes Network
Ji, Zhong
Liu, Xiyao
Pang, Yanwei
Ouyang, Wangli
Li, Xuelong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (1648-1661) : 1648 - 1661
[4] Hybrid Relation Guided Set Matching for Few-shot Action Recognition
Wang, Xiang
Zhang, Shiwei
Qing, Zhiwu
Tang, Mingqian
Zuo, Zhengrong
Gao, Changxin
Jin, Rong
Sang, Nong
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19916 - 19925
[5] SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition
Ji, Zhong
Liu, Xiyao
Pang, Yanwei
Li, Xuelong
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11085 - 11092
[6] Knowledge-Guided Semantic Transfer Network for Few-Shot Image Recognition
Li, Zechao
Tang, Hao
Peng, Zhimao
Qi, Guo-Jun
Tang, Jinhui
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
[7] Semantic-Guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift
Xiao, Kangyu
Wang, Zilei
Li, Junjie
COMPUTER VISION - ECCV 2024, PT XLIX, 2025, 15107 : 303 - 320
[8] Relation fusion propagation network for transductive few-shot learning
Huang, Yixiang
Hao, Hongyu
Ge, Weichao
Cao, Yang
Wu, Ming
Zhang, Chuang
Guo, Jun
PATTERN RECOGNITION, 2024, 151
[9] Transductive Relation-Propagation Network for Few-shot Learning
Ma, Yuqing
Bai, Shihao
An, Shan
Liu, Wei
Liu, Aishan
Zhen, Xiantong
Liu, Xianglong
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 804 - 810
[10] VDARN: Video Disentangling Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
Su, Yong
Xing, Meng
An, Simin
Peng, Weilong
Feng, Zhiyong
AD HOC NETWORKS, 2021, 113

← 1 2 3 4 5 →