Semantic-Guided Relation Propagation Network for Few-shot Action Recognition

被引:21
|
作者
Wang, Xiao [1 ]
Ye, Weirong [1 ]
Qi, Zhongang [2 ]
Zhao, Xun [2 ]
Wang, Guangge [1 ]
Shan, Ying [2 ]
Wang, Hanzi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China
[2] Tencent PCG, Appl Res Ctr ARC, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
关键词
Few-shot action recognition; Semantic information; Supervisory; signal; Spatial-temporal difference;
D O I
10.1145/3474085.3475253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot action recognition has drawn growing attention as it can recognize novel action classes by using only a few labeled samples. In this paper, we propose a novel semantic-guided relation propagation network (SRPN), which leverages semantic information together with visual information for few-shot action recognition. Different from most previous works that neglect semantic information in the labeled data, our SRPN directly utilizes the semantic label as an additional supervisory signal to improve the generalization ability of the network. Besides, we treat the relation of each visual-semantic pair as a relational node, and we use a graph convolutional network to model and propagate such sample relations across visual-semantic pairs, including both intra-class commonality and inter-class uniqueness, to guide the relation propagation in the graph. However, since videos contain crucial sequences and ordering information, we propose a novel spatial-temporal difference module, which can facilitate the network to enhance the visual feature learning ability at both feature level and granular level for videos. Extensive experiments conducted on several challenging benchmarks demonstrate that our SRPN outperforms several state-of-the-art methods with a significant margin.
引用
收藏
页码:816 / 825
页数:10
相关论文
共 50 条
  • [1] Semantic-guided spatio-temporal attention for few-shot action recognition
    Jianyu Wang
    Baolin Liu
    Applied Intelligence, 2024, 54 : 2458 - 2471
  • [2] Semantic-guided spatio-temporal attention for few-shot action recognition
    Wang, Jianyu
    Liu, Baolin
    APPLIED INTELLIGENCE, 2024, 54 (03) : 2458 - 2471
  • [3] Few-Shot Human-Object Interaction Recognition With Semantic-Guided Attentive Prototypes Network
    Ji, Zhong
    Liu, Xiyao
    Pang, Yanwei
    Ouyang, Wangli
    Li, Xuelong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (1648-1661) : 1648 - 1661
  • [4] Hybrid Relation Guided Set Matching for Few-shot Action Recognition
    Wang, Xiang
    Zhang, Shiwei
    Qing, Zhiwu
    Tang, Mingqian
    Zuo, Zhengrong
    Gao, Changxin
    Jin, Rong
    Sang, Nong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19916 - 19925
  • [5] SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition
    Ji, Zhong
    Liu, Xiyao
    Pang, Yanwei
    Li, Xuelong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11085 - 11092
  • [6] Knowledge-Guided Semantic Transfer Network for Few-Shot Image Recognition
    Li, Zechao
    Tang, Hao
    Peng, Zhimao
    Qi, Guo-Jun
    Tang, Jinhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [7] Semantic-Guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift
    Xiao, Kangyu
    Wang, Zilei
    Li, Junjie
    COMPUTER VISION - ECCV 2024, PT XLIX, 2025, 15107 : 303 - 320
  • [8] Relation fusion propagation network for transductive few-shot learning
    Huang, Yixiang
    Hao, Hongyu
    Ge, Weichao
    Cao, Yang
    Wu, Ming
    Zhang, Chuang
    Guo, Jun
    PATTERN RECOGNITION, 2024, 151
  • [9] Transductive Relation-Propagation Network for Few-shot Learning
    Ma, Yuqing
    Bai, Shihao
    An, Shan
    Liu, Wei
    Liu, Aishan
    Zhen, Xiantong
    Liu, Xianglong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 804 - 810
  • [10] VDARN: Video Disentangling Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
    Su, Yong
    Xing, Meng
    An, Simin
    Peng, Weilong
    Feng, Zhiyong
    AD HOC NETWORKS, 2021, 113