Temporal-Viewpoint Transportation Plan for Skeletal Few-Shot Action Recognition

Cited by: 10
Authors
Wang, Lei [1 ,2 ]
Koniusz, Piotr [1 ,2 ]
Affiliations
[1] Australian Natl Univ, Canberra, Australia
[2] Data61 CSIRO, Sydney, Australia
Source
Keywords
EXAMPLE;
DOI
10.1007/978-3-031-26316-3_19
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE). To factor out misalignment between query and support sequences of 3D body joints, we propose an advanced variant of Dynamic Time Warping which jointly models each smooth path between the query and support frames to achieve simultaneously the best alignment in the temporal and simulated camera viewpoint spaces for end-to-end learning under the limited few-shot training data. Sequences are encoded with a temporal block encoder based on Simple Spectral Graph Convolution, a lightweight linear Graph Neural Network backbone. We also include a setting with a transformer. Finally, we propose a similarity-based loss which encourages the alignment of sequences of the same class while preventing the alignment of unrelated sequences. We show state-of-the-art results on NTU-60, NTU-120, Kinetics-skeleton and UWA3D Multiview Activity II.
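The idea of aligning along time and simulated camera viewpoint at once can be illustrated with a small dynamic program. The sketch below is not the authors' implementation: it assumes a hard minimum (the paper's end-to-end training would need a differentiable variant), squared Euclidean frame distances, precomputed per-viewpoint query features, and a smoothness rule that lets the viewpoint index change by at most one per step. The function name `joint_dtw` and the array layout are hypothetical.

```python
import numpy as np

def joint_dtw(query, support):
    """Toy joint temporal-viewpoint alignment cost (illustrative assumption).

    query:   (Tq, V, d) features of Tq query frames under V simulated viewpoints
    support: (Ts, d)    features of Ts support frames
    """
    Tq, V, _ = query.shape
    Ts = support.shape[0]

    # cost[t, s, v]: squared distance between query frame t seen from
    # viewpoint v and support frame s.
    diff = query[:, None, :, :] - support[None, :, None, :]
    cost = (diff ** 2).sum(axis=-1)  # shape (Tq, Ts, V)

    # D[t, s, v]: best accumulated cost of a path ending at (t, s, v).
    D = np.full((Tq + 1, Ts + 1, V), np.inf)
    D[0, 0, :] = 0.0  # a path may start at any viewpoint

    for t in range(1, Tq + 1):
        for s in range(1, Ts + 1):
            for v in range(V):
                # Smoothness: the previous viewpoint index differs by at most 1.
                lo, hi = max(0, v - 1), min(V, v + 2)
                prev = min(
                    D[t - 1, s, lo:hi].min(),      # advance query only
                    D[t, s - 1, lo:hi].min(),      # advance support only
                    D[t - 1, s - 1, lo:hi].min(),  # advance both
                )
                D[t, s, v] = cost[t - 1, s - 1, v] + prev

    # The path may end at any viewpoint.
    return D[Tq, Ts, :].min()
```

With a single viewpoint (V = 1) this reduces to classic Dynamic Time Warping; the extra index only adds the freedom to drift smoothly between neighbouring viewpoints along the path.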
Pages: 307-326
Number of pages: 20
Related papers
50 in total
  • [31] Commonsense Knowledge Prompting for Few-Shot Action Recognition in Videos
    Shi, Yuheng
    Wu, Xinxiao
    Lin, Hanxi
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8395 - 8405
  • [32] Exploring the Adaptation Strategy of CLIP for Few-Shot Action Recognition
    Cao, Congqi
    Zhang, Yueran
    Lv, Qinyi
    Min, Lingtong
    Zhang, Yanning
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON EFFICIENT MULTIMEDIA COMPUTING UNDER LIMITED RESOURCES, EMCLR 2024, 2024, : 39 - 48
  • [33] FedFSLAR: A Federated Learning Framework for Few-shot Action Recognition
    Nguyen Anh Tu
    Abu, Assanali
    Aikyn, Nartay
    Makhanov, Nursultan
    Lee, Min-Ho
    Khiem Le-Huy
    Wong, Kok-Seng
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 270 - 279
  • [34] Few-Shot Action Recognition with A Transductive Maximum Margin Classifier
    Pan, Fei
    Guo, Jie
    Guo, Yanwen
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [35] Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning
    Zheng, Sipeng
    Chen, Shizhe
    Jin, Qin
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 297 - 313
  • [36] Hybrid attentive prototypical network for few-shot action recognition
    Ruan, Zanxi
    Wei, Yingmei
    Guo, Yanming
    Xie, Yuxiang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 8249 - 8272
  • [37] Revisiting Few-Shot Compositional Action Recognition With Knowledge Calibration
    Huang, Peng
    Qu, Hongyu
    Shu, Xiangbo
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1216 - 1220
  • [38] Cross-modal guides spatio-temporal enrichment network for few-shot action recognition
    Chen, Zhiwen
    Yang, Yi
    Li, Li
    Li, Min
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11196 - 11211
  • [39] FTAN: Frame-to-frame temporal alignment network with contrastive learning for few-shot action recognition
    Yu, Bin
    Hou, Yonghong
    Guo, Zihui
    Gao, Zhiyi
    Li, Yueyang
    IMAGE AND VISION COMPUTING, 2024, 149
  • [40] A Novel Few-Shot Action Recognition Method: Temporal Relational CrossTransformers Based on Image Difference Pyramid
    Ding, Yihang
    Liu, Youyuan
    IEEE ACCESS, 2022, 10 : 94536 - 94544