Temporal-Viewpoint Transportation Plan for Skeletal Few-Shot Action Recognition

被引:10
|
作者
Wang, Lei [1 ,2 ]
Koniusz, Piotr [1 ,2 ]
机构
[1] Australian Natl Univ, Canberra, Australia
[2] Data61 CSIRO, Sydney, Australia
来源
关键词
EXAMPLE;
D O I
10.1007/978-3-031-26316-3_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE). To factor out misalignment between query and support sequences of 3D body joints, we propose an advanced variant of Dynamic Time Warping which jointly models each smooth path between the query and support frames to achieve simultaneously the best alignment in the temporal and simulated camera viewpoint spaces for end-to-end learning under the limited few-shot training data. Sequences are encoded with a temporal block encoder based on Simple Spectral Graph Convolution, a lightweight linear Graph Neural Network backbone. We also include a setting with a transformer. Finally, we propose a similarity-based loss which encourages the alignment of sequences of the same class while preventing the alignment of unrelated sequences. We show state-of-the-art results on NTU-60, NTU-120, Kinetics-skeleton and UWA3D Multiview Activity II.
引用
收藏
页码:307 / 326
页数:20
相关论文
共 50 条
  • [41] HyRSM plus plus : Hybrid relation guided temporal set matching for few-shot action recognition
    Wang, Xiang
    Zhang, Shiwei
    Qing, Zhiwu
    Zuo, Zhengrong
    Gao, Changxin
    Jin, Rong
    Sang, Nong
    PATTERN RECOGNITION, 2024, 147
  • [42] Adversarial Style Mixup and Improved Temporal Alignment for Cross-Domain Few-Shot Action Recognition
    Cao, Kaiyan
    Peng, Jiawen
    Chen, Jiaxin
    Hou, Xinyuan
    Ma, Andy J.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 255
  • [43] Multi-level alignment for few-shot temporal action localization
    Keisham, Kanchan
    Jalali, Amin
    Kim, Jonghong
    Lee, Minho
    INFORMATION SCIENCES, 2023, 650
  • [44] Hybrid Relation Guided Set Matching for Few-shot Action Recognition
    Wang, Xiang
    Zhang, Shiwei
    Qing, Zhiwu
    Tang, Mingqian
    Zuo, Zhengrong
    Gao, Changxin
    Jin, Rong
    Sang, Nong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19916 - 19925
  • [45] Cross-domain few-shot action recognition with unlabeled videos
    Wang, Xiang
    Zhang, Shiwei
    Qing, Zhiwu
    Lv, Yiliang
    Gao, Changxin
    Sang, Nong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
  • [46] Few-shot action recognition using task-adaptive parameters
    Zong, Pengcheng
    Chen, Peng
    Yu, Tianwei
    Yan, Lingqiang
    Huan, Ruohong
    ELECTRONICS LETTERS, 2021, 57 (22) : 848 - 850
  • [47] Multidimensional Prototype Refactor Enhanced Network for Few-Shot Action Recognition
    Liu, Shuwen
    Jiang, Min
    Kong, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6955 - 6966
  • [48] CLIP-guided Prototype Modulating for Few-shot Action Recognition
    Wang, Xiang
    Zhang, Shiwei
    Cen, Jun
    Gao, Changxin
    Zhang, Yingya
    Zhao, Deli
    Sang, Nong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (06) : 1899 - 1912
  • [49] Few-shot learning for ear recognition
    Zhang, Jie
    Yu, Wen
    Yang, Xudong
    Deng, Fang
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 50 - 54
  • [50] Multimodal Prototype-Enhanced Network for Few-Shot Action Recognition
    Ni, Xinzhe
    Liu, Yong
    Wen, Hao
    Ji, Yatai
    Xiao, Jing
    Yang, Yujiu
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1 - 10