Temporal-Viewpoint Transportation Plan for Skeletal Few-Shot Action Recognition

被引:10
|
作者
Wang, Lei [1 ,2 ]
Koniusz, Piotr [1 ,2 ]
机构
[1] Australian Natl Univ, Canberra, Australia
[2] Data61 CSIRO, Sydney, Australia
来源
关键词
EXAMPLE;
D O I
10.1007/978-3-031-26316-3_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE). To factor out misalignment between query and support sequences of 3D body joints, we propose an advanced variant of Dynamic Time Warping which jointly models each smooth path between the query and support frames to achieve simultaneously the best alignment in the temporal and simulated camera viewpoint spaces for end-to-end learning under the limited few-shot training data. Sequences are encoded with a temporal block encoder based on Simple Spectral Graph Convolution, a lightweight linear Graph Neural Network backbone. We also include a setting with a transformer. Finally, we propose a similarity-based loss which encourages the alignment of sequences of the same class while preventing the alignment of unrelated sequences. We show state-of-the-art results on NTU-60, NTU-120, Kinetics-skeleton and UWA3D Multiview Activity II.
引用
收藏
页码:307 / 326
页数:20
相关论文
共 50 条
  • [21] Advances in Few-Shot Action Recognition: A Comprehensive Review
    Ruan, Zanxi
    Wei, Yingmei
    Yuan, Yifei
    Li, Yu
    Guo, Yanming
    Xie, Yuxiang
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 390 - 398
  • [22] Semantic-guided spatio-temporal attention for few-shot action recognition
    Jianyu Wang
    Baolin Liu
    Applied Intelligence, 2024, 54 : 2458 - 2471
  • [23] Motion-modulated Temporal Fragment Alignment Network For Few-Shot Action Recognition
    Wu, Jiamin
    Zhang, Tianzhu
    Zhang, Zhe
    Wu, Feng
    Zhang, Yongdong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9141 - 9150
  • [24] Semantic-guided spatio-temporal attention for few-shot action recognition
    Wang, Jianyu
    Liu, Baolin
    APPLIED INTELLIGENCE, 2024, 54 (03) : 2458 - 2471
  • [25] A Generative Approach to Zero-Shot and Few-Shot Action Recognition
    Mishra, Ashish
    Verma, Vinay Kumar
    Reddy, M. Shiva Krishna
    Arulkumar, S.
    Rai, Piyush
    Mittal, Anurag
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 372 - 380
  • [26] Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition
    Liu, Huabin
    Lv, Weixian
    See, John
    Lin, Weiyao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6230 - 6240
  • [27] Two-Stream Temporal Feature Aggregation Based on Clustering for Few-Shot Action Recognition
    Deng, Long
    Li, Ao
    Zhou, Bingxin
    Ge, Yongxin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2435 - 2439
  • [28] Joint image-instance spatial-temporal attention for few-shot action recognition
    Qian, Zefeng
    Zhang, Chongyang
    Huang, Yifei
    Wang, Gang
    Ying, Jiangyong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 254
  • [29] Active Exploration of Multimodal Complementarity for Few-Shot Action Recognition
    Wanyan, Yuyang
    Yang, Xiaoshan
    Chen, Chaofan
    Xu, Changsheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6492 - 6502
  • [30] VISUAL TEMPO CONTRASTIVE LEARNING FOR FEW-SHOT ACTION RECOGNITION
    Wang, Guangge
    Ye, Weirong
    Wang, Xiao
    Jin, Rongrong
    Wang, Hanzi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1096 - 1100