Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

被引:0
作者
Lin, Lilang [1 ]
Wu, Lehong [1 ]
Zhang, Jiahang [1 ]
Wang, Jiaying [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
来源
COMPUTER VISION - ECCV 2024, PT XXVI | 2025年 / 15084卷
基金
中国国家自然科学基金;
关键词
Self-supervised learning; skeleton-based action recognition; contrastive learning;
D O I
10.1007/978-3-031-73347-5_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative models, as a powerful technique for generation, also gradually become a critical tool for recognition tasks. However, in skeleton-based action recognition, the features obtained from existing pre-trained generative methods contain redundant information unrelated to recognition, which contradicts the nature of the skeleton's spatially sparse and temporally consistent properties, leading to undesirable performance. To address this challenge, we make efforts to bridge the gap in theory and methodology and propose a novel skeleton-based idempotent generative model (IGM) for unsupervised representation learning. More specifically, we first theoretically demonstrate the equivalence between generative models and maximum entropy coding, which demonstrates a potential route that makes the features of generative models more compact by introducing contrastive learning. To this end, we introduce the idempotency constraint to form a stronger consistency regularization in the feature space, to push the features only to maintain the critical information of motion semantics for the recognition task. Our extensive experiments on benchmark datasets, NTU RGB+D and PKUMMD, demonstrate the effectiveness of our proposed method. On the NTU 60 xsub dataset, we observe a performance improvement from 84.6% to 86.2%. Furthermore, in zero-shot adaptation scenarios, our model demonstrates significant efficacy by achieving promising results in cases that were previously unrecognizable. Our project is available at https://github.com/LanglandsLin/IGM.
引用
收藏
页码:75 / 92
页数:18
相关论文
共 50 条
  • [31] Skeleton-based Action Recognition via Adaptive Cross-Form Learning
    Wang, Xuanhan
    Dai, Yan
    Gao, Lianli
    Song, Jingkuan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1670 - 1678
  • [32] Temporal-masked skeleton-based action recognition with supervised contrastive learning
    Zhifeng Zhao
    Guodong Chen
    Yuxiang Lin
    Signal, Image and Video Processing, 2023, 17 : 2267 - 2275
  • [33] CdCLR: Clip- Driven Contrastive Learning for Skeleton-Based Action Recognition
    Gao, Rong
    Liu, Xin
    Yang, Jingyu
    Yue, Huanjing
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [34] Fully Attentional Network for Skeleton-Based Action Recognition
    Liu, Caifeng
    Zhou, Hongcheng
    IEEE ACCESS, 2023, 11 : 20478 - 20485
  • [35] Hierarchical Aggregated Graph Neural Network for Skeleton-Based Action Recognition
    Geng, Pei
    Lu, Xuequan
    Li, Wanqing
    Lyu, Lei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 11003 - 11017
  • [36] Hypergraph Neural Network for Skeleton-Based Action Recognition
    Hao, Xiaoke
    Li, Jie
    Guo, Yingchun
    Jiang, Tao
    Yu, Ming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2263 - 2275
  • [37] Skeleton-based Action Recognition for Industrial Packing Process
    Chen, Zhenhui
    Hu, Haiyang
    Li, Zhongjin
    Qi, Xingchen
    Zhang, Haiping
    Hu, Hua
    Chang, Victor
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 36 - 45
  • [38] Memory Attention Networks for Skeleton-Based Action Recognition
    Li, Ce
    Xie, Chunyu
    Zhang, Baochang
    Han, Jungong
    Zhen, Xiantong
    Chen, Jie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4800 - 4814
  • [39] Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition
    Shu, Xiangbo
    Xu, Binqian
    Zhang, Liyan
    Tang, Jinhui
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7559 - 7576
  • [40] ACL-SAR: model agnostic adversarial contrastive learning for robust skeleton-based action recognition
    Zhu, Jiaxuan
    Shao, Ming
    Sun, Libo
    Xia, Siyu
    VISUAL COMPUTER, 2025, 41 (04) : 2495 - 2510