Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Cited by: 0

Authors:

Lin, Lilang [1]

Wu, Lehong [1]

Zhang, Jiahang [1]

Wang, Jiaying [1]

Affiliations:

[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China

Source:

COMPUTER VISION - ECCV 2024, PT XXVI | 2025 / Vol. 15084

Funding:

National Natural Science Foundation of China;

Keywords:

Self-supervised learning; skeleton-based action recognition; contrastive learning;

DOI:

10.1007/978-3-031-73347-5_5

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Generative models, a powerful technique for generation, have also gradually become a critical tool for recognition tasks. However, in skeleton-based action recognition, the features obtained from existing pre-trained generative methods contain redundant information unrelated to recognition, which contradicts the spatially sparse and temporally consistent nature of skeletons and leads to undesirable performance. To address this challenge, we bridge the gap in theory and methodology and propose a novel skeleton-based idempotent generative model (IGM) for unsupervised representation learning. More specifically, we first theoretically demonstrate the equivalence between generative models and maximum entropy coding, which reveals a potential route to making the features of generative models more compact by introducing contrastive learning. To this end, we introduce the idempotency constraint to form a stronger consistency regularization in the feature space, pushing the features to retain only the critical information of motion semantics for the recognition task. Our extensive experiments on the benchmark datasets NTU RGB+D and PKU-MMD demonstrate the effectiveness of our proposed method. On the NTU 60 xsub dataset, we observe a performance improvement from 84.6% to 86.2%. Furthermore, in zero-shot adaptation scenarios, our model demonstrates significant efficacy by achieving promising results in cases that were previously unrecognizable. Our project is available at https://github.com/LanglandsLin/IGM.


Pages: 75-92

Page count: 18


