Skeleton-based Action Recognition via Adaptive Cross-Form Learning

被引：14

作者：

Wang, Xuanhan ^{[1
]}

Dai, Yan ^{[1
]}

Gao, Lianli ^{[2
]}

Song, Jingkuan ^{[2
,3
]}

机构：

[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Peoples R China

[2] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen, Peoples R China

[3] Peng Cheng Lab, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年

基金：

中国国家自然科学基金;

关键词：

skeleton-based action recognition; adaptive cross-form learning;

D O I：

10.1145/3503161.3547811

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Skeleton-based action recognition aims to project skeleton sequences to action categories, where skeleton sequences are derived from multiple forms of pre-detected points. Compared with earlier methods that focus on exploring single-form skeletons via Graph Convolutional Networks (GCNs), existing methods tend to improve GCNs by leveraging multi-form skeletons due to their complementary cues. However, these methods (either adapting structure of GCNs or model ensemble) require the co-existence of all skeleton forms during both training and inference stages, while a typical situation in real life is the existence of only partial forms for inference. To tackle this, we present Adaptive Cross-Form Learning (ACFL), which empowers well-designed GCNs to generate complementary representation from single-form skeletons without changing model capacity. Specifically, each GCN model in ACFL not only learns action representation from the single-form skeletons, but also adaptively mimics useful representations derived from other forms of skeletons. In this way, each GCN can learn how to strengthen what has been learned, thus exploiting model potential and facilitating action recognition as well. Extensive experiments conducted on three challenging benchmarks, i.e., NTU-RGB+D 120, NTU-RGB+D 60 and UAV-Human, demonstrate the effectiveness and generalizability of our method. Specifically, the ACFL significantly improves various GCN models (i.e., CTR-GCN, MS-G3D, and Shift-GCN), achieving a new record for skeleton-based action recognition.

引用

页码：1670 / 1678

页数：9

共 50 条

[1] Cross-Scale Spatiotemporal Refinement Learning for Skeleton-Based Action Recognition
Zhang, Yu
Sun, Zhonghua
Dai, Meng
Feng, Jinchao
Jia, Kebin
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 441 - 445
[2] Skeleton-based action recognition based on multidimensional adaptive convolutional network
Xia, Yu
Gao, Qingyuan
Wu, Weiguan
Cao, Yi
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
[3] EnsCLR: Unsupervised skeleton-based action recognition via ensemble contrastive learning of representation
Wang, Kun
Cao, Jiuxin
Cao, Biwei
Liu, Bo
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 247
[4] Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition
Lin, Lilang
Wu, Lehong
Zhang, Jiahang
Wang, Jiaying
COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 75 - 92
[5] Adaptive multi-level graph convolution with contrastive learning for skeleton-based action recognition
Geng, Pei
Li, Haowei
Wang, Fuyun
Lyu, Lei
SIGNAL PROCESSING, 2022, 201
[6] Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Tang, Yansong
Liu, Xingyu
Yu, Xumin
Zhang, Danyang
Lu, Jiwen
Zhou, Jie
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
[7] Adaptive Pitfall: Exploring the Effectiveness of Adaptation in Skeleton-Based Action Recognition
Miao, Qiguang
Xin, Wentian
Liu, Ruyi
Liu, Yi
Wu, Mengyao
Shi, Cheng
Pun, Chi-Man
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 56 - 71
[8] Cross-stream contrastive learning for self-supervised skeleton-based action recognition
Li, Ding
Tang, Yongqiang
Zhang, Zhizhong
Zhang, Wensheng
IMAGE AND VISION COMPUTING, 2023, 135
[9] AL-SAR: Active Learning for Skeleton-Based Action Recognition
Li, Jingyuan
Le, Trung
Shlizerman, Eli
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16966 - 16974
[10] Optimized Skeleton-based Action Recognition via Sparsified Graph Regression
Gao, Xiang
Hu, Wei
Tang, Jiaxiang
Liu, Jiaying
Guo, Zongming
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 601 - 610

← 1 2 3 4 5 →