Skeleton-based Action Recognition via Adaptive Cross-Form Learning

被引:14
|
作者
Wang, Xuanhan [1 ]
Dai, Yan [1 ]
Gao, Lianli [2 ]
Song, Jingkuan [2 ,3 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
skeleton-based action recognition; adaptive cross-form learning;
D O I
10.1145/3503161.3547811
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Skeleton-based action recognition aims to project skeleton sequences to action categories, where skeleton sequences are derived from multiple forms of pre-detected points. Compared with earlier methods that focus on exploring single-form skeletons via Graph Convolutional Networks (GCNs), existing methods tend to improve GCNs by leveraging multi-form skeletons due to their complementary cues. However, these methods (either adapting structure of GCNs or model ensemble) require the co-existence of all skeleton forms during both training and inference stages, while a typical situation in real life is the existence of only partial forms for inference. To tackle this, we present Adaptive Cross-Form Learning (ACFL), which empowers well-designed GCNs to generate complementary representation from single-form skeletons without changing model capacity. Specifically, each GCN model in ACFL not only learns action representation from the single-form skeletons, but also adaptively mimics useful representations derived from other forms of skeletons. In this way, each GCN can learn how to strengthen what has been learned, thus exploiting model potential and facilitating action recognition as well. Extensive experiments conducted on three challenging benchmarks, i.e., NTU-RGB+D 120, NTU-RGB+D 60 and UAV-Human, demonstrate the effectiveness and generalizability of our method. Specifically, the ACFL significantly improves various GCN models (i.e., CTR-GCN, MS-G3D, and Shift-GCN), achieving a new record for skeleton-based action recognition.
引用
收藏
页码:1670 / 1678
页数:9
相关论文
共 50 条
  • [1] Cross-Scale Spatiotemporal Refinement Learning for Skeleton-Based Action Recognition
    Zhang, Yu
    Sun, Zhonghua
    Dai, Meng
    Feng, Jinchao
    Jia, Kebin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 441 - 445
  • [2] Skeleton-based action recognition based on multidimensional adaptive convolutional network
    Xia, Yu
    Gao, Qingyuan
    Wu, Weiguan
    Cao, Yi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [3] EnsCLR: Unsupervised skeleton-based action recognition via ensemble contrastive learning of representation
    Wang, Kun
    Cao, Jiuxin
    Cao, Biwei
    Liu, Bo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 247
  • [4] Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition
    Lin, Lilang
    Wu, Lehong
    Zhang, Jiahang
    Wang, Jiaying
    COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 75 - 92
  • [5] Adaptive multi-level graph convolution with contrastive learning for skeleton-based action recognition
    Geng, Pei
    Li, Haowei
    Wang, Fuyun
    Lyu, Lei
    SIGNAL PROCESSING, 2022, 201
  • [6] Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
    Tang, Yansong
    Liu, Xingyu
    Yu, Xumin
    Zhang, Danyang
    Lu, Jiwen
    Zhou, Jie
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [7] Adaptive Pitfall: Exploring the Effectiveness of Adaptation in Skeleton-Based Action Recognition
    Miao, Qiguang
    Xin, Wentian
    Liu, Ruyi
    Liu, Yi
    Wu, Mengyao
    Shi, Cheng
    Pun, Chi-Man
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 56 - 71
  • [8] Cross-stream contrastive learning for self-supervised skeleton-based action recognition
    Li, Ding
    Tang, Yongqiang
    Zhang, Zhizhong
    Zhang, Wensheng
    IMAGE AND VISION COMPUTING, 2023, 135
  • [9] AL-SAR: Active Learning for Skeleton-Based Action Recognition
    Li, Jingyuan
    Le, Trung
    Shlizerman, Eli
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16966 - 16974
  • [10] Optimized Skeleton-based Action Recognition via Sparsified Graph Regression
    Gao, Xiang
    Hu, Wei
    Tang, Jiaxiang
    Liu, Jiaying
    Guo, Zongming
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 601 - 610