Bayesian Contrastive Learning with Manifold Regularization for Self-Supervised Skeleton Based Action Recognition

被引:0
|
作者
Lin, Lilang [1 ]
Zhang, Jiahang [1 ]
Liu, Jiaying [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
skeleton based action recognition; contrastive learning; bayesian neural network; self-supervised learning;
D O I
10.1109/ISCAS46773.2023.10181797
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address skeleton-based action recognition under the self-supervised setting. We propose a novel framework Bayesian Contrastive Learning with Manifold Regularization (BCLR). In Bayesian contrastive learning, we employ Monte Carlo Dropout sampling on the adjacency matrix of the skeleton data to obtain positive/negative samples for model robustness. A novel entropy-based memory bank updating strategy is further proposed to take full advantage of hard negative samples for better separability. The feature manifold regularization, including projection-based data reconstruction and similarity-based feature decoupling, on the other hand, is designed to extract comprehensive information to avoid overfitting and increase feature diversity to prevent a collapse of the model. With Bayesian contrastive learning and feature manifold regularization, our model learns stronger and more discriminative features. Extensive experiments on NTU RGB+D and PKUMMD show that the proposed method achieves remarkable action recognition performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Modeling the Relative Visual Tempo for Self-supervised Skeleton-based Action Recognition
    Zhu, Yisheng
    Han, Hu
    Yu, Zhengtao
    Liu, Guangcan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13867 - 13876
  • [32] A puzzle questions form training for self-supervised skeleton-based action recognition
    Moutik, Oumaima
    Sekkat, Hiba
    Tchakoucht, Taha Ait
    El Kari, Badr
    Alaoui, Ahmed El Hilali
    IMAGE AND VISION COMPUTING, 2024, 148
  • [33] MS2L: Multi-Task Self-Supervised Learning for Skeleton Based Action Recognition
    Lin, Lilang
    Song, Sijie
    Yang, Wenhan
    Liu, Jiaying
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2490 - 2498
  • [34] Localized Linear Temporal Dynamics for Self-Supervised Skeleton Action Recognition
    Wang, Xinghan
    Mu, Yadong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10189 - 10199
  • [35] Self-supervised group meiosis contrastive learning for EEG-based emotion recognition
    Haoning Kan
    Jiale Yu
    Jiajin Huang
    Zihe Liu
    Heqian Wang
    Haiyan Zhou
    Applied Intelligence, 2023, 53 : 27207 - 27225
  • [36] Contrastive Self-Supervised Learning for Sensor-Based Human Activity Recognition: A Review
    Chen, Hui
    Gouin-Vallerand, Charles
    Bouchard, Kevin
    Gaboury, Sebastien
    Couture, Melanie
    Bier, Nathalie
    Giroux, Sylvain
    IEEE ACCESS, 2024, 12 : 152511 - 152531
  • [37] Self-supervised group meiosis contrastive learning for EEG-based emotion recognition
    Kan, Haoning
    Yu, Jiale
    Huang, Jiajin
    Liu, Zihe
    Wang, Heqian
    Zhou, Haiyan
    APPLIED INTELLIGENCE, 2023, 53 (22) : 27207 - 27225
  • [38] Self-Supervised Learning for Action Recognition by Video Denoising
    Thi Thu Trang Phung
    Thi Hong Thu Ma
    Van Truong Nguyen
    Duc Quang Vu
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 76 - 81
  • [39] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Bi, Shuai
    Hu, Zhengping
    Zhao, Mengyao
    Zhang, Hehao
    Di, Jirui
    Sun, Zhe
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3775 - 3782
  • [40] Self-supervised pretext task collaborative multi-view contrastive learning for video action recognition
    Shuai Bi
    Zhengping Hu
    Mengyao Zhao
    Hehao Zhang
    Jirui Di
    Zhe Sun
    Signal, Image and Video Processing, 2023, 17 : 3775 - 3782