Focalized contrastive view-invariant learning for self-supervised skeleton-based action recognition

Cited by: 13
Authors
Men, Qianhui [1, 2]
Ho, Edmond S. L. [3]
Shum, Hubert P. H. [4]
Leung, Howard [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England
[3] Univ Glasgow, Sch Comp Sci, Glasgow G12 8RZ, Scotland
[4] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
Keywords
Self-supervised learning; Skeleton-based action recognition; Contrastive learning
DOI
10.1016/j.neucom.2023.03.070
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses the view-specific information on the representation space where the viewpoints are coarsely aligned. By maximizing mutual information with an effective contrastive loss between multi-view sample pairs, FoCoViL associates actions with common view-invariant properties and simultaneously separates the dissimilar ones. We further propose an adaptive focalization method based on pairwise similarity to enhance contrastive learning for a clearer cluster boundary in the learned space. Different from many existing self-supervised representation learning works that rely heavily on supervised classifiers, FoCoViL performs well on both unsupervised and supervised classifiers with superior recognition performance. Extensive experiments also show that the proposed contrastive-based focalization generates a more discriminative latent representation. © 2023 Elsevier B.V. All rights reserved.
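The abstract does not give the loss in closed form, so the following is only a minimal sketch of the general idea it describes: a contrastive (InfoNCE-style) loss over multi-view pairs, with a focal-style weight derived from pairwise similarity. The function names, the cosine similarity, the `temperature` and `gamma` parameters, and the `(1 - p) ** gamma` weighting are all assumptions made for illustration, not the FoCoViL formulation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two embedding vectors (assumed similarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def focal_info_nce(view_a, view_b, temperature=0.1, gamma=2.0):
    """Hypothetical focal-weighted InfoNCE loss over paired multi-view embeddings.

    view_a[i] and view_b[i] are assumed to embed the same action seen from two
    viewpoints (a positive pair); view_b[j] with j != i serves as a negative.
    """
    n = len(view_a)
    total = 0.0
    for i in range(n):
        logits = [cosine(view_a[i], view_b[j]) / temperature for j in range(n)]
        # Numerically stable log-softmax of the positive pair.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(x - m) for x in logits))
        log_p = logits[i] - log_denom
        p = math.exp(log_p)
        # Focal-style weight (assumption): pairs that are already well matched
        # (p close to 1) contribute less, hard pairs contribute more.
        weight = (1.0 - p) ** gamma
        total += -weight * log_p
    return total / n


# Toy embeddings: two actions, each observed from two viewpoints.
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]      # same action stays close across views
misaligned = [[0.1, 0.9], [0.9, 0.1]]   # views disagree about the action

loss_aligned = focal_info_nce(anchors, aligned)
loss_misaligned = focal_info_nce(anchors, misaligned)
```

With `gamma=0` the weight is 1 and the function reduces to a plain InfoNCE loss, which makes it easy to compare the two behaviours on the same data.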
Pages: 198-209
Page count: 12