Focalized contrastive view-invariant learning for self-supervised skeleton-based action recognition

被引:13
|
作者
Men, Qianhui [1 ,2 ]
Ho, Edmond S. L. [3 ]
Shum, Hubert P. H. [4 ]
Leung, Howard [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England
[3] Univ Glasgow, Sch Comp Sci, Glasgow G12 8RZ, Scotland
[4] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
关键词
Self -supervised learning; Skeleton -based action recognition; Contrastive learning;
D O I
10.1016/j.neucom.2023.03.070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning view-invariant representation is a key to improving feature discrimination power for skeleton -based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses the view-specific information on the representation space where the viewpoints are coarsely aligned. By maximizing mutual information with an effective contrastive loss between multi-view sample pairs, FoCoViL associates actions with common view-invariant properties and simultaneously separates the dissimilar ones. We further propose an adaptive focalization method based on pairwise similarity to enhance contrastive learning for a clearer cluster boundary in the learned space. Different from many existing self-supervised representation learning work that rely heavily on supervised classifiers, FoCoViL performs well on both unsupervised and supervised classifiers with superior recognition perfor-mance. Extensive experiments also show that the proposed contrastive-based focalization generates a more discriminative latent representation.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:198 / 209
页数:12
相关论文
共 50 条
  • [41] Self-Supervised Contrastive Learning for Radar-Based Human Activity Recognition
    Rahman, Mohammad Mahbubur
    Gurbuz, Sevgi Zubeyde
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [42] Image classification framework based on contrastive self-supervised learning
    Zhao H.-W.
    Zhang J.-R.
    Zhu J.-P.
    Li H.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (08): : 1850 - 1856
  • [43] Temporal Contrastive Learning for Sensor-Based Human Activity Recognition: A Self-Supervised Approach
    Chen, Xiaobing
    Zhou, Xiangwei
    Sun, Mingxuan
    Wang, Hao
    IEEE SENSORS JOURNAL, 2025, 25 (01) : 1839 - 1850
  • [44] Micro-expression recognition using dual-view self-supervised contrastive learning with intensity perception
    Li, Jingting
    Zhou, Haoliang
    Qian, Yu
    Dong, Zizhao
    Wang, Su-Jing
    NEUROCOMPUTING, 2025, 619
  • [45] Collaboratively Self-Supervised Video Representation Learning for Action Recognition
    Zhang, Jie
    Wan, Zhifan
    Hu, Lanqing
    Lin, Stephen
    Wu, Shuzhe
    Shan, Shiguang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 1895 - 1907
  • [46] Self-Supervised Action Representation Learning Based on Asymmetric Skeleton Data Augmentation
    Zhou, Hualing
    Li, Xi
    Xu, Dahong
    Liu, Hong
    Guo, Jianping
    Zhang, Yihan
    SENSORS, 2022, 22 (22)
  • [47] JointContrast: Skeleton-Based Interaction Recognition with New Representation and Contrastive Learning
    Zhang, Ji
    Jia, Xiangze
    Wang, Zhen
    Luo, Yonglong
    Chen, Fulong
    Yang, Gaoming
    Zhao, Lihui
    ALGORITHMS, 2023, 16 (04)
  • [48] Sleep Stage Classification Via Multi-View Based Self-Supervised Contrastive Learning of EEG
    Zhao, Chen
    Wu, Wei
    Zhang, Haoyi
    Zhang, Ruiyan
    Zheng, Xinyue
    Kong, Xiangzeng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (12) : 7068 - 7077
  • [49] MS2L: Multi-Task Self-Supervised Learning for Skeleton Based Action Recognition
    Lin, Lilang
    Song, Sijie
    Yang, Wenhan
    Liu, Jiaying
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2490 - 2498
  • [50] Malicious Domain Detection Based on Self-supervised HGNNs with Contrastive Learning
    Li, Zhiping
    Yuan, Fangfang
    Cao, Cong
    Su, Majing
    Lu, Yuhai
    Liu, Yanbing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 62 - 73