Focalized contrastive view-invariant learning for self-supervised skeleton-based action recognition

Cited by: 13
Authors
Men, Qianhui [1, 2]
Ho, Edmond S. L. [3]
Shum, Hubert P. H. [4]
Leung, Howard [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England
[3] Univ Glasgow, Sch Comp Sci, Glasgow G12 8RZ, Scotland
[4] Univ Durham, Dept Comp Sci, Durham DH1 3LE, England
Keywords
Self-supervised learning; Skeleton-based action recognition; Contrastive learning
DOI
10.1016/j.neucom.2023.03.070
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses the view-specific information on the representation space where the viewpoints are coarsely aligned. By maximizing mutual information with an effective contrastive loss between multi-view sample pairs, FoCoViL associates actions with common view-invariant properties and simultaneously separates the dissimilar ones. We further propose an adaptive focalization method based on pairwise similarity to enhance contrastive learning for a clearer cluster boundary in the learned space. Different from many existing self-supervised representation learning works that rely heavily on supervised classifiers, FoCoViL performs well on both unsupervised and supervised classifiers with superior recognition performance. Extensive experiments also show that the proposed contrastive-based focalization generates a more discriminative latent representation. (c) 2023 Elsevier B.V. All rights reserved.
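The abstract does not give the loss formulation, so the following is only an illustrative sketch of the general idea it describes: a contrastive objective over multi-view pairs, reweighted by pairwise similarity. It assumes a standard InfoNCE loss with a focal-loss-style modulating factor as a stand-in; the function names, `temperature`, and `gamma` are hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def focalized_info_nce(view_a, view_b, temperature=0.1, gamma=2.0):
    """Illustrative loss, NOT the paper's exact formulation.

    view_a[i] and view_b[i] are assumed to be embeddings of the same
    action seen from two viewpoints (a positive pair); every other
    view_b[j] acts as a negative for view_a[i].
    """
    n = len(view_a)
    total = 0.0
    for i in range(n):
        logits = [cosine(view_a[i], view_b[j]) / temperature for j in range(n)]
        # Numerically stable log-softmax of the positive pair.
        top = max(logits)
        log_denom = top + math.log(sum(math.exp(x - top) for x in logits))
        log_p = logits[i] - log_denom
        p = math.exp(log_p)
        # Assumed focal-style weight: pairs that are already well
        # separated (p near 1) contribute less than hard pairs.
        weight = (1.0 - p) ** gamma
        total += -weight * log_p
    return total / n


# Toy data: two actions, each embedded from two viewpoints.
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[1.0, 0.1], [0.1, 1.0]]    # matching actions share an index
shuffled = [[0.1, 1.0], [1.0, 0.1]]   # positives deliberately mismatched

loss_aligned = focalized_info_nce(anchors, aligned)
loss_shuffled = focalized_info_nce(anchors, shuffled)
```

With `gamma=0.0` the weight is always 1 and the function reduces to plain InfoNCE, which makes it easy to compare the two variants on the same embeddings.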
Pages: 198-209
Page count: 12
Related papers
50 in total
  • [21] Learning shape and motion representations for view invariant skeleton-based action recognition
    Li, Yanshan
    Xia, Rongjie
    Liu, Xing
    PATTERN RECOGNITION, 2020, 103 (103)
  • [22] JointContrast: Skeleton-Based Mutual Action Recognition with Contrastive Learning
    Jia, Xiangze
    Zhang, Ji
    Wang, Zhen
    Luo, Yonglong
    Chen, Fulong
    Xiao, Jing
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 478 - 489
  • [23] Masked cosine similarity prediction for self-supervised skeleton-based action representation learning
    Ziliang Ren
    Ronggui Liu
    Yong Qin
    Xiangyang Gao
    Qieshi Zhang
    Pattern Analysis and Applications, 2025, 28 (2)
  • [24] Part Aware Contrastive Learning for Self-Supervised Action Recognition
    Hua, Yilei
    Wu, Wenhan
    Zheng, Ce
    Lu, Aidong
    Liu, Mengyuan
    Chen, Chen
    Wu, Shiqian
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 855 - 863
  • [25] DIDA: Dynamic Individual-to-integrateD Augmentation for Self-supervised Skeleton-Based Action Recognition
    Hu, Haobo
    Li, Jianan
    Fan, Hongbin
    Zhao, Zhifu
    Zhou, Yangtao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 496 - 510
  • [26] DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition
    Guan, Shannan
    Yu, Xin
    Huang, Wei
    Fang, Gengfa
    Lu, Haiyan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 395 - 407
  • [27] Frequency Decoupled Masked Auto-Encoder for Self-Supervised Skeleton-Based Action Recognition
    Liu, Ye
    Shi, Tianhao
    Zhai, Mingliang
    Liu, Jun
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 546 - 550
  • [28] Contrastive Learning of View-invariant Representations for Facial Expressions Recognition
    Roy, Shuvendu
    Etemad, Ali
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [29] A Cross View Learning Approach for Skeleton-Based Action Recognition
    Zheng, Hui
    Zhang, Xinming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3061 - 3072
  • [30] SELF-SUPERVISED CONTRASTIVE LEARNING FOR AUDIO-VISUAL ACTION RECOGNITION
    Liu, Yang
    Tan, Ying
    Lan, Haoyuan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1000 - 1004