Multi-granularity hierarchical contrastive learning between foreground and background for semi-supervised video action detection

Authors
Zhang, Qiming [1]
Hu, Zhengping [1, 2, 3]
Wang, Yulu [1]
Zhang, Hehao [1]
Di, Jirui [1]
Affiliations
[1] Yanshan Univ, Sch Informat & Engn, Qinhuangdao 066004, Hebei, Peoples R China
[2] Yanshan Univ, Qinhuangdao 066004, Hebei, Peoples R China
[3] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao 066004, Hebei, Peoples R China
Keywords
Semi-supervised learning; Video action detection; Multi-granularity; Contrastive learning; NETWORK;
DOI
10.1016/j.knosys.2024.112853
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semi-supervised video action detection has received increasing attention because it lowers data annotation cost while achieving performance comparable to fully supervised methods. However, because videos contain dynamic background regions, existing methods can be biased when interpreting the foreground and background of a video. This bias causes the model to mistake dynamic background areas for action foreground, or to overlook background information, which leads to misjudgment of the foreground. To address this issue, this paper proposes a multi-granularity hierarchical contrastive learning method between foreground and background for semi-supervised video action detection, termed Multi-FB. First, the paper proposes a multi-granularity encoding network based on foreground and background. This network uses a unified encoder to represent and learn the foreground and background regions of a video at different granularities, improving the model's understanding of the action foreground and the related background. Second, the paper proposes an Intramodel multi-granularity hierarchical contrastive strategy, which minimizes the foreground-to-foreground and background-to-background representation discrepancies at different granularities within the same video, while maximizing the representation differences between foreground and background at the various granularities within that video. Third, the paper proposes a Cross-model multi-granularity hierarchical contrastive strategy, which enhances the consistency of the model's foreground and background representations between the original data and the augmented data. Extensive experiments on JHMDB-21 and UCF101-24 show that the proposed method clearly separates the feature representations of different categories and achieves performance comparable to state-of-the-art methods under semi-supervised conditions.
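The two contrastive strategies described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give the loss formulation, so the InfoNCE-style form, the cosine-based consistency term, the temperature value, and the names `fg_bg_contrastive_loss` and `cross_view_consistency` are all assumptions made for illustration. Each row of `fg` or `bg` stands for the embedding of the foreground or background region of one video at one granularity.

```python
import numpy as np


def l2_normalize(x, eps=1e-8):
    """Scale each row to (approximately) unit length."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def fg_bg_contrastive_loss(fg, bg, temperature=0.1):
    """Hypothetical InfoNCE-style loss for one video.

    fg, bg: arrays of shape (G, D), one embedding per granularity.
    Same-region pairs (fg-fg, bg-bg) are positives, fg-bg pairs are negatives.
    """
    assert len(fg) >= 2 and len(bg) >= 2, "need at least two granularities"
    feats = l2_normalize(np.concatenate([fg, bg], axis=0))
    labels = np.array([1] * len(fg) + [0] * len(bg))
    n = len(labels)

    # Cosine similarities, with self-similarity excluded from the softmax.
    self_mask = np.eye(n, dtype=bool)
    sim = np.where(self_mask, -np.inf, feats @ feats.T / temperature)

    # Numerically stable log-softmax over all other embeddings.
    shifted = sim - sim.max(axis=1, keepdims=True)
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))

    # Average negative log-probability of the positives for each anchor.
    pos_mask = (labels[:, None] == labels[None, :]) & ~self_mask
    pos_log_prob = np.where(pos_mask, log_prob, 0.0).sum(axis=1)
    return float(np.mean(-pos_log_prob / pos_mask.sum(axis=1)))


def cross_view_consistency(orig, aug):
    """Hypothetical consistency term: mean (1 - cosine similarity) between
    matching embeddings of the original and augmented views."""
    o, a = l2_normalize(orig), l2_normalize(aug)
    return float(np.mean(1.0 - np.sum(o * a, axis=-1)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fg = rng.normal(size=(3, 16))          # 3 granularities, 16-dim embeddings
    bg = rng.normal(size=(3, 16))
    fg_aug = fg + 0.05 * rng.normal(size=fg.shape)
    print("intra-video loss:", fg_bg_contrastive_loss(fg, bg))
    print("cross-view term:", cross_view_consistency(fg, fg_aug))
```

Under these assumptions, the first term falls as same-region embeddings cluster together and move away from the other region, and the second term falls as the original and augmented views agree.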
Pages: 16