UPL-Net: Uncertainty-aware prompt learning network for semi-supervised action recognition

Cited by: 0
Authors
Yang, Shu [1]
Li, Ya-Li [1]
Wang, Shengjin [1]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
Keywords
Semi-supervised learning; Prompt learning; Vision-language pre-training; Action recognition; Uncertainty estimation
DOI
10.1016/j.neucom.2024.129126
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper focuses on understanding human behavior in videos by reframing the traditional video classification task as a transfer learning problem centered on visual concepts. Unlike existing action recognition approaches that rely solely on single-modal representations and video classifiers, our method leverages an uncertainty-aware prompt learning network (UPL-Net). This network is designed to extract spatiotemporal features that are pertinent to action-related concepts in videos while ensuring that the visual concepts derived from images are preserved. Furthermore, we introduce an uncertainty-guided semi-supervised learning strategy that harnesses unlabeled videos to enhance the model's generalizability. Extensive experiments conducted on benchmark datasets, namely UCF and HMDB, demonstrate the superiority of our approach over state-of-the-art semi-supervised action recognition methods. Notably, under a 1% labeling rate on the UCF dataset, our method achieves a significant improvement of 12.8%, underscoring its effectiveness in leveraging limited labeled data and abundant unlabeled videos for improved performance.
Pages: 11
Related papers
50 items in total
  • [21] Neighborhood-Aware Attention Network for Semi-supervised Face Recognition
    Zhang, Qi
    Lei, Zhen
    Li, Stan Z.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [22] DANet: Semi-supervised differentiated auxiliaries guided network for video action recognition
    Gao, Guangyu
    Liu, Ziming
    Zhang, Guangjun
    Li, Jinyang
    Qin, A. K.
    NEURAL NETWORKS, 2023, 158 : 121 - 131
  • [23] Boosted multi-class semi-supervised learning for human action recognition
    Zhang, Tianzhu
    Liu, Si
    Xu, Changsheng
    Lu, Hanqing
    PATTERN RECOGNITION, 2011, 44 (10-11) : 2334 - 2342
  • [24] Audio-Visual Contrastive and Consistency Learning for Semi-Supervised Action Recognition
    Assefa, Maregu
    Jiang, Wei
    Zhan, Jinyu
    Gedamu, Kumie
    Yilma, Getinet
    Ayalew, Melese
    Adhikari, Deepak
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3491 - 3504
  • [25] Semi-Supervised Action Recognition From Temporal Augmentation Using Curriculum Learning
    Tong, Anyang
    Tang, Chao
    Wang, Wenjian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1305 - 1319
  • [26] Neighbor-Guided Consistent and Contrastive Learning for Semi-Supervised Action Recognition
    Wu, Jianlong
    Sun, Wei
    Gan, Tian
    Ding, Ning
    Jiang, Feijun
    Shen, Jialie
    Nie, Liqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2215 - 2227
  • [27] Uncertainty-aware self-training with adversarial data augmentation for semi-supervised medical image segmentation
    Cao, Juan
    Chen, Jiaran
    Liu, Jinjia
    Gu, Yuanyuan
    Chen, Lili
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
  • [28] Uncertainty Minimization for Personalized Federated Semi-Supervised Learning
    Shi, Yanhang
    Chen, Siguang
    Zhang, Haijun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (02) : 1060 - 1073
  • [29] Dual Attention Based Uncertainty-aware Mean Teacher Model for Semi-supervised Cardiac Image Segmentation
    Xu, An
    Wang, Shaoyu
    Fan, Jingyi
    Shi, Xiujin
    Chen, Qiang
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021 : 82 - 86
  • [30] Uncertainty-Aware Semi-Supervised Method Using Large Unlabeled and Limited Labeled COVID-19 Data
    Alizadehsani, Roohallah
    Sharifrazi, Danial
    Izadi, Navid Hoseini
    Joloudari, Javad Hassannataj
    Shoeibi, Afshin
    Gorriz, Juan M.
    Hussain, Sadiq
    Arco, Juan E.
    Sani, Zahra Alizadeh
    Khozeimeh, Fahime
    Khosravi, Abbas
    Nahavandi, Saeid
    Islam, Sheikh Mohammed Shariful
    Acharya, U. Rajendra
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (03)