UPL-Net: Uncertainty-aware prompt learning network for semi-supervised action recognition

被引:0
作者
Yang, Shu [1 ]
Li, Ya-Li [1 ]
Wang, Shengjin [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
Semi-supervised learning; Prompt learning; Vision-language pre-training; Action recognition; Uncertainty estimation;
D O I
10.1016/j.neucom.2024.129126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on understanding human behavior in videos by reframing the traditional video classification task as a transfer learning problem centered on visual concepts. Unlike existing action recognition approaches that rely solely on single-modal representations and video classifiers, our method leverages an uncertainty- aware prompt learning network (UPL-Net). This network is designed to extract spatiotemporal features that are pertinent to action-related concepts in videos while ensuring that the visual concepts derived from images are preserved. Furthermore, we introduce an uncertainty-guided semi-supervised learning strategy that harnesses unlabeled videos to enhance the model's generalizability. Extensive experiments conducted on benchmark datasets, namely UCF and HMDB, demonstrate the superiority of our approach over state-of-the-art semi- supervised action recognition methods. Notably, under a 1% labeling rate on the UCF dataset, our method achieves a significant improvement of 12.8%, underscoring its effectiveness in leveraging limited labeled data and abundant unlabeled videos for improved performance.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] 3D Features for human action recognition with semi-supervised learning
    Sahoo, Suraj Prakash
    Srinivasu, Ulli
    Ari, Samit
    IET IMAGE PROCESSING, 2019, 13 (06) : 983 - 990
  • [42] Human Action Recognition Based on Multi-view Semi-supervised Learning
    Tang C.
    Wang W.
    Wang X.
    Zhang C.
    Zou L.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (04): : 376 - 384
  • [43] Learning sample-aware threshold for semi-supervised learning
    Wei, Qi
    Feng, Lei
    Sun, Haoliang
    Wang, Ren
    He, Rundong
    Yin, Yilong
    MACHINE LEARNING, 2024, 113 (08) : 5423 - 5445
  • [44] Semi-Supervised Multiple Feature Analysis for Action Recognition
    Wang, Sen
    Ma, Zhigang
    Yang, Yi
    Li, Xue
    Pang, Chaoyi
    Hauptmann, Alexander G.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (02) : 289 - 298
  • [45] A Neural Network for Semi-supervised Learning on Manifolds
    Genkin, Alexander
    Sengupta, Anirvan M.
    Chklovskii, Dmitri
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 375 - 386
  • [46] EMPC: Efficient multi-view parallel co-learning for semi-supervised action recognition
    Tong, Anyang
    Tang, Chao
    Wang, Wenjian
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [47] CONTRASTIVE SIAMESE NETWORK FOR SEMI-SUPERVISED SPEECH RECOGNITION
    Khorram, Soheil
    Kim, Jaeyoung
    Tripathi, Anshuman
    Lu, Han
    Zhang, Qian
    Sak, Hasim
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7207 - 7211
  • [48] Regularized extreme learning machine for multi-view semi-supervised action recognition
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    NEUROCOMPUTING, 2014, 145 : 250 - 262
  • [49] Face and Gait Recognition Based on Semi-supervised Learning
    Yu, Qiuhong
    Yin, Yilong
    Yang, Gongping
    Ning, Yanbing
    Li, Yanan
    PATTERN RECOGNITION, 2012, 321 : 284 - 291
  • [50] Named entity recognition: a semi-supervised learning approach
    Sintayehu H.
    Lehal G.S.
    International Journal of Information Technology, 2021, 13 (4) : 1659 - 1665