UPL-Net: Uncertainty-aware prompt learning network for semi-supervised action recognition

被引：0

作者：

Yang, Shu ^{[1
]}

Li, Ya-Li ^{[1
]}

Wang, Shengjin ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

来源：

NEUROCOMPUTING | 2025年 / 619卷

关键词：

Semi-supervised learning; Prompt learning; Vision-language pre-training; Action recognition; Uncertainty estimation;

D O I：

10.1016/j.neucom.2024.129126

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper focuses on understanding human behavior in videos by reframing the traditional video classification task as a transfer learning problem centered on visual concepts. Unlike existing action recognition approaches that rely solely on single-modal representations and video classifiers, our method leverages an uncertainty- aware prompt learning network (UPL-Net). This network is designed to extract spatiotemporal features that are pertinent to action-related concepts in videos while ensuring that the visual concepts derived from images are preserved. Furthermore, we introduce an uncertainty-guided semi-supervised learning strategy that harnesses unlabeled videos to enhance the model's generalizability. Extensive experiments conducted on benchmark datasets, namely UCF and HMDB, demonstrate the superiority of our approach over state-of-the-art semi- supervised action recognition methods. Notably, under a 1% labeling rate on the UCF dataset, our method achieves a significant improvement of 12.8%, underscoring its effectiveness in leveraging limited labeled data and abundant unlabeled videos for improved performance.

引用

页数：11

共 50 条

[41] 3D Features for human action recognition with semi-supervised learning
Sahoo, Suraj Prakash
Srinivasu, Ulli
Ari, Samit
IET IMAGE PROCESSING, 2019, 13 (06) : 983 - 990
[42] Human Action Recognition Based on Multi-view Semi-supervised Learning
Tang C.
Wang W.
Wang X.
Zhang C.
Zou L.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (04): : 376 - 384
[43] Learning sample-aware threshold for semi-supervised learning
Wei, Qi
Feng, Lei
Sun, Haoliang
Wang, Ren
He, Rundong
Yin, Yilong
MACHINE LEARNING, 2024, 113 (08) : 5423 - 5445
[44] Semi-Supervised Multiple Feature Analysis for Action Recognition
Wang, Sen
Ma, Zhigang
Yang, Yi
Li, Xue
Pang, Chaoyi
Hauptmann, Alexander G.
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (02) : 289 - 298
[45] A Neural Network for Semi-supervised Learning on Manifolds
Genkin, Alexander
Sengupta, Anirvan M.
Chklovskii, Dmitri
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 375 - 386
[46] EMPC: Efficient multi-view parallel co-learning for semi-supervised action recognition
Tong, Anyang
Tang, Chao
Wang, Wenjian
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
[47] CONTRASTIVE SIAMESE NETWORK FOR SEMI-SUPERVISED SPEECH RECOGNITION
Khorram, Soheil
Kim, Jaeyoung
Tripathi, Anshuman
Lu, Han
Zhang, Qian
Sak, Hasim
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7207 - 7211
[48] Regularized extreme learning machine for multi-view semi-supervised action recognition
Iosifidis, Alexandros
Tefas, Anastasios
Pitas, Ioannis
NEUROCOMPUTING, 2014, 145 : 250 - 262
[49] Face and Gait Recognition Based on Semi-supervised Learning
Yu, Qiuhong
Yin, Yilong
Yang, Gongping
Ning, Yanbing
Li, Yanan
PATTERN RECOGNITION, 2012, 321 : 284 - 291
[50] Named entity recognition: a semi-supervised learning approach
Sintayehu H.
Lehal G.S.
International Journal of Information Technology, 2021, 13 (4) : 1659 - 1665

← 1 2 3 4 5 →