Temporal RPN Learning for Weakly-Supervised Temporal Action Localization

Cited by: 0

Authors:

Huang, Jing [1]

Kong, Ming [2,3]

Chen, Luyuan [4]

Liang, Tian [1]

Zhu, Qiang [2]

Affiliations:

[1] Zhejiang Univ, Hangzhou 310058, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China

[3] Hikvis Res Inst, Hangzhou 310051, Peoples R China

[4] Beijing Informat Sci & Technol Univ, Beijing 100101, Peoples R China

Source:

ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222 | 2023 / Volume 222

Keywords:

Weakly-Supervised Learning; Action Localization; Temporal Region Proposal;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Weakly-Supervised Temporal Action Localization (WSTAL) aims to train an action instance localization model from untrimmed videos with only video-level labels, a setting similar to the Object Detection (OD) task. Existing Top-k multiple-instance learning (MIL) based WSTAL methods cannot flexibly define the learning space, which limits the model's learning efficiency and performance. Faster R-CNN is a classic two-stage object detection architecture with an efficient Region Proposal Network (RPN). This paper migrates a Faster R-CNN-like two-stage architecture to the WSTAL task in two steps: first, it builds a Temporal RPN (T-RPN) and integrates it with the traditional WSTAL framework; second, it proposes a pseudo-label generation mechanism that enables the T-RPN to learn without temporal annotations. The new framework achieves breakthrough performance on the THUMOS-14 and ActivityNet-v1.2 datasets, and comprehensive ablation experiments verify the effectiveness of these innovations. Code will be available at: https://github.com/ZJUHJ/TRPN.
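
For readers unfamiliar with the Top-k MIL baseline that the abstract contrasts against, the following is a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical matrix of snippet-level class scores and averages the k highest scores per class to obtain a video-level score that can be trained against video-level labels. The function name `topk_mil_video_scores`, the toy scores, and the choice k=2 are all illustrative assumptions.

```python
from typing import List


def topk_mil_video_scores(snippet_scores: List[List[float]], k: int) -> List[float]:
    """Aggregate snippet-level class scores into video-level scores.

    snippet_scores[t][c] is a hypothetical activation of class c at snippet t.
    For each class, the k highest snippet scores are averaged (top-k pooling).
    """
    if not snippet_scores:
        raise ValueError("need at least one snippet")
    # Clamp k to the valid range [1, number of snippets].
    k = max(1, min(k, len(snippet_scores)))
    num_classes = len(snippet_scores[0])
    video_scores = []
    for c in range(num_classes):
        # Sort this class's scores over time, highest first.
        per_snippet = sorted((row[c] for row in snippet_scores), reverse=True)
        video_scores.append(sum(per_snippet[:k]) / k)
    return video_scores


# Toy input (assumed values): 5 snippets, 2 classes.
scores = [
    [0.9, 0.1],
    [0.8, 0.2],
    [0.1, 0.1],
    [0.2, 0.7],
    [0.1, 0.3],
]
video_level = topk_mil_video_scores(scores, k=2)
print(video_level)
```

In this sketch only the k selected snippets per class contribute to the video-level score, which is one way to read the abstract's remark that Top-k MIL fixes the learning space rather than defining it flexibly.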


Pages: 16

50 items in total

[1] Self-supervised temporal adaptive learning for weakly-supervised temporal action localization
Sheng, Jinrong
Yu, Jiaruo
Li, Ziqiang
Li, Ao
Ge, Yongxin
INFORMATION SCIENCES, 2025, 705
[2] Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization
Fu, Jie
Gao, Junyu
Xu, Changsheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12427 - 12443
[3] Vectorized Evidential Learning for Weakly-Supervised Temporal Action Localization
Gao, Junyu
Chen, Mengyuan
Xu, Changsheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15949 - 15963
[4] Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning
Du, Jia-Run
Feng, Jia-Chang
Lin, Kun-Yu
Hong, Fa-Ting
Qi, Zhongang
Shan, Ying
Hu, Jian-Fang
Zheng, Wei-Shi
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 938 - 952
[5] Weakly-supervised temporal action localization: a survey
Baraka, AbdulRahman
Noor, Mohd Halim Mohd
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11) : 8479 - 8499
[6] Weakly-supervised temporal action localization: a survey
AbdulRahman Baraka
Mohd Halim Mohd Noor
Neural Computing and Applications, 2022, 34 : 8479 - 8499
[7] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
Gao, Junyu
Chen, Mengyuan
Xu, Changsheng
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 19967 - 19977
[8] Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization
Lee, Pilhyeon
Byun, Hyeran
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 13628 - 13637
[9] CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
Zhang, Can
Cao, Meng
Yang, Dongming
Chen, Jie
Zou, Yuexian
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 16005 - 16014
[10] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
Chen, Mengyuan
Gao, Junyu
Yang, Shicai
Xu, Changsheng
COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 192 - 208
