Temporal RPN Learning for Weakly-Supervised Temporal Action Localization

Citations: 0
Authors
Huang, Jing [1]
Kong, Ming [2, 3]
Chen, Luyuan [4]
Liang, Tian [1]
Zhu, Qiang [2]
Affiliations
[1] Zhejiang Univ, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China
[3] Hikvis Res Inst, Hangzhou 310051, Peoples R China
[4] Beijing Informat Sci & Technol Univ, Beijing 100101, Peoples R China
Source
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222 | 2023 / Vol. 222
Keywords
Weakly-Supervised Learning; Action Localization; Temporal Region Proposal
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Weakly-Supervised Temporal Action Localization (WSTAL) aims to train an action instance localization model from untrimmed videos with only video-level labels, similar to the Object Detection (OD) task. Existing Top-k MIL-based WSTAL methods cannot flexibly define the learning space, which limits the model's learning efficiency and performance. Faster R-CNN is a classic two-stage object detection architecture with an efficient Region Proposal Network (RPN). This paper migrates a Faster R-CNN-like two-stage architecture to the WSTAL task: first, it builds a temporal RPN (T-RPN) and integrates it with the traditional WSTAL framework; second, it proposes a pseudo-label generation mechanism that enables T-RPN learning without temporal annotations. The new framework achieves breakthrough performance on the THUMOS-14 and ActivityNet-v1.2 datasets, and comprehensive ablation experiments verify the effectiveness of the innovations. Code will be available at: https://github.com/ZJUHJ/TRPN.
Pages: 16
Related papers
50 in total
  • [21] Weakly-Supervised Temporal Action Localization with Multi-Head Cross-Modal Attention
    Ren, Hao
    Ren, Haoran
    Ran, Wu
    Lu, Hong
    Jin, Cheng
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 281 - 295
  • [22] Graph Representation for Weakly-Supervised Spatio-Temporal Action Detection
    Singh, Dinesh
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [23] Multilevel semantic and adaptive actionness learning for weakly supervised temporal action localization
    Li, Zhilin
    Wang, Zilei
    Dong, Cerui
    NEURAL NETWORKS, 2025, 182
  • [24] PFWNet: Pretraining neural network via feature jigsaw puzzle for weakly-supervised temporal action localization
    Wang, Binglu
    Zhao, Yongqiang
    Zhang, Yani
    NEUROCOMPUTING, 2021, 443 : 162 - 173
  • [25] Bilateral Relation Distillation for Weakly Supervised Temporal Action Localization
    Xu, Zhe
    Wei, Kun
    Yang, Erkun
    Deng, Cheng
    Liu, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11458 - 11471
  • [26] Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks
    Liu, Ziyi
    Wang, Le
    Zhang, Qilin
    Tang, Wei
    Zheng, Nanning
    Hua, Gang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5886 - 5902
  • [28] Weakly-supervised action localization based on seed superpixels
    Ullah, Sami
    Bhatti, Naeem
    Qasim, Tehreem
    Hassan, Najmul
    Zia, Muhammad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (04) : 6203 - 6220
  • [29] Exploring Sub-Action Granularity for Weakly Supervised Temporal Action Localization
    Wang, Binglu
    Zhang, Xun
    Zhao, Yongqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2186 - 2198
  • [30] MODAL CONSENSUS AND CONTEXTUAL SEPARATION FOR WEAKLY SUPERVISED TEMPORAL ACTION LOCALIZATION
    Liu, Peng
    Wang, Chuanxu
    Zhao, Min
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4220 - 4224