Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network

被引:0
|
作者
Ren, Hao [1 ]
Ran, Wu [1 ]
Liu, Xingson [1 ]
Ren, Haoran [1 ]
Lu, Hong [1 ]
Zhang, Rui [1 ]
Jin, Cheng [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
关键词
Temporal Action Localization; Weakly-supervised Learning; Adaptive Clustering;
D O I
10.1109/ICME55011.2023.00177
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly-supervised temporal action localization task aims to localize temporal boundaries of action instances by using only video-level labels. Existing methods primarily adopt Multi-Instance-Learning (MIL) scheme to handle this task. The effectiveness of MIL scheme depends heavily on the selection of top-k action snippets, which is unstable and requires manual tuning. To address these deficiencies, we propose an Adaptive Clustering and Refining Network (ACRNet). Specifically, we present an action-aware clustering strategy that is adaptable and requires no manual tuning to separate action and background snippets of diverse videos based on intra-class activation distribution. And a cluster refining step is included to eliminate false action snippets by considering inter-class activation distribution, which greatly improves robustness and localization accuracy. Extensive experiments on THUMOS14, ActivityNet 1.2&1.3 benchmarks show that our method achieves state-of-the-art performance.
引用
收藏
页码:1008 / 1013
页数:6
相关论文
共 50 条
  • [41] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
    Chen, Mengyuan
    Gao, Junyu
    Yang, Shicai
    Xu, Changsheng
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 192 - 208
  • [42] Learning Background Suppression Model for Weakly-supervised Temporal Action Localization
    Liu, Mengxue
    Gao, Xiangjun
    Ge, Fangzhen
    Liu, Huaiyu
    Li, Wenjing
    IAENG International Journal of Computer Science, 2021, 48 (04):
  • [43] Unleashing the Potential of Adjacent Snippets for Weakly-supervised Temporal Action Localization
    Liu, Qinying
    Wang, Zilei
    Chen, Ruoxi
    Li, Zhilin
    Proceedings - IEEE International Conference on Multimedia and Expo, 2023, 2023-July : 1032 - 1037
  • [44] Unleashing the Potential of Adjacent Snippets for Weakly-supervised Temporal Action Localization
    Liu, Qinying
    Wang, Zilei
    Chen, Ruoxi
    Li, Zhilin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1032 - 1037
  • [45] TSCANet: a two-stream context aggregation network for weakly-supervised temporal action localization
    Zhang, Haiping
    Lin, Haixiang
    Wang, Dongjing
    Xu, Dongyang
    Zhou, Fuxing
    Guan, Liming
    Yu, Dongjing
    Fang, Xujian
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [46] A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization
    Zhao, Yibo
    Zhang, Hua
    Gao, Zan
    Guan, Weili
    Wang, Meng
    Chen, Shengyong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7202 - 7215
  • [47] Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach
    Liu, Qinying
    Wang, Zilei
    Rong, Shenghai
    Li, Junjie
    Zhang, Yixin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10399 - 10409
  • [48] ACTION COHERENCE NETWORK FOR WEAKLY SUPERVISED TEMPORAL ACTION LOCALIZATION
    Zhai, Yuanhao
    Wang, Le
    Liu, Ziyi
    Zhang, Qilin
    Hua, Gang
    Zheng, Nanning
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3696 - 3700
  • [49] Weakly-supervised Action Localization with Background Modeling
    Phuc Xuan Nguyen
    Ramanan, Deva
    Fowlkes, Charless C.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5501 - 5510
  • [50] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
    Gao, Junyu
    Chen, Mengyuan
    Xu, Changsheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19967 - 19977