Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization

被引:43
作者
Lee, Pilhyeon [1 ]
Byun, Hyeran [1 ,2 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea
[2] Yonsei Univ, Grad Sch AI, Seoul, South Korea
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/ICCV48922.2021.01339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the problem of localizing temporal intervals of actions with only a single frame label for each action instance for training. Owing to label sparsity, existing work fails to learn action completeness, resulting in fragmentary action predictions. In this paper, we propose a novel framework, where dense pseudo-labels are generated to provide completeness guidance for the model. Concretely, we first select pseudo background points to supplement point-level action labels. Then, by taking the points as seeds, we search for the optimal sequence that is likely to contain complete action instances while agreeing with the seeds. To learn completeness from the obtained sequence, we introduce two novel losses that contrast action instances with background ones in terms of action score and feature similarity, respectively. Experimental results demonstrate that our completeness guidance indeed helps the model to locate complete action instances, leading to large performance gains especially under high IoU thresholds. Moreover, we demonstrate the superiority of our method over existing state-of-the-art methods on four benchmarks: THUMOS'14, GTEA, BEOID, and ActivityNet. Notably, our method even performs comparably to recent fully-supervised methods, at the 6x cheaper annotation cost. Our code is available at https://github.com/Pilhyeon.
引用
收藏
页码:13628 / 13637
页数:10
相关论文
共 50 条
  • [21] Spatial–temporal correlations learning and action-background jointed attention for weakly-supervised temporal action localization
    Huifen Xia
    Yongzhao Zhan
    Keyang Cheng
    Multimedia Systems, 2022, 28 : 1529 - 1541
  • [22] Dynamic Graph Modeling for Weakly-Supervised Temporal Action Localization
    Shi, Haichao
    Zhang, Xiao-Yu
    Li, Changsheng
    Gong, Lixing
    Li, Yong
    Bao, Yongjun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3820 - 3828
  • [23] Deep Motion Prior for Weakly-Supervised Temporal Action Localization
    Cao, Meng
    Zhang, Can
    Chen, Long
    Shou, Mike Zheng
    Zou, Yuexian
    IEEE Transactions on Image Processing, 2022, 31 : 5203 - 5213
  • [24] Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
    Ju, Chen
    Zhao, Peisen
    Chen, Siheng
    Zhang, Ya
    Zhang, Xiaoyun
    Wang, Yanfeng
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6688 - 6701
  • [25] Dynamic Graph Modeling for Weakly-Supervised Temporal Action Localization
    Shi, Haichao
    Zhang, Xiao-Yu
    Li, Changsheng
    Gong, Lixing
    Li, Yong
    Bao, Yongjun
    MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia, 2022, : 3820 - 3828
  • [26] ACGNet: Action Complement Graph Network for Weakly-Supervised Temporal Action Localization
    Yang, Zichen
    Qin, Jie
    Huang, Di
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3090 - 3098
  • [27] Boosting Weakly-Supervised Temporal Action Localization with Text Information
    Li, Guozhang
    Cheng, De
    Ding, Xinpeng
    Wang, Nannan
    Wang, Xiaoyu
    Gao, Xinbo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10648 - 10657
  • [28] Feature Matching Network for Weakly-Supervised Temporal Action Localization
    Dou, Peng
    Zhou, Wei
    Liao, Zhongke
    Hu, Haifeng
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 459 - 471
  • [29] Complementary adversarial mechanisms for weakly-supervised temporal action localization
    Wang, Chuanxu
    Wang, Jing
    Liu, Peng
    PATTERN RECOGNITION, 2023, 139
  • [30] Deep Motion Prior for Weakly-Supervised Temporal Action Localization
    Cao, Meng
    Zhang, Can
    Chen, Long
    Shou, Mike Zheng
    Zou, Yuexian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5203 - 5213