Relational Prototypical Network for Weakly Supervised Temporal Action Localization

被引:0
|
作者
Huang, Linjiang [1 ,3 ]
Huang, Yan [1 ,3 ]
Ouyang, Wanli [4 ]
Wang, Liang [1 ,2 ,3 ]
机构
[1] Natl Lab Pattern Recognit NLPR, Ctr Res Intelligent Percept & Comp CRIPAC, Sydney, NSW, Australia
[2] Chinese Acad Sci CASIA, Ctr Excellence Brain Sci & Intelligence Technol C, Inst Automat, Beijing, Peoples R China
[3] Univ Chinese Acad Sci UCAS, Beijing, Peoples R China
[4] Univ Sydney, Sydney, NSW, Australia
来源
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a weakly supervised temporal action localization method on untrimmed videos based on prototypical networks. We observe two challenges posed by weakly supervision, namely action-background separation and action relation construction. Unlike the previous method, we propose to achieve action-background separation only by the original videos. To achieve this, a clustering loss is adopted to separate actions from backgrounds and learn intra-compact features, which helps in detecting complete action instances. Besides, a similarity weighting module is devised to further separate actions from backgrounds. To effectively identify actions, we propose to construct relations among actions for prototype learning. A GCN-based prototype embedding module is introduced to generate relational prototypes. Experiments on THUMOS14 and ActivityNet1.2 datasets show that our method outperforms the state-of-the-art methods.
引用
收藏
页码:11053 / 11060
页数:8
相关论文
共 50 条
  • [21] Deep cascaded action attention network for weakly-supervised temporal action localization
    Xia, Hui-fen
    Zhan, Yong-zhao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29769 - 29787
  • [22] ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
    Liu, Ziyi
    Wang, Le
    Zhang, Qilin
    Tang, Wei
    Yuan, Junsong
    Zheng, Nanning
    Hua, Gang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2233 - 2241
  • [23] Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization
    Moniruzzaman, Md.
    Yin, Zhaozheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6939 - 6951
  • [24] Deep cascaded action attention network for weakly-supervised temporal action localization
    Hui-fen Xia
    Yong-zhao Zhan
    Multimedia Tools and Applications, 2023, 82 : 29769 - 29787
  • [25] Weakly Supervised Temporal Action Localization by Multi-Stage Fusion Network
    Shen, Zhengyang
    Wang, Feng
    Dai, Jin
    IEEE ACCESS, 2020, 8 : 17287 - 17298
  • [26] Progressive enhancement network with pseudo labels for weakly supervised temporal action localization
    Wang, Qingyun
    Song, Yan
    Zou, Rong
    Shu, Xiangbo
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [27] Deep feature enhancing and selecting network for weakly supervised temporal action localization
    Yu, Jiaruo
    Ge, Yongxin
    Qin, Xiaolei
    Li, Ziqiang
    Huang, Sheng
    Chen, Feiyu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
  • [28] Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network
    Ren, Hao
    Ran, Wu
    Liu, Xingson
    Ren, Haoran
    Lu, Hong
    Zhang, Rui
    Jin, Cheng
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1008 - 1013
  • [29] Self-attention relational modeling and background suppression for weakly supervised temporal action localization
    Wang, Jing
    Wang, Chuanxu
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [30] Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization
    Zhou, Jianxiong
    Wu, Ying
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6017 - 6026