Multi-scale Dynamic Network for Temporal Action Detection

被引:2
作者
Ren, Yifan [1 ,2 ]
Xu, Xing [1 ,2 ]
Shen, Fumin [1 ,2 ]
Wang, Zheng [1 ,2 ]
Yang, Yang [1 ,2 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
来源
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21) | 2021年
基金
中国国家自然科学基金;
关键词
Temporal Action Detection; Dynamic Filters; Multi-scale Features;
D O I
10.1145/3460426.3463613
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, as the fundamental task in video understanding, Temporal Action Detection is attracting extensive attention. Most existing approaches use the same model parameters to process all input videos, which are not adaptive to the input video during the inference stage. In this paper, we propose a novel model termed Multi-scale Dynamic Network (MDN) to tackle this problem. The proposed MDN model incorporates multiple Multi-scale Dynamic Modules (MDMs). Each MDM can generate video-specific and segment-specific convolution kernels based on video content from different scales and adaptively capture rich semantic information for the prediction. Besides, we also design a new Edge Suppression Loss (ESL) function for MDN to pay more attention to hard examples. Extensive experiments conducted on two popular benchmarks ActivityNet-1.3 and THUMOS-14 show that the proposed MDN model achieves the state-of-the-art performance.
引用
收藏
页码:267 / 275
页数:9
相关论文
共 50 条
  • [21] ParaLkResNet: an efficient multi-scale image classification network
    Yu, Tongshuai
    Liu, Ye
    Liu, Hao
    Chen, Ji
    Wang, Xing
    VISUAL COMPUTER, 2024, 40 (07) : 5057 - 5066
  • [22] Multi-Scale Context Fusion Network for Urban Solid Waste Detection in Remote Sensing Images
    Li, Yangke
    Zhang, Xinman
    REMOTE SENSING, 2024, 16 (19)
  • [23] MFSFNet: Multi-Scale Feature Subtraction Fusion Network for Remote Sensing Image Change Detection
    Huang, Zhiqi
    You, Hongjian
    REMOTE SENSING, 2023, 15 (15)
  • [24] Remote sensing image change detection network with multi-scale feature information mining and fusion
    Xue, Songdong
    Zhang, Minming
    Qiao, Gangzhu
    Zhang, Chaofan
    Wang, Bin
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (02)
  • [25] Multi-Scale Aggregation Transformers for Multispectral Object Detection
    You, Shuai
    Xie, Xuedong
    Feng, Yujian
    Mei, Chaojun
    Ji, Yimu
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1172 - 1176
  • [26] Action recognition algorithm based on multi-scale and multi-branch features
    Zhang Lei
    Han Guang-Liang
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (12) : 1614 - 1625
  • [27] Capsule Boundary Network With 3D Convolutional Dynamic Routing for Temporal Action Detection
    Chen, Yaosen
    Guo, Bing
    Shen, Yan
    Wang, Wei
    Lu, Weichen
    Suo, Xinhua
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2962 - 2975
  • [28] DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
    Yang, Le
    Zheng, Ziwei
    Han, Yizeng
    Cheng, Hao
    Song, Shiji
    Huang, Gao
    Li, Fan
    COMPUTER VISION-ECCV 2024, PT XLVI, 2025, 15104 : 305 - 322
  • [29] Temporal-visual proposal graph network for temporal action detection
    Ming-Gang Gan
    Yan Zhang
    Shaowen Su
    Applied Intelligence, 2023, 53 : 26008 - 26026
  • [30] SMC: Single-Stage Multi-location Convolutional Network for Temporal Action Detection
    Liu, Zhikang
    Wang, Zilei
    Zhao, Yan
    Tian, Ye
    COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 179 - 195