Detecting action tubes via spatial action estimation and temporal path inference

Cited by: 4
Authors
Li, Nannan [1 ]
Huang, Jingjia [1 ]
Li, Thomas [2 ]
Guo, Huiwen [3 ]
Li, Ge [1 ]
Affiliations
[1] Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Beijing, Peoples R China
[2] Gpower Semicond Inc, Suzhou, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Action detection; Spatial localization; Region proposal network; Tracking-by-detection; Sum-product networks; Action recognition
DOI
10.1016/j.neucom.2018.05.033
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we address the problem of action detection in unconstrained video clips. Our approach starts with action detection on object proposals in each frame, then aggregates the frame-level detections belonging to the same actor across the whole video via linking, associating, and tracking to generate action tubes that are spatially compact and temporally continuous. To this end, we first propose a novel two-stream action detection model that uses fused features from both appearance and motion cues and can be trained end-to-end. Then, the association of action paths is formulated as a maximum set coverage problem, with the action detection results serving as a prior. We use an incremental search algorithm to obtain all the action proposals in a single pass, which is especially efficient for long videos or videos containing multiple action instances. Finally, a tracking-by-detection scheme is designed to further refine the generated action paths. Extensive experiments on three datasets, UCF-Sports, UCF-101 and J-HMDB, show that the proposed approach advances state-of-the-art action detection performance in terms of both accuracy and proposal quality. © 2018 Elsevier B.V. All rights reserved.
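To make the maximum set coverage formulation in the abstract more concrete, the following is a minimal sketch of the textbook greedy approximation for that problem. It is not the paper's incremental search algorithm, and it ignores the detection scores that the paper uses as a prior. The function name `greedy_max_coverage`, the path names, and the integer detection IDs are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set


def greedy_max_coverage(candidates: Dict[str, Set[int]], k: int) -> List[str]:
    """Pick up to k candidate paths that together cover the most detections.

    Assumption: each candidate path is represented by the set of
    frame-level detection IDs it passes through (hypothetical encoding).
    """
    covered: Set[int] = set()
    chosen: List[str] = []
    remaining = dict(candidates)  # copy so the caller's dict is untouched
    for _ in range(k):
        # Choose the path that adds the most not-yet-covered detections.
        best = max(
            remaining,
            key=lambda name: len(remaining[name] - covered),
            default=None,
        )
        # Stop when no candidates are left or nothing new can be covered.
        if best is None or not (remaining[best] - covered):
            break
        chosen.append(best)
        covered |= remaining.pop(best)
    return chosen


# Hypothetical candidate paths over detection IDs 1..7.
example_paths = {"a": {1, 2, 3}, "b": {3, 4}, "c": {4, 5, 6, 7}}
selected = greedy_max_coverage(example_paths, k=2)
print(selected)
```

The greedy rule is a common baseline because it is simple and has a known approximation guarantee for maximum coverage; a score-weighted or incremental variant, as the abstract describes, would replace the counting step inside the loop.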
Pages: 65-77
Number of pages: 13