Detecting Human Action as the Spatio-Temporal Tube of Maximum Mutual Information

被引:18
|
作者
Wang, Taiqing [1 ,2 ]
Wang, Shengjin [1 ,2 ]
Ding, Xiaoqing [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
Action detection; feature trajectory; mutual information; spatio-temporal cuboid (ST-cuboid); spatio-temporal tube (ST-tube); RECOGNITION; MOTION; DENSE;
D O I
10.1109/TCSVT.2013.2276856
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human action detection in complex scenes is a challenging problem due to its high-dimensional search space and dynamic backgrounds. To achieve efficient and accurate action detection, we represent a video sequence as a collection of feature trajectories and model human action as the spatio-temporal tube (ST-tube) of maximum mutual information. First, a random forest is built to evaluate the mutual information of feature trajectories toward the action class, and then a one-order Markov model is introduced to recursively infer the action regions at consecutive frames. By exploring the time-continuity property of feature trajectories, the action region is efficiently inferred at large temporal intervals. Finally, we obtain an ST-tube by concatenating the consecutive action regions bounding the human bodies. Compared with the popular spatio-temporal cuboid action model, the proposed ST-tube model is not only more efficient, but also more accurate in action localization. Experimental results on the KTH, CMU and UCF sports datasets validate the superiority of our approach over the state-of-the-art methods in both localization accuracy and time efficiency.
引用
收藏
页码:277 / 290
页数:14
相关论文
共 50 条
  • [41] Management of spatio-temporal information in CIS
    Xu, Zhihong
    Shentu, Haigang
    Bian, Fuling
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2004, 29 (07):
  • [42] Spatio-temporal information integration in XML
    Yi, SZ
    Huang, B
    Chan, WT
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 103 - 110
  • [43] Spatio-temporal information integration in XML
    Huang, B
    Yi, SZ
    Chan, WT
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2004, 20 (07): : 1157 - 1170
  • [44] Visualization of spatio-temporal information in the Internet
    Andrienko, N
    Andrienko, G
    Gatalsky, P
    11TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, PROCEEDINGS, 2000, : 577 - 585
  • [45] Action Tubelet Detector for Spatio-Temporal Action Localization
    Kalogeiton, Vicky
    Weinzaepfel, Philippe
    Ferrari, Vittorio
    Schmid, Cordelia
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4415 - 4423
  • [46] Spatio-temporal action localization and detection for human recognition in big dataset
    Megrhi, Sameh
    Jmal, Marwa
    Souidene, Wided
    Beghdadi, Azeddine
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 41 : 375 - 390
  • [47] Evaluation of Color Spatio-Temporal Interest Points for Human Action Recognition
    Everts, Ivo
    van Gemert, Jan C.
    Gevers, Theo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (04) : 1569 - 1580
  • [48] Study of human action recognition based on improved spatio-temporal features
    Ji X.-F.
    Wu Q.-Q.
    Ju Z.-J.
    Wang Y.-Y.
    International Journal of Automation and Computing, 2014, 11 (05) : 500 - 509
  • [49] A fast human action recognition network based on spatio-temporal features
    Xu, Jie
    Song, Rui
    Wei, Haoliang
    Guo, Jinhong
    Zhou, Yifei
    Huang, Xiwei
    NEUROCOMPUTING, 2021, 441 : 350 - 358
  • [50] Study of Human Action Recognition Based on Improved Spatio-temporal Features
    Xiao-Fei Ji
    Qian-Qian Wu
    Zhao-Jie Ju
    Yang-Yang Wang
    International Journal of Automation and Computing, 2014, (05) : 500 - 509