STV-based video feature processing for action recognition

被引:9
|
作者
Wang, Jing [1 ]
Xu, Zhijie [1 ]
机构
[1] Univ Huddersfield, Sch Comp & Engn, Huddersfield HD1 3DH, W Yorkshire, England
关键词
Video events; Spatio-temporal volume; 3D segmentation; Region intersection; Action recognition; HUMAN MOVEMENT; MOTION; VISUALIZATION; SEGMENTATION; DISTANCE; MODELS;
D O I
10.1016/j.sigpro.2012.06.009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In comparison to still image-based processes, video features can provide rich and intuitive information about dynamic events occurred over a period of time, such as human actions, crowd behaviours, and other subject pattern changes. Although substantial progresses have been made in the last decade on image processing and seen its successful applications in face matching and object recognition, video-based event detection still remains one of the most difficult challenges in computer vision research due to its complex continuous or discrete input signals, arbitrary dynamic feature definitions, and the often ambiguous analytical methods. In this paper, a Spatio-Temporal Volume (STV) and region intersection (RI) based 3D shape-matching method has been proposed to facilitate the definition and recognition of human actions recorded in videos. The distinctive characteristics and the performance gain of the devised approach stemmed from a coefficient factor-boosted 3D region intersection and matching mechanism developed in this research. This paper also reported the investigation into techniques for efficient STV data filtering to reduce the amount of voxels (volumetric-pixels) that need to be processed in each operational cycle in the implemented system. The encouraging features and improvements on the operational performance registered in the experiments have been discussed at the end. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2151 / 2168
页数:18
相关论文
共 50 条
  • [31] Video action recognition with Key-detail Motion Capturing based on motion spectrum analysis and multiscale feature fusion
    Ganghan Zhang
    Guoheng Huang
    Haiyuan Chen
    Chi-Man Pun
    Zhiwen Yu
    Wing-Kuen Ling
    The Visual Computer, 2023, 39 : 539 - 556
  • [32] Video action recognition with Key-detail Motion Capturing based on motion spectrum analysis and multiscale feature fusion
    Zhang, Ganghan
    Huang, Guoheng
    Chen, Haiyuan
    Pun, Chi-Man
    Yu, Zhiwen
    Ling, Wing-Kuen
    VISUAL COMPUTER, 2023, 39 (02): : 539 - 556
  • [33] Robust human action recognition scheme based on high-level feature fusion
    Benmokhtar, Rachid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 69 (02) : 253 - 275
  • [34] Action Recognition Based on Feature Extraction From Time Series
    Keceli, Ali Seydi
    Can, Ahmet Burak
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 485 - 488
  • [35] ACTION RECOGNITION BASED ON SEMANTIC FEATURE DESCRIPTION AND CROSS CLASSIFICATION
    Zhao, Yang
    Wang, Qi
    Yuan, Yuan
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 626 - 630
  • [36] Action Recognition of Temporal Segment Network Based on Feature Fusion
    Li H.
    Ding Y.
    Li C.
    Zhang S.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (01): : 145 - 158
  • [37] Temporal Segment Networks Based on Feature Propagation for Action Recognition
    Shi Y.
    Zeng Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (04): : 582 - 589
  • [38] Human Action Recognition Based on Motion Feature and Manifold Learning
    Wang, Jun
    Xia, Limin
    Ma, Wentao
    IEEE ACCESS, 2021, 9 : 89287 - 89299
  • [39] Motion-Driven Visual Tempo Learning for Video-Based Action Recognition
    Liu, Yuanzhong
    Yuan, Junsong
    Tu, Zhigang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4104 - 4116
  • [40] Fast Binary-Based Video Descriptors for Action Recognition
    Leyva, Roberto
    Sanchez, Victor
    Tsun-Li, Chang
    2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 380 - 387