Human action and event recognition using a novel descriptor based on improved dense trajectories

被引:1
作者
Mukherjee, Snehasis [1 ]
Singh, Krit Karan [1 ]
机构
[1] IIIT Chittoor, Sricity 517646, Andhra Pradesh, India
关键词
Event recognition; Action recognition; Dense trajectories; Fisher vector; HISTOGRAMS;
D O I
10.1007/s11042-017-4980-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a unified method for recognizing human action and human related events in a realistic video. We use an efficient pipeline of (a) a 3D representation of the Improved Dense Trajectory Feature (DTF) and (b) Fisher Vector (FV). Further, a novel descriptor is proposed, capable of representing human actions and human related events based on the FV representation of the input video. The proposed unified descriptor is a 168-dimensional vector obtained from each video sequence by statistically analyzing the motion patterns of the 3D joint locations of the human body. The proposed descriptor is trained using binary Support Vector Machine (SVM) for recognizing human actions or human related events. We evaluate the proposed approach on two challenging action recognition datasets: UCF sports and CMU Mocap datasets. In addition to the two action recognition dataset, the proposed approach is tested on the Hollywood2 event recognition dataset. On all the benchmark datasets for both action and event recognition, the proposed approach has shown its efficacy compared to the state-of-the-art techniques.
引用
收藏
页码:13661 / 13678
页数:18
相关论文
共 35 条
[1]  
[Anonymous], 2012, CRCV T 12 01
[2]  
[Anonymous], 2009, ICCV WORKSH VID OR O
[3]  
[Anonymous], 2009, BMVC 2009
[4]   Landmark-based multimodal human action recognition [J].
Asteriadis, Stylianos ;
Daras, Petros .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) :4505-4521
[5]   Speeded-Up Robust Features (SURF) [J].
Bay, Herbert ;
Ess, Andreas ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359
[6]  
Bregonzio M, 2009, PROC CVPR IEEE, P1948, DOI 10.1109/CVPRW.2009.5206779
[7]   Action recognition from depth sequences using weighted fusion of 2D and 3D auto-correlation of gradients features [J].
Chen, Chen ;
Zhang, Baochang ;
Hou, Zhenjie ;
Jiang, Junjun ;
Liu, Mengyuan ;
Yang, Yun .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) :4651-4669
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   Human detection using oriented histograms of flow and appearance [J].
Dalal, Navneet ;
Triggs, Bill ;
Schmid, Cordelia .
COMPUTER VISION - ECCV 2006, PT 2, PROCEEDINGS, 2006, 3952 :428-441
[10]  
David G., 1999, P 7 IEEE INT C COMP, V2, P1150