Constructing Hierarchical Spatiotemporal Information for Action Recognition

被引:0
作者
Yao, Guangle [1 ,2 ,3 ]
Zhong, Jiandan [1 ,2 ,3 ]
Lei, Tao [1 ]
Liu, Xianyuan [1 ]
机构
[1] Chinese Acad Sci, Inst Opt & Elect, Chengdu, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI) | 2018年
关键词
action recognition; convolutional neural network; spatiotemporal information; action representation; optical flow; NETWORKS;
D O I
10.1109/SmartWorld.2018.00123
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Video action recognition is widely applied in video indexing, intelligent surveillance, multimedia understanding, and other fields. Recently, it was greatly improved by incorporating the convolutional neural network (ConvNet). The features of shadow layers in ConvNet tend to model the apparent and motion of actions, and the features of deep layers tend to represent actions. In this paper, we propose to construct hierarchical information by combining the spatiotemporal features of shadow and deep layers in 3D ConvNet for action recognition. Specifically, we use Res3D to extract spatiotemporal information from different types of layers, and transfer the knowledge learned from RGB to optical flow field. We also propose a Parallel Pair Discriminant Correlation Analysis (PPDCA) to fuse the multiple layers' spatiotemporal information into a compact hierarchal action representation. The experimental results show that there is a good balance between accuracy and dimension in our proposed hierarchical spatiotemporal information, and our method not only outperforms the single layer Res3D methods but also achieves recognition performance comparable to that of state-of-the-art methods.
引用
收藏
页码:596 / 602
页数:7
相关论文
共 36 条
  • [1] [Anonymous], 2017, ABS170805038 CORR
  • [2] [Anonymous], 2017, IEEE T PATTERN ANAL
  • [3] [Anonymous], 2014, ADV NEURAL INFORM PR
  • [4] [Anonymous], 2011, P IEEE INT C COMP VI
  • [5] [Anonymous], 2013, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2012.59
  • [6] [Anonymous], PROC CVPR IEEE
  • [7] [Anonymous], P ACM MULT C
  • [8] [Anonymous], P INT C MACH LEARN
  • [9] [Anonymous], PROC CVPR IEEE
  • [10] [Anonymous], 2015, PROC CVPR IEEE