Attribute-based supervised deep learning model for action recognition

Cited by: 0
Authors
Kai Chen
Guiguang Ding
Jungong Han
Affiliations
[1] Tsinghua University, School of Software
[2] Northumbria University, Department of Computer Science
Source
Frontiers of Computer Science | 2017 / Vol. 11
Keywords
action recognition; convolutional neural network; attribute
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Deep learning has been the most popular feature learning method for a variety of computer vision applications over the past three years. Not surprisingly, this technique, especially the convolutional neural network (ConvNet) structure, has been exploited to identify human actions with great success. Most existing algorithms directly adopt the basic ConvNet structure, which works well under ideal conditions, e.g., stable lighting. However, its performance degrades significantly when image appearance varies within the same category. To solve this problem, we propose a new method that integrates semantically meaningful attributes into the hierarchical structure of deep learning. The idea is to add simple yet effective attributes to the category level of ConvNets so that the attribute information can drive the learning procedure. Experimental results on three popular action recognition databases show that embedding multiple auxiliary attributes into the deep learning framework improves classification accuracy significantly.
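The abstract does not give the training objective, so the following is only an illustrative sketch of one common way attribute supervision can be attached alongside a category classifier: a softmax cross-entropy term for the action class plus a weighted sigmoid cross-entropy term over binary attributes. The function names, the `attr_weight` parameter, and all numeric values are assumptions made for this example and are not taken from the paper.

```python
import math

def softmax_cross_entropy(logits, label):
    """Cross-entropy of a softmax over class logits against an integer label."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def sigmoid_bce(logits, targets):
    """Mean binary cross-entropy over independent attribute logits."""
    total = 0.0
    for z, t in zip(logits, targets):
        # Stable form of -[t*log(s) + (1-t)*log(1-s)] with s = sigmoid(z).
        total += max(z, 0.0) - z * t + math.log1p(math.exp(-abs(z)))
    return total / len(logits)

def joint_loss(class_logits, label, attr_logits, attr_targets, attr_weight=0.5):
    """Hypothetical joint objective: category loss plus weighted attribute loss."""
    return (softmax_cross_entropy(class_logits, label)
            + attr_weight * sigmoid_bce(attr_logits, attr_targets))

# Toy example: 3 action classes, 4 binary attributes (all values invented).
loss = joint_loss([2.0, 0.5, -1.0], 0, [1.5, -2.0, 0.3, -0.7], [1, 0, 1, 0])
```

In a setup like this, the attribute term penalizes the shared features whenever they fail to predict the annotated attributes, which is one plausible reading of attributes "driving" the learning procedure; the paper's actual formulation may differ.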
Pages: 219-229
Page count: 10