Learning universal multiview dictionary for human action recognition

Cited by: 36
Authors
Yao, Tingting [1, 2]
Wang, Zhiyong [1]
Xie, Zhao [2]
Gao, Jun [2]
Feng, David Dagan [1]
Affiliations
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
Funding
National Natural Science Foundation of China; Australian Research Council
Keywords
Dictionary learning; Sparse coding; Multiview learning; Action recognition; MOTION; PARTS;
DOI
10.1016/j.patcog.2016.11.012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, many sparse-coding-based approaches have been proposed for human action recognition. However, most of them focus on learning a discriminative dictionary without explicitly taking into account the common patterns shared among different action classes. In this paper, we propose a novel discriminative dictionary learning framework by formulating a universal dictionary which consists of a shared sub-dictionary and a set of class-specific sub-dictionaries. As a result, inter-class differences can be better characterized with sparse codes obtained from the class-specific dictionaries. In addition, group sparsity and locality constraints are utilized to preserve the relationship and structure among features. In order to leverage the benefits of multiple descriptors, a dictionary is learned for each view, and the corresponding sparse representations of those descriptors are fused in a low-dimensional feature space together with temporal information. The experimental results on three challenging datasets demonstrate that our method is able to achieve better performance than a number of state-of-the-art ones.
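The idea of a universal dictionary built from a shared sub-dictionary plus class-specific sub-dictionaries can be illustrated with a minimal sketch. This is not the authors' implementation: the dictionaries here are random rather than learned, the solver is plain ISTA for an l1-regularized least-squares problem, and the group sparsity, locality constraints, and multiview fusion described in the abstract are omitted. All function names, dimensions, and parameter values are assumptions chosen for illustration.

```python
import numpy as np


def normalize_columns(D):
    """Scale every dictionary atom (column) to unit l2 norm."""
    return D / np.linalg.norm(D, axis=0, keepdims=True)


def ista(D, x, lam=0.05, n_iter=500):
    """Approximately solve min_a 0.5*||x - D a||^2 + lam*||a||_1 with ISTA."""
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)  # 1 / Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        z = a - step * (D.T @ (D @ a - x))                        # gradient step
        a = np.sign(z) * np.maximum(np.abs(z) - lam * step, 0.0)  # soft threshold
    return a


def classify(x, D_shared, D_classes, lam=0.05):
    """Code x over [D_shared, D_1, ..., D_C]; pick the class with the smallest
    residual when x is rebuilt from the shared atoms plus that class's atoms."""
    D = np.hstack([D_shared] + list(D_classes))  # the "universal" dictionary
    a = ista(D, x, lam)
    k0 = D_shared.shape[1]
    shared_part = D_shared @ a[:k0]
    residuals, offset = [], k0
    for Dc in D_classes:
        kc = Dc.shape[1]
        recon = shared_part + Dc @ a[offset:offset + kc]
        residuals.append(np.linalg.norm(x - recon))
        offset += kc
    return int(np.argmin(residuals)), a


# Toy data (hypothetical): random dictionaries stand in for learned ones.
rng = np.random.default_rng(0)
dim, n_shared, n_per_class, n_classes = 40, 4, 5, 3
D_shared = normalize_columns(rng.standard_normal((dim, n_shared)))
D_classes = [normalize_columns(rng.standard_normal((dim, n_per_class)))
             for _ in range(n_classes)]

# A feature composed of shared atoms plus atoms of class index 1.
x = (D_shared @ np.array([0.5, 0.0, -0.5, 0.0])
     + D_classes[1] @ np.array([1.0, -0.8, 0.6, 0.0, 0.0]))

pred, codes = classify(x, D_shared, D_classes)
print("predicted class index:", pred)
print("non-zero coefficients:", np.count_nonzero(codes))
```

In this sketch the shared atoms absorb the component common to all classes, so the class decision rests on how well each class-specific block explains what remains. That is the intuition the abstract gives for why class-specific codes can characterize inter-class differences better.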
Pages: 236-244
Number of pages: 9