Robust action recognition using local motion and group sparsity

被引:55
|
作者
Cho, Jungchan [1 ]
Lee, Minsik [1 ]
Chang, Hyung Jin [2 ]
Oh, Songhwai [1 ]
机构
[1] Seoul Natl Univ, ASRI, Dept Elect & Comp Engn, Seoul, South Korea
[2] Univ London Imperial Coll Sci Technol & Med, Dept Elect & Elect Engn, London SW7 2AZ, England
基金
新加坡国家研究基金会;
关键词
Action recognition; Motion descriptor; Sparse representation; Dynamic scene understanding;
D O I
10.1016/j.patcog.2013.12.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing actions in a video is a critical step for making many vision-based applications possible and has attracted much attention recently. However, action recognition in a video is a challenging task due to wide variations within an action, camera motion, cluttered background, and occlusions, to name a few. While dense sampling based approaches are currently achieving the state-of-the-art performance in action recognition, they do not perform well for many realistic video sequences since, by considering every motion found in a video equally, the discriminative power of these approaches is often reduced due to clutter motions, such as background changes and camera motions. In this paper, we robustly identify local motions of interest in an unsupervised manner by taking advantage of group sparsity. In order to robustly classify action types, we emphasize local motion by combining local motion descriptors and full motion descriptors and apply group sparsity to the emphasized motion features using the multiple kernel method. In experiments, we show that different types of actions can be well recognized using a small number of selected local motion descriptors and the proposed algorithm achieves the state-of-the-art performance on popular benchmark datasets, outperforming existing methods. We also demonstrate that the group sparse representation with the multiple kernel method can dramatically improve the action recognition performance. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1813 / 1825
页数:13
相关论文
共 50 条
  • [41] A New System of Face Recognition: Using Fuzziness and Sparsity
    Tan, Yuanpeng
    Cao, Feilong
    Cai, Miaomiao
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2015, 23 (06) : 829 - 844
  • [42] Using Color Texture Sparsity for Facial Expression Recognition
    Lee, Seung Ho
    Kim, Hyungil
    Ro, Yong Man
    Plataniotis, Konstantinos N.
    2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
  • [43] A Robust and Efficient Video Representation for Action Recognition
    Heng Wang
    Dan Oneata
    Jakob Verbeek
    Cordelia Schmid
    International Journal of Computer Vision, 2016, 119 : 219 - 238
  • [44] Rank-GCN for Robust Action Recognition
    Lee, Haetsal
    Park, Unsang
    Kim, Ig-Jae
    Cho, Junghyun
    IEEE ACCESS, 2022, 10 : 91739 - 91749
  • [45] ACTION RECOGNITION BASED ON SPARSE MOTION TRAJECTORIES
    Jargalsaikhan, Iveel
    Little, Suzanne
    Direkoglu, Cem
    O'Connor, Noel E.
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3982 - 3985
  • [46] TMF: Temporal Motion and Fusion for action recognition
    Wang, Yanze
    Ye, Junyong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 213
  • [47] A Robust and Efficient Video Representation for Action Recognition
    Wang, Heng
    Oneata, Dan
    Verbeek, Jakob
    Schmid, Cordelia
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 119 (03) : 219 - 238
  • [48] Combined trajectories for action recognition based on saliency detection and motion boundary
    Wang, Xiaofang
    Qi, Chun
    Lin, Fei
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 57 : 91 - 102
  • [49] Action Recognition With Motion Diversification and Dynamic Selection
    Zhuang, Peiqin
    Guo, Yu
    Yu, Zhipeng
    Zhou, Luping
    Bai, Lei
    Liang, Ding
    Wang, Zhiyong
    Wang, Yali
    Ouyang, Wanli
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4884 - 4896
  • [50] A learnable motion preserving pooling for action recognition
    Li, Tankun
    Chan, Kwok Leung
    Tjahjadi, Tardi
    IMAGE AND VISION COMPUTING, 2024, 151