View-Invariant Action Recognition Using Latent Kernelized Structural SVM

被引:0
|
作者
Wu, Xinxiao [1 ]
Jia, Yunde [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
来源
COMPUTER VISION - ECCV 2012, PT V | 2012年 / 7576卷
关键词
View-invariant action recognition; latent kernelized structural SVM; correlation feature; multiple level features;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper goes beyond recognizing human actions from a fixed view and focuses on action recognition from an arbitrary view. A novel learning algorithm, called latent kernelized structural SVM, is proposed for the view-invariant action recognition, which extends the kernelized structural SVM framework to include latent variables. Due to the changing and frequently unknown positions of the camera, we regard the view label of action as a latent variable and implicitly infer it during both learning and inference. Motivated by the geometric correlation between different views and semantic correlation between different action classes, we additionally propose a mid-level correlation feature which describes an action video by a set of decision values from the pre-learned classifiers of all the action classes from all the views. Each decision value captures both geometric and semantic correlations between the action video and the corresponding action class from the corresponding view. After that, we combine the low-level visual cue, mid-level correlation description, and high-level label information into a novel nonlinear kernel under the latent kernelized structural SVM framework. Extensive experiments on multi-view IXMAS and MuHAVi action datasets demonstrate that our method generally achieves higher recognition accuracy than other state-of-the-art methods.
引用
收藏
页码:411 / 424
页数:14
相关论文
共 50 条
  • [1] Latent Multitask Learning for View-Invariant Action Recognition
    Mahasseni, Behrooz
    Todorovic, Sinisa
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3128 - 3135
  • [2] View-invariant action recognition using Fundamental Ratios
    Shen, Yuping
    Foroosh, Hassan
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3216 - 3221
  • [3] View-invariant Action Recognition in Surveillance Videos
    Zhang, Fang
    Wang, Yunhong
    Zhang, Zhaoxiang
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 580 - 583
  • [4] VIEW-INVARIANT ACTION RECOGNITION USING CROSS RATIOS ACROSS FRAMES
    Zhang, Yeyin
    Huang, Kaiqi
    Huang, Yongzhen
    Tan, Tieniu
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3549 - 3552
  • [5] View-Invariant Action Recognition from Point Triplets
    Shen, Yuping
    Foroosh, Hassan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (10) : 1898 - 1905
  • [6] Towards Fast, View-Invariant Human Action Recognition
    Cherla, Srikanth
    Kulkarni, Kaustubh
    Kale, Amit
    Ramasubramanian, V.
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 1650 - 1657
  • [7] A survey about view-invariant human action recognition
    Nghia Pham Trong
    Anh Truong Minh
    Nguyen, Hung
    Kazunori, Kotani
    Bac Le Hoai
    2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2017, : 699 - 704
  • [8] On Temporal Order Invariance for View-Invariant Action Recognition
    Anwaar-ul-Haq
    Gondal, Iqbal
    Murshed, Manzur
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (02) : 203 - 211
  • [9] Dual-attention Network for View-invariant Action Recognition
    Gedamu Alemu Kumie
    Maregu Assefa Habtie
    Tewodros Alemu Ayall
    Changjun Zhou
    Huawen Liu
    Abegaz Mohammed Seid
    Aiman Erbad
    Complex & Intelligent Systems, 2024, 10 : 305 - 321
  • [10] Dual-attention Network for View-invariant Action Recognition
    Kumie, Gedamu Alemu
    Habtie, Maregu Assefa
    Ayall, Tewodros Alemu
    Zhou, Changjun
    Liu, Huawen
    Seid, Abegaz Mohammed
    Erbad, Aiman
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 305 - 321