View-Invariant Action Recognition Using Latent Kernelized Structural SVM

被引：0

作者：

Wu, Xinxiao ^{[1
]}

Jia, Yunde ^{[1
]}

机构：

[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China

来源：

COMPUTER VISION - ECCV 2012, PT V | 2012年 / 7576卷

关键词：

View-invariant action recognition; latent kernelized structural SVM; correlation feature; multiple level features;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper goes beyond recognizing human actions from a fixed view and focuses on action recognition from an arbitrary view. A novel learning algorithm, called latent kernelized structural SVM, is proposed for the view-invariant action recognition, which extends the kernelized structural SVM framework to include latent variables. Due to the changing and frequently unknown positions of the camera, we regard the view label of action as a latent variable and implicitly infer it during both learning and inference. Motivated by the geometric correlation between different views and semantic correlation between different action classes, we additionally propose a mid-level correlation feature which describes an action video by a set of decision values from the pre-learned classifiers of all the action classes from all the views. Each decision value captures both geometric and semantic correlations between the action video and the corresponding action class from the corresponding view. After that, we combine the low-level visual cue, mid-level correlation description, and high-level label information into a novel nonlinear kernel under the latent kernelized structural SVM framework. Extensive experiments on multi-view IXMAS and MuHAVi action datasets demonstrate that our method generally achieves higher recognition accuracy than other state-of-the-art methods.

引用

页码：411 / 424

页数：14

共 50 条

[1] Latent Multitask Learning for View-Invariant Action Recognition
Mahasseni, Behrooz
Todorovic, Sinisa
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3128 - 3135
[2] View-invariant action recognition using Fundamental Ratios
Shen, Yuping
Foroosh, Hassan
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3216 - 3221
[3] View-invariant Action Recognition in Surveillance Videos
Zhang, Fang
Wang, Yunhong
Zhang, Zhaoxiang
2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 580 - 583
[4] VIEW-INVARIANT ACTION RECOGNITION USING CROSS RATIOS ACROSS FRAMES
Zhang, Yeyin
Huang, Kaiqi
Huang, Yongzhen
Tan, Tieniu
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3549 - 3552
[5] View-Invariant Action Recognition from Point Triplets
Shen, Yuping
Foroosh, Hassan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (10) : 1898 - 1905
[6] Towards Fast, View-Invariant Human Action Recognition
Cherla, Srikanth
Kulkarni, Kaustubh
Kale, Amit
Ramasubramanian, V.
2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 1650 - 1657
[7] A survey about view-invariant human action recognition
Nghia Pham Trong
Anh Truong Minh
Nguyen, Hung
Kazunori, Kotani
Bac Le Hoai
2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2017, : 699 - 704
[8] On Temporal Order Invariance for View-Invariant Action Recognition
Anwaar-ul-Haq
Gondal, Iqbal
Murshed, Manzur
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (02) : 203 - 211
[9] Dual-attention Network for View-invariant Action Recognition
Gedamu Alemu Kumie
Maregu Assefa Habtie
Tewodros Alemu Ayall
Changjun Zhou
Huawen Liu
Abegaz Mohammed Seid
Aiman Erbad
Complex & Intelligent Systems, 2024, 10 : 305 - 321
[10] Dual-attention Network for View-invariant Action Recognition
Kumie, Gedamu Alemu
Habtie, Maregu Assefa
Ayall, Tewodros Alemu
Zhou, Changjun
Liu, Huawen
Seid, Abegaz Mohammed
Erbad, Aiman
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 305 - 321

← 1 2 3 4 5 →