Semi-Supervised Multiple Feature Analysis for Action Recognition

Cited by: 58

Authors:

Wang, Sen [1]

Ma, Zhigang [2]

Yang, Yi [1]

Li, Xue [1,3]

Pang, Chaoyi [4]

Hauptmann, Alexander G. [5]

Affiliations:

[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia

[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA

[3] Chongqing Univ, Key Lab Dependable Serv Comp, Cyber Phys Soc, Chongqing 630044, Peoples R China

[4] CSIRO, Australian E Hlth Res Ctr, Brisbane, Qld, Australia

[5] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2014 / Vol. 16 / No. 2

Funding:

Australian Research Council;

Keywords:

Human action recognition; multiple feature learning; semi-supervised learning; shared structural analysis; IMAGE ANNOTATION; FRAMEWORK; WEB;

DOI:

10.1109/TMM.2013.2293060

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper presents a semi-supervised method for categorizing human actions using multiple visual features. The proposed algorithm simultaneously learns multiple features from a small number of labeled videos, and automatically utilizes data distributions between labeled and unlabeled data to boost the recognition performance. Shared structural analysis is applied in our approach to discover a common subspace shared by each type of feature. In the subspace, the proposed algorithm is able to characterize more discriminative information of each feature type. Additionally, data distribution information of each type of feature has been preserved. The aforementioned attributes make our algorithm robust for action recognition, especially when only limited labeled training samples are provided. Extensive experiments have been conducted on both the choreographed and the realistic video datasets, including KTH, Youtube action and UCF50. Experimental results show that our method outperforms several state-of-the-art algorithms. Most notably, much better performances have been achieved when there are only a few labeled training samples.


Pages: 289-298

Page count: 10
