Multi-Task Clustering of Human Actions by Sharing Information

Cited by: 31

Authors:

Yan, Xiaoqiang [1]

Hu, Shizhe [1]

Ye, Yangdong [1]

Affiliations:

[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450000, Henan, Peoples R China

Source:

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017

Funding:

National Natural Science Foundation of China;

Keywords:

HUMAN ACTION CATEGORIES; RECOGNITION;

DOI:

10.1109/CVPR.2017.431

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Sharing information between multiple tasks can enhance the accuracy of human action recognition systems. However, using shared information to improve multi-task human action clustering has never been considered before, and cannot be achieved using existing clustering methods. In this work, we present a novel and effective Multi-Task Information Bottleneck (MTIB) clustering method, which is capable of exploring the shared information between multiple action clustering tasks to improve the performance of each individual task. Our motivation is that different action collections always share many similar action patterns, and exploiting the shared information can lead to improved performance. Specifically, MTIB formulates this problem as an information loss minimization function. In this function, the shared information is quantified by the distributional correlation of clusters in different tasks, which is based on a high-level common vocabulary constructed through a novel agglomerative information maximization method. Extensive experiments on two kinds of challenging data sets, realistic action data sets (HMDB & UCF50, Olympic & YouTube) and cross-view data sets (IXMAS, WVU), show that the proposed approach compares favorably to state-of-the-art methods.


Pages: 4049-4057

Page count: 9

