Multi-view representation learning for multi-view action recognition

被引:26
作者
Hao, Tong [1 ]
Wu, Dan [1 ]
Wang, Qian [1 ]
Sun, Jin-Sheng [1 ,2 ]
机构
[1] Tianjin Normal Univ, Tianjin Key Lab Anim & Plant Resistance, Coll Life Sci, Tianjin 300387, Peoples R China
[2] Tianjin Aquat Anim Infect Dis Control & Prevent C, Tianjin 300221, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-view learning; Multi-task learning; Sparse coding; Action recognition; MODEL; DICTIONARY;
D O I
10.1016/j.jvcir.2017.01.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although multiple methods have been proposed for human action recognition, the existing multi-view approaches cannot well discover meaningful relationship among multiple action categories from different views. To handle this problem, this paper proposes an multi-view learning approach for multi-view action recognition. First, the proposed method leverages the popular visual representation method, bag of -visual-words (BoVW)/fisher vector (FV), to represent individual videos in each view. Second, the sparse coding algorithm is utilized to transfer the low-level features of various views into the discriminative and high-level semantics space. Third, we employ the multi-task learning (MTL) approach for joint action modeling and discovery of latent relationship among different action categories. The extensive experimental results on (MI)-I-2 and IXMAS datasets have demonstrated the effectiveness of our proposed approach. Moreover, the experiments further demonstrate that the discovered latent relationship can benefit multi-view model learning to augment the performance of action recognition. (C) 2017 Published by Elsevier Inc.
引用
收藏
页码:453 / 460
页数:8
相关论文
共 54 条
[51]  
Yilmaz A, 2005, PROC CVPR IEEE, P984
[52]  
Zha HY, 2002, ADV NEUR IN, V14, P1057
[53]   Thermoelectric effect in an Aharonov-Bohm ring with an embedded quantum dot [J].
Zheng, Jun ;
Chi, Feng ;
Lu, Xiao-Dong ;
Zhang, Kai-Cheng .
NANOSCALE RESEARCH LETTERS, 2012, 7 :1-7
[54]  
Zhou Jiayu, 2011, Adv Neural Inf Process Syst, V2011, P702