MMA: a multi-view and multi-modality benchmark dataset for human action recognition

被引:5
作者
Gao, Zan [1 ,2 ]
Han, Tao-tao [1 ,2 ]
Zhang, Hua [1 ,2 ]
Xue, Yan-bing [1 ,2 ]
Xu, Guang-ping [1 ,2 ]
机构
[1] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Tianjin Key Lab Intelligence Comp & Novel Softwar, Tianjin 300384, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Benchmark dataset; Multi-view; Multi-modalidy; Cross-view; Multi-task; Cross-domain; FEATURE-SELECTION;
D O I
10.1007/s11042-018-5833-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition is an active research topic in both computer vision and machine learning communities, which has broad applications including surveillance, biometrics and human computer interaction. In the past decades, although some famous action datasets have been released, there still exist limitations, including the limited action categories and samples, camera views and variety of scenarios. Moreover, most of them are designed for a subset of the learning problems, such as single-view learning problem, cross-view learning problem and multi-task learning problem. In this paper, we introduce a multi-view, multi-modality benchmark dataset for human action recognition (abbreviated to MMA). MMA consists of 7080 action samples from 25 action categories, including 15 single-subject actions and 10 double-subject interactive actions in three views of two different scenarios. Further, we systematically benchmark the state-of-the-art approaches on MMA with respective to all three learning problems by different temporal-spatial feature representations. Experimental results demonstrate that MMA is challenging on all three learning problems due to significant intra-class variations, occlusion issues, views and scene variations, and multiple similar action categories. Meanwhile, we provide the baseline for the evaluation of existing state-of-the-art algorithms.
引用
收藏
页码:29383 / 29404
页数:22
相关论文
共 44 条
[1]  
[Anonymous], 2012, COMPUTER SCI
[2]  
[Anonymous], IEEE T PATTERN ANAL
[3]  
[Anonymous], IJCV
[4]  
[Anonymous], IEEE C COMP VIS PATT
[5]  
[Anonymous], NTU RGB D LARGE SCAL
[6]  
[Anonymous], 2014, CVPR
[7]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[8]  
Chen C, 2015, IEEE IMAGE PROC, P168, DOI 10.1109/ICIP.2015.7350781
[9]  
Chen G, 2015, HUMAN ACTION RECOGNI, P418
[10]  
Cheng ZW, 2012, LECT NOTES COMPUT SC, V7584, P52, DOI 10.1007/978-3-642-33868-7_6