Exploring 3D Human Action Recognition: from Offline to Online

Cited by: 8
Authors
Li, Rui [1]
Liu, Zhenyu [1]
Tan, Jianrong [1]
Affiliations
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
action recognition; skeletal sequence; depth map; online segmentation; Kinect; VARIABLE SELECTION; SEQUENCES; LATENCY;
DOI
10.3390/s18020633
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302; 081704;
Abstract
With the introduction of cost-effective depth sensors, a tremendous amount of research has been devoted to human action recognition using 3D motion data. However, most existing methods work in an offline fashion, i.e., they operate on a segmented sequence. Only a few methods are specifically designed for online action recognition, which continually predicts action labels as a streaming sequence proceeds. In view of this, we pose a question: can we draw inspiration and borrow techniques or descriptors from existing offline methods, and then apply them to online action recognition? Note that extending offline techniques or descriptors to online applications is not straightforward, since at least two problems, namely real-time performance and sequence segmentation, are usually not considered in offline action recognition. In this paper, we give a positive answer to the question. To develop applicable online action recognition methods, we carefully explore feature extraction, sequence segmentation, computational costs, and classifier selection. The effectiveness of the developed methods is validated on the MSR 3D Online Action dataset and the MSR Daily Activity 3D dataset.
Pages: 24
Related papers
43 in total
[11]  
Devanne M., 2014, IEEE T SYST MAN CYB, V45, P1023
[12]   Motion segment decomposition of RGB-D sequences for human behavior understanding [J].
Devanne, Maxime ;
Berretti, Stefano ;
Pala, Pietro ;
Wannous, Hazem ;
Daoudi, Mohamed ;
Del Bimbo, Alberto .
PATTERN RECOGNITION, 2017, 61 :222-233
[13]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[14]   Exploring the Trade-off Between Accuracy and Observational Latency in Action Recognition [J].
Ellis, Chris ;
Masood, Syed Zain ;
Tappen, Marshall F. ;
LaViola, Joseph J., Jr. ;
Sukthankar, Rahul .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (03) :420-436
[15]   Continuous Human Action Recognition Using Depth-MHI-HOG and a Spotter Model [J].
Eum, Hyukmin ;
Yoon, Changyong ;
Lee, Heejin ;
Park, Mignon .
SENSORS, 2015, 15 (03) :5197-5227
[16]   Continuous Gesture Recognition from Articulated Poses [J].
Evangelidis, Georgios D. ;
Singh, Gurkirt ;
Horaud, Radu .
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 :595-607
[17]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[18]  
Fanello SR, 2013, J MACH LEARN RES, V14, P2617
[19]  
Gong D, 2012, LECT NOTES COMPUTER, V7574
[20]   Structured Time Series Analysis for Human Action Segmentation and Recognition [J].
Gong, Dian ;
Medioni, Gerard ;
Zhao, Xuemei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) :1414-1427