Motion segment decomposition of RGB-D sequences for human behavior understanding

Cited by: 41
Authors
Devanne, Maxime [1, 2]
Berretti, Stefano [2]
Pala, Pietro [2]
Wannous, Hazem [1]
Daoudi, Mohamed [1]
Del Bimbo, Alberto [2]
Affiliations
[1] Univ Lille, Telecom Lille, CNRS, UMR 9189, CRIStAL, F-59000 Lille, France
[2] Univ Florence, MICC, Florence, Italy
Keywords
3D human behavior understanding; Temporal modeling; Shape space analysis; Online activity detection; Action recognition; Dictionary
DOI
10.1016/j.patcog.2016.07.041
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a framework for analyzing and understanding human behavior from depth videos. The proposed solution first employs shape analysis of the human pose across time to decompose the full motion into short temporal segments representing elementary motions. Then, each segment is characterized by human motion and depth appearance around hand joints to describe the change in pose of the body and the interaction with objects. Finally, the sequence of temporal segments is modeled through a Dynamic Naive Bayes classifier, which captures the dynamics of elementary motions characterizing human behavior. Experiments on four challenging datasets evaluate the potential of the proposed approach in different contexts, including gesture or activity recognition and online activity detection. Competitive results in comparison with state-of-the-art methods are reported. (C) 2016 Elsevier Ltd. All rights reserved.
Pages: 222-233
Page count: 12