Classification of Multi-class Daily Human Motion using Discriminative Body Parts and Sentence Descriptions

被引：8

作者：

Goutsu, Yusuke ^{[1
]}

Takano, Wataru ^{[2
]}

Nakamura, Yoshihiko ^{[3
]}

机构：

[1] AIST, Comp Vis Res Grp, Cent 1,1-1-1 Umezono, Tsukuba, Ibaraki, Japan

[2] Osaka Univ, Ctr Math Modeling & Data Sci, 1-3 Machikaneyamacho, Toyonaka, Osaka, Japan

[3] Univ Tokyo, Mechanoinformat, Bunkyo Ku, 7-3-1 Hongo, Tokyo, Japan

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2018年 / 126卷 / 05期

基金：

日本学术振兴会;

关键词：

Hidden Markov model; Fisher vector; Multiple kernel learning; Motion classification; Multi-class; Sentence description; PARTIAL LEAST-SQUARES; ACTION RECOGNITION; POSE; IMITATION; LATENCY;

D O I：

10.1007/s11263-017-1053-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a motion model that focuses on the discriminative parts of the human body related to target motions to classify human motions into specific categories, and apply this model to multi-class daily motion classifications. We extend this model to a motion recognition system which generates multiple sentences associated with human motions. The motion model is evaluated with the following four datasets acquired by a Kinect sensor or multiple infrared cameras in a motion capture studio: UCF-kinect; UT-kinect; HDM05-mocap; and YNL-mocap. We also evaluate the sentences generated from the dataset of motion and language pairs. The experimental results indicate that the motion model improves classification accuracy and our approach is better than other state-of-the-art methods for specific datasets, including human-object interactions with variations in the duration of motions, such as daily human motions. We achieve a classification rate of 81.1% for multi-class daily motion classifications in a non cross-subject setting. Additionally, the sentences generated by the motion recognition system are semantically and syntactically appropriate for the description of the target motion, which may lead to human-robot interaction using natural language.

引用

页码：495 / 514

页数：20

共 45 条

[1] Fusion of Skeletal and Silhouette-based Features for Human Action Recognition with RGB-D Devices [J].

Andre Chaaraoui, Alexandros ;

Ramon Padilla-Lopez, Jose ;

Florez-Revuelta, Francisco .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :91-97

[2]

[Anonymous], 2007, Documentation mocap database hdm05

[3]

[Anonymous], 1991, Origins of the Modern Mind: Three stages in the evolution of culture and cognition

[4]

[Anonymous], 2012, P ACM INT C MULT NAR, DOI DOI 10.1145/2393347.2396382

[5] Partial least squares for discrimination [J].

Barker, M ;

Rayens, W .

JOURNAL OF CHEMOMETRICS, 2003, 17 (03) :166-173

[6] Discriminative and adaptive imitation in uni-manual and bi-manual tasks [J].

Billard, Aude G. ;

Calinon, Sylvain ;

Guenter, Florent .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (05) :370-384

[7] Bio-inspired Dynamic 3D Discriminative Skeletal Features for Human Action Recognition [J].

Chaudhry, Rizwan ;

Ofli, Ferda ;

Kurillo, Gregorij ;

Bajcsy, Ruzena ;

Vidal, Rene .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :471-478

[8]

Devanne M, 2013, LECT NOTES COMPUT SC, V8158, P456, DOI 10.1007/978-3-642-41190-8_49

[9]

Dong G., 1999, Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P43, DOI [DOI 10.1145/312129.312191, 10.1145/312129., DOI 10.1145/312129]

[10] Exploring the Trade-off Between Accuracy and Observational Latency in Action Recognition [J].

Ellis, Chris ;

Masood, Syed Zain ;

Tappen, Marshall F. ;

LaViola, Joseph J., Jr. ;

Sukthankar, Rahul .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (03) :420-436

← 1 2 3 4 5 →