3D Action Recognition Exploiting Hierarchical Deep Feature Fusion Model

被引：0

作者：

Thien Huynh-The ^{[1
]}

Hua, Cam-Hao ^{[2
]}

Nguyen Anh Tu ^{[3
]}

Kim, Jae-Woo ^{[1
]}

Kim, Seung-Hwan ^{[1
]}

Kim, Dong-Seong ^{[1
]}

机构：

[1] Kumoh Natl Inst Technol, Gumi Si, South Korea

[2] Kyung Hee Univ, Gwangju Si, South Korea

[3] Nazarbayev Univ, Nur Sultan, Kazakhstan

来源：

PROCEEDINGS OF THE 2020 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) | 2020年

基金：

新加坡国家研究基金会;

关键词：

Human action recognition; geometric feature; deep feature fusion; convolutional neural network;

D O I：

10.1109/imcom48794.2020.9001766

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Numerous existing handcrafted feature-based and conventional machine learning-based approaches cannot seize the intensive correlations of skeleton structure in the spatiotemporal dimension. On another hand, some modern methods exploiting Long Short Term Memory (LSTM) to learn temporal action attribute, which lack an efficient scheme of revealing high-level informative features. To handle the aforementioned issues, this research introduces a novel hierarchical deep feature fusion model for 3D skeleton-based human action recognition, in which the deep information for modeling human appearance and action dynamic is gained by Convolutional Neural Networks (CNNs). The deep features of geometrical joint distance and orientation are extracted via a multi-stream CNN architecture to uncovering the hidden correlations in both the spatial and temporal dimensions. The experimental results on the NTU RGB+D dataset demonstrates the superiority of the proposed fusion model against several recently deep learning (DL)-based action recognition approaches.

引用

页数：3

共 16 条

[1]

Banos O, 2015, IEEE ENG MED BIO, P5062, DOI 10.1109/EMBC.2015.7319529

[2] Bimodal learning via trilogy of skip-connection deep networks for diabetic retinopathy risk progression identification [J].