Action Recognition Based on 3D Skeleton and RGB Frame Fusion

被引：0

作者：

Liu, Guiyu ^{[1
]}

Qian, Jiuchao ^{[1
]}

Wen, Fei ^{[1
]}

Zhu, Xiaoguang ^{[1
]}

Ying, Rendong ^{[1
]}

Liu, Peilin ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China

来源：

2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2019年

关键词：

SEGMENTATION;

D O I：

10.1109/iros40897.2019.8967570

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Action recognition has wide applications in assisted living, health monitoring, surveillance, and human-computer interaction. In traditional action recognition methods, RGB video-based ones are effective but computationally inefficient, while skeleton-based ones are computationally efficient but do not make use of low-level detail information. This work considers action recognition based on a multimodal fusion between the 3D skeleton and the RGB image. We design a neural network that uses a 3D skeleton sequence and a single middle frame from an RGB video as input. Specifically, our method picks up one frame in a video and extracts spatial features from it using two attention modules, a self-attention module and a skeleton-attention module. Further, temporal features are extracted from the skeleton sequence via a BI-LSTM sub-network. Finally, the spatial features and the temporal features are combined via a feature fusion network for action classification. A distinct feature of our method is that it uses only a single RGB frame rather than an RGB video. Accordingly, it has a light-weighted architecture and is more efficient than RGB video-based methods. Comparative evaluation on two public datasets, NTU-RGBD and SYSU, demonstrates that, our method can achieve competitive performance compared with state-of-the-art methods.

引用

页码：258 / 264

页数：7

共 50 条

[21] Tripool: Graph triplet pooling for 3D skeleton-based action recognition
Peng, Wei
Hong, Xiaopeng
Zhao, Guoying
PATTERN RECOGNITION, 2021, 115
[22] Mix Dimension in Poincare Geometry for 3D Skeleton-based Action Recognition
Peng, Wei
Shi, Jingang
Xia, Zhaoqiang
Zhao, Guoying
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1432 - 1440
[23] INVESTIGATION OF DIFFERENT SKELETON FEATURES FOR CNN-BASED 3D ACTION RECOGNITION
Ding, Zewei
Wang, Pichao
Ogunbona, Philip O.
Li, Wanqing
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
[24] Accurate and Real-time Human Action Recognition Based on 3D Skeleton
Chen, Hongzhao
Wang, Guijin
He, Li
2013 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING AND PROCESSING TECHNOLOGY, 2013, 9045
[25] Recurrent Neural Network based Action Recognition from 3D Skeleton Data
Shukla, Parul
Biswas, Kanad K.
Kalra, Prem K.
2017 13TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS (SITIS), 2017, : 339 - 345
[26] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Guan, Shannan
Lu, Haiyan
Zhu, Linchao
Fang, Gengfa
NEUROCOMPUTING, 2022, 514 : 256 - 267
[27] HIF3D: Handwriting -Inspired Features for 3D skeleton-based action recognition
Boulahia, Said Yacine
Anquetil, Eric
Kulpa, Richard
Multon, Franck
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 985 - 990
[28] Behavior Recognition Based on 3D Skeleton Features
Liu, W. T.
Lu, T. W.
Miao, S. J.
Peng, L.
Min, F.
INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENVIRONMENTAL ENGINEERING (CSEE 2015), 2015, : 760 - 765
[29] Understanding the Gap between 2D and 3D Skeleton-Based Action Recognition
Elias, Petr
Sedmidubsky, Jan
Zezula, Pavel
2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019), 2019, : 192 - 195
[30] Action Recognition Based on Features Fusion and 3D Convolutional Neural Networks
Liu, Lulu
Hu, Fangyu
Zhou, Jiahui
PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2016, : 178 - 181

← 1 2 3 4 5 →