Action Recognition Based on 3D Skeleton and RGB Frame Fusion

Cited by: 0

Authors:

Liu, Guiyu [1]

Qian, Jiuchao [1]

Wen, Fei [1]

Zhu, Xiaoguang [1]

Ying, Rendong [1]

Liu, Peilin [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China

Source:

2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2019

Keywords:

SEGMENTATION;

DOI:

10.1109/iros40897.2019.8967570

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Action recognition has wide applications in assisted living, health monitoring, surveillance, and human-computer interaction. Among traditional action recognition methods, RGB video-based approaches are effective but computationally expensive, while skeleton-based approaches are computationally efficient but do not exploit low-level detail information. This work considers action recognition based on multimodal fusion of the 3D skeleton and the RGB image. We design a neural network that takes a 3D skeleton sequence and a single middle frame from an RGB video as input. Specifically, our method selects one frame from a video and extracts spatial features from it using two attention modules: a self-attention module and a skeleton-attention module. Temporal features are extracted from the skeleton sequence via a Bi-LSTM sub-network. Finally, the spatial and temporal features are combined by a feature fusion network for action classification. A distinctive feature of our method is that it uses only a single RGB frame rather than an RGB video. Accordingly, it has a lightweight architecture and is more efficient than RGB video-based methods. Comparative evaluation on two public datasets, NTU-RGBD and SYSU, demonstrates that our method achieves performance competitive with state-of-the-art methods.
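The data flow described in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the abstract does not specify how the skeleton-attention module or the fusion network work, so the Gaussian weighting around joint locations, the attention-weighted pooling, and the concatenation-based fusion below are assumptions made purely for illustration, and all function names (`middle_frame_index`, `skeleton_attention_map`, `attend`, `fuse`) are hypothetical. The self-attention module and the Bi-LSTM are omitted; `temporal_feat` stands in for the Bi-LSTM output.

```python
import math


def middle_frame_index(num_frames):
    """Return the index of the middle frame of a clip."""
    if num_frames <= 0:
        raise ValueError("clip must contain at least one frame")
    return num_frames // 2


def skeleton_attention_map(joints_2d, height, width, sigma=1.0):
    """Build a spatial weight map that peaks at the given joint locations.

    ASSUMPTION: a sum of Gaussians centred on 2D joint positions (x, y),
    normalised to sum to 1. The paper's actual module may differ.
    """
    if not joints_2d:
        raise ValueError("at least one joint is required")
    weights = [[0.0] * width for _ in range(height)]
    for jx, jy in joints_2d:
        for y in range(height):
            for x in range(width):
                dist_sq = (x - jx) ** 2 + (y - jy) ** 2
                weights[y][x] += math.exp(-dist_sq / (2.0 * sigma ** 2))
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]


def attend(feature_map, attention):
    """Pool a (height x width x channels) feature map into one vector
    using the attention weights (weighted sum over spatial locations)."""
    channels = len(feature_map[0][0])
    pooled = [0.0] * channels
    for y, row in enumerate(feature_map):
        for x, vec in enumerate(row):
            for c in range(channels):
                pooled[c] += attention[y][x] * vec[c]
    return pooled


def fuse(spatial_feat, temporal_feat):
    """ASSUMPTION: fuse the two modalities by simple concatenation."""
    return list(spatial_feat) + list(temporal_feat)
```

In a full model, the fused vector would be passed to a classifier; here the point is only that a single frame contributes the spatial vector while the whole skeleton sequence contributes the temporal one.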


Pages: 258-264

Page count: 7

50 entries in total

[1] Infrared and 3D Skeleton Feature Fusion for RGB-D Action Recognition
De Boissiere, Alban Main
Noumeir, Rita
IEEE ACCESS, 2020, 8 (08) : 168297 - 168308
[2] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
Weiyao, Xu
Muqing, Wu
Min, Zhao
Ting, Xia
IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164
[3] Action Recognition Based on Adaptive Fusion of RGB and Skeleton Features
Guo Fuzheng
Kong Jun
Jiang Min
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (20)
[4] Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
Zhu, Xiaoguang
Zhu, Ye
Wang, Haoyu
Wen, Honglin
Yan, Yan
Liu, Peilin
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (03)
[5] A three-stream fusion network for 3D skeleton-based action recognition
Fang, Ming
Liu, Qi
Ren, Jianping
Li, Jie
Du, Xinning
Liu, Shuhua
MULTIMEDIA SYSTEMS, 2025, 31 (02)
[6] Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition
Banerjee, Avinandan
Singh, Pawan Kumar
Sarkar, Ram
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2206 - 2216
[7] 3D skeleton-based action recognition by representing motion capture sequences as 2D-RGB images
Laraba, Sohaib
Brahimi, Mohammed
Tilmanne, Joelle
Dutoit, Thierry
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2017, 28 (3-4)
[8] Human Action Recognition Based on Quaternion 3D Skeleton Representation
Xu Haiyang
Kong Jun
Jiang Min
LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (02)
[9] Action Recognition based on a mixture of RGB and Depth based skeleton
Das, Srijan
Koperski, Michal
Bremond, Francois
Francesca, Gianpiero
2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2017
[10] Arm-hand Action Recognition Based on 3D Skeleton Joints
Rui, Ling
Ma, Shi-wei
Wen, Jia-rui
Liu, Li-na
INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA 2016), 2016 : 326 - 332
