Action Recognition Based on 3D Skeleton and RGB Frame Fusion

Authors
Liu, Guiyu [1]
Qian, Jiuchao [1]
Wen, Fei [1]
Zhu, Xiaoguang [1]
Ying, Rendong [1]
Liu, Peilin [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
Source
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2019
Keywords
SEGMENTATION;
DOI
10.1109/iros40897.2019.8967570
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Action recognition has wide applications in assisted living, health monitoring, surveillance, and human-computer interaction. Among traditional action recognition methods, those based on RGB video are effective but computationally expensive, while those based on skeletons are computationally efficient but do not exploit low-level detail information. This work considers action recognition based on multimodal fusion of the 3D skeleton and the RGB image. We design a neural network that takes a 3D skeleton sequence and a single middle frame from an RGB video as input. Specifically, our method selects one frame from a video and extracts spatial features from it using two attention modules: a self-attention module and a skeleton-attention module. Temporal features are extracted from the skeleton sequence via a Bi-LSTM sub-network. Finally, the spatial and temporal features are combined by a feature fusion network for action classification. A distinctive feature of our method is that it uses only a single RGB frame rather than an RGB video. Accordingly, it has a lightweight architecture and is more efficient than RGB video-based methods. Comparative evaluation on two public datasets, NTU-RGBD and SYSU, demonstrates that our method achieves competitive performance compared with state-of-the-art methods.
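The data flow described in the abstract (one middle RGB frame, attention over its spatial features guided by the skeleton, then fusion with temporal features) can be illustrated with a minimal pure-Python sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names are hypothetical, the Gaussian heatmap stands in for the learned skeleton-attention module, the temporal vector is passed in directly instead of being produced by a Bi-LSTM, and plain concatenation stands in for the feature fusion network.

```python
import math


def middle_frame_index(num_frames):
    """Index of the single RGB frame to keep (the temporal midpoint)."""
    if num_frames <= 0:
        raise ValueError("video must contain at least one frame")
    return num_frames // 2


def skeleton_attention_mask(joints, height, width, sigma=1.0):
    """Hypothetical skeleton attention: a Gaussian bump around each projected
    joint (row, col) on an H x W grid, normalized so the weights sum to 1."""
    if not joints:
        raise ValueError("at least one joint is required")
    mask = []
    for r in range(height):
        row = []
        for c in range(width):
            # Keep the strongest response over all joints at this cell.
            w = max(
                math.exp(-((r - jr) ** 2 + (c - jc) ** 2) / (2 * sigma ** 2))
                for jr, jc in joints
            )
            row.append(w)
        mask.append(row)
    total = sum(sum(row) for row in mask)
    return [[w / total for w in row] for row in mask]


def attend(feature_map, mask):
    """Attention-weighted pooling of an H x W x C feature map into a length-C vector."""
    channels = len(feature_map[0][0])
    pooled = [0.0] * channels
    for feat_row, mask_row in zip(feature_map, mask):
        for feat, w in zip(feat_row, mask_row):
            for k in range(channels):
                pooled[k] += w * feat[k]
    return pooled


def fuse(spatial, temporal):
    """Simplest possible fusion (an assumption): concatenate the two vectors."""
    return list(spatial) + list(temporal)


# Usage with toy data: a 31-frame clip, a 3 x 3 feature grid with 2 channels,
# one joint projected onto the grid centre, and a stand-in temporal feature.
frame_idx = middle_frame_index(31)
mask = skeleton_attention_mask([(1, 1)], height=3, width=3)
spatial = attend([[[2.0, 4.0] for _ in range(3)] for _ in range(3)], mask)
fused = fuse(spatial, temporal=[0.5])
```

In a real system the fused vector would feed a classifier, and the attention weights would be learned rather than fixed by a hand-chosen `sigma`.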
Pages: 258-264
Page count: 7
Related papers
50 in total
  • [1] Infrared and 3D Skeleton Feature Fusion for RGB-D Action Recognition
    De Boissiere, Alban Main
    Noumeir, Rita
    IEEE ACCESS, 2020, 8 (08) : 168297 - 168308
  • [2] Fusion of Skeleton and RGB Features for RGB-D Human Action Recognition
    Weiyao, Xu
    Muqing, Wu
    Min, Zhao
    Ting, Xia
    IEEE SENSORS JOURNAL, 2021, 21 (17) : 19157 - 19164
  • [3] Action Recognition Based on Adaptive Fusion of RGB and Skeleton Features
    Guo Fuzheng
    Kong Jun
    Jiang Min
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (20)
  • [4] Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
    Zhu, Xiaoguang
    Zhu, Ye
    Wang, Haoyu
    Wen, Honglin
    Yan, Yan
    Liu, Peilin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (03)
  • [5] A three-stream fusion network for 3D skeleton-based action recognition
    Fang, Ming
    Liu, Qi
    Ren, Jianping
    Li, Jie
    Du, Xinning
    Liu, Shuhua
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [6] Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition
    Banerjee, Avinandan
    Singh, Pawan Kumar
    Sarkar, Ram
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2206 - 2216
  • [7] 3D skeleton-based action recognition by representing motion capture sequences as 2D-RGB images
    Laraba, Sohaib
    Brahimi, Mohammed
    Tilmanne, Joelle
    Dutoit, Thierry
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2017, 28 (3-4)
  • [8] Human Action Recognition Based on Quaternion 3D Skeleton Representation
    Xu Haiyang
    Kong Jun
    Jiang Min
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (02)
  • [9] Action Recognition based on a mixture of RGB and Depth based skeleton
    Das, Srijan
    Koperski, Michal
    Bremond, Francois
    Francesca, Gianpiero
    2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2017
  • [10] Arm-hand Action Recognition Based on 3D Skeleton Joints
    Rui, Ling
    Ma, Shi-wei
    Wen, Jia-rui
    Liu, Li-na
    INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA 2016), 2016 : 326 - 332