Action Recognition Based on 3D Skeleton and RGB Frame Fusion

被引:0
|
作者
Liu, Guiyu [1 ]
Qian, Jiuchao [1 ]
Wen, Fei [1 ]
Zhu, Xiaoguang [1 ]
Ying, Rendong [1 ]
Liu, Peilin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
来源
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2019年
关键词
SEGMENTATION;
D O I
10.1109/iros40897.2019.8967570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition has wide applications in assisted living, health monitoring, surveillance, and human-computer interaction. In traditional action recognition methods, RGB video-based ones are effective but computationally inefficient, while skeleton-based ones are computationally efficient but do not make use of low-level detail information. This work considers action recognition based on a multimodal fusion between the 3D skeleton and the RGB image. We design a neural network that uses a 3D skeleton sequence and a single middle frame from an RGB video as input. Specifically, our method picks up one frame in a video and extracts spatial features from it using two attention modules, a self-attention module and a skeleton-attention module. Further, temporal features are extracted from the skeleton sequence via a BI-LSTM sub-network. Finally, the spatial features and the temporal features are combined via a feature fusion network for action classification. A distinct feature of our method is that it uses only a single RGB frame rather than an RGB video. Accordingly, it has a light-weighted architecture and is more efficient than RGB video-based methods. Comparative evaluation on two public datasets, NTU-RGBD and SYSU, demonstrates that, our method can achieve competitive performance compared with state-of-the-art methods.
引用
收藏
页码:258 / 264
页数:7
相关论文
共 50 条
  • [21] Tripool: Graph triplet pooling for 3D skeleton-based action recognition
    Peng, Wei
    Hong, Xiaopeng
    Zhao, Guoying
    PATTERN RECOGNITION, 2021, 115
  • [22] Mix Dimension in Poincare Geometry for 3D Skeleton-based Action Recognition
    Peng, Wei
    Shi, Jingang
    Xia, Zhaoqiang
    Zhao, Guoying
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1432 - 1440
  • [23] INVESTIGATION OF DIFFERENT SKELETON FEATURES FOR CNN-BASED 3D ACTION RECOGNITION
    Ding, Zewei
    Wang, Pichao
    Ogunbona, Philip O.
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [24] Accurate and Real-time Human Action Recognition Based on 3D Skeleton
    Chen, Hongzhao
    Wang, Guijin
    He, Li
    2013 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING AND PROCESSING TECHNOLOGY, 2013, 9045
  • [25] Recurrent Neural Network based Action Recognition from 3D Skeleton Data
    Shukla, Parul
    Biswas, Kanad K.
    Kalra, Prem K.
    2017 13TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS (SITIS), 2017, : 339 - 345
  • [26] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
    Guan, Shannan
    Lu, Haiyan
    Zhu, Linchao
    Fang, Gengfa
    NEUROCOMPUTING, 2022, 514 : 256 - 267
  • [27] HIF3D: Handwriting -Inspired Features for 3D skeleton-based action recognition
    Boulahia, Said Yacine
    Anquetil, Eric
    Kulpa, Richard
    Multon, Franck
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 985 - 990
  • [28] Behavior Recognition Based on 3D Skeleton Features
    Liu, W. T.
    Lu, T. W.
    Miao, S. J.
    Peng, L.
    Min, F.
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENVIRONMENTAL ENGINEERING (CSEE 2015), 2015, : 760 - 765
  • [29] Understanding the Gap between 2D and 3D Skeleton-Based Action Recognition
    Elias, Petr
    Sedmidubsky, Jan
    Zezula, Pavel
    2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019), 2019, : 192 - 195
  • [30] Action Recognition Based on Features Fusion and 3D Convolutional Neural Networks
    Liu, Lulu
    Hu, Fangyu
    Zhou, Jiahui
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2016, : 178 - 181