VIEW-INVARIANT ACTION RECOGNITION FROM RGB DATA VIA 3D POSE ESTIMATION

被引:0
|
作者
Baptista, Renato [1 ]
Ghorbel, Enjie [1 ]
Papadopoulos, Konstantinos [1 ]
Demisse, Girum G. [1 ]
Aouada, Djamila [1 ]
Ottersten, Bjorn [1 ]
机构
[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust, 29 Ave JF Kennedy, L-1855 Luxembourg, Luxembourg
基金
欧盟地平线“2020”;
关键词
Pose Estimation; Skeleton; View-Invariance; LSTM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel view-invariant action recognition method using a single monocular RGB camera. View invariance remains a very challenging topic in 2D action recognition due to the lack of 3D information in RGB images. Most successful approaches make use of the concept of knowledge transfer by projecting 3D synthetic data to multiple viewpoints. Instead of relying on knowledge transfer, we propose to augment the RGB data by a third dimension by means of 3D skeleton estimation from 2D images using a CNN-based pose estimator. In order to ensure view invariance, a pre-processing for alignment is applied followed by data expansion as a way for denoising. Finally, a Long Short Term Memory (LSTM) architecture is used to model the temporal dependency between skeletons. The proposed network is trained to directly recognize actions from aligned 3D skeletons. The experiments performed on the challenging Northwestern-UCLA dataset show the superiority of our approach as compared to state-of-the-art ones.
引用
收藏
页码:2542 / 2546
页数:5
相关论文
共 50 条
  • [21] Extended 3D Line Segments from RGB-D Data for Pose Estimation
    Buch, Anders Glent
    Jessen, Jeppe Barsoe
    Kraft, Dirk
    Savarimuthu, Thiusius Rajeeth
    Kruger, Norbert
    IMAGE ANALYSIS, SCIA 2013: 18TH SCANDINAVIAN CONFERENCE, 2013, 7944 : 54 - 65
  • [22] A View-invariant Skeleton Map with 3DCNN for Action Recognition
    Zhao, Yang
    Wen, Long
    Li, Shuguang
    Cheng, Hong
    Zhang, Chen
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2128 - 2132
  • [23] Towards Fast, View-Invariant Human Action Recognition
    Cherla, Srikanth
    Kulkarni, Kaustubh
    Kale, Amit
    Ramasubramanian, V.
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 1650 - 1657
  • [24] Latent Multitask Learning for View-Invariant Action Recognition
    Mahasseni, Behrooz
    Todorovic, Sinisa
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3128 - 3135
  • [25] Ergonomic posture recognition using 3D view-invariant features from single ordinary camera
    Zhang, Hong
    Yan, Xuzhong
    Li, Heng
    AUTOMATION IN CONSTRUCTION, 2018, 94 : 1 - 10
  • [26] On Temporal Order Invariance for View-Invariant Action Recognition
    Anwaar-ul-Haq
    Gondal, Iqbal
    Murshed, Manzur
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (02) : 203 - 211
  • [27] A survey about view-invariant human action recognition
    Nghia Pham Trong
    Anh Truong Minh
    Nguyen, Hung
    Kazunori, Kotani
    Bac Le Hoai
    2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2017, : 699 - 704
  • [28] Error Accuracy Estimation of 3D Reconstruction and 3D Camera Pose from RGB-D Data
    Ortiz-Fernandez, Luis E.
    Silva, Bruno M. F.
    Goncalves, Luiz M. G.
    2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 67 - 72
  • [29] View-Invariant Gait Recognition via Deterministic Learning
    Zeng, Wei
    Wang, Cong
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3465 - 3472
  • [30] View-invariant gait recognition via deterministic learning
    Zeng, Wei
    Wang, Cong
    NEUROCOMPUTING, 2016, 175 : 324 - 335