VIEW-INVARIANT ACTION RECOGNITION FROM RGB DATA VIA 3D POSE ESTIMATION

Authors
Baptista, Renato [1]
Ghorbel, Enjie [1]
Papadopoulos, Konstantinos [1]
Demisse, Girum G. [1]
Aouada, Djamila [1]
Ottersten, Bjorn [1]
Affiliations
[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust, 29 Ave JF Kennedy, L-1855 Luxembourg, Luxembourg
Funding
European Union Horizon 2020
Keywords
Pose Estimation; Skeleton; View-Invariance; LSTM
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we propose a novel view-invariant action recognition method using a single monocular RGB camera. View invariance remains a very challenging topic in 2D action recognition due to the lack of 3D information in RGB images. Most successful approaches rely on knowledge transfer, projecting synthetic 3D data to multiple viewpoints. Instead of relying on knowledge transfer, we propose to augment the RGB data with a third dimension by estimating 3D skeletons from 2D images using a CNN-based pose estimator. To ensure view invariance, an alignment pre-processing step is applied, followed by data expansion as a means of denoising. Finally, a Long Short-Term Memory (LSTM) architecture is used to model the temporal dependency between skeletons. The proposed network is trained to recognize actions directly from aligned 3D skeletons. Experiments on the challenging Northwestern-UCLA dataset show that our approach outperforms state-of-the-art methods.
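The abstract does not specify how the alignment pre-processing is done. As a rough illustration only, the sketch below shows one common way to make a 3D skeleton independent of camera position and of rotation about the vertical axis: center it on a root joint, then rotate it so that the hip line lies along the x-axis. The joint indices, the choice of the y-axis as "up", and the function names are assumptions made for this example, not details from the paper. The LSTM classifier is not shown.

```python
import numpy as np

# Hypothetical joint indices; the real layout depends on the pose estimator.
ROOT, LEFT_HIP, RIGHT_HIP = 0, 1, 2

# Assumed vertical direction of the estimated 3D skeletons.
UP = np.array([0.0, 1.0, 0.0])


def align_skeleton(joints):
    """Align one skeleton given as an array of shape (num_joints, 3).

    Steps: translate the root joint to the origin, then express all joints
    in a body-centered basis whose x-axis follows the hip line projected
    onto the ground plane.
    """
    centered = joints - joints[ROOT]

    # Hip direction with its vertical component removed.
    hip = centered[LEFT_HIP] - centered[RIGHT_HIP]
    hip = hip - np.dot(hip, UP) * UP
    norm = np.linalg.norm(hip)
    if norm < 1e-8:
        # Degenerate hip line: fall back to translation-only alignment.
        return centered

    x_axis = hip / norm
    z_axis = np.cross(x_axis, UP)
    basis = np.stack([x_axis, UP, z_axis])  # rows are the new axes

    # Coordinates of every joint in the body-centered basis.
    return centered @ basis.T


def align_sequence(frames):
    """Align every frame of a sequence of shape (num_frames, num_joints, 3)."""
    return np.stack([align_skeleton(frame) for frame in frames])
```

Under these assumptions, two recordings of the same pose that differ only by the camera's position and its rotation about the vertical axis map to the same aligned skeleton, which is the property a downstream sequence model would rely on.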
Pages: 2542-2546
Number of pages: 5