Learning representations from quadrilateral based geometric features for skeleton-based action recognition using LSTM networks

被引:5
作者
Naveenkumar, M. [1 ]
Domnic, S. [1 ]
机构
[1] Natl Inst Technol Tiruchirappalli, Tiruchirappalli, Tamil Nadu, India
来源
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS | 2020年 / 14卷 / 01期
关键词
Action recognition; skeleton maps; quadrilateral; geometric features; LSTM;
D O I
10.3233/IDT-190078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the recent developments in sensor technology and pose estimation algorithms, skeleton based action recognition has become popular. Classical machine learning methods based on hand-crafted features fail on large scale datasets due to their limited representation power. Recently, recurrent neural networks (RNN) based methods focus on the temporal evolution of body joints and neglect the geometric relations between them. In this paper, we propose eleven quadrilaterals to capture the geometric relations among joints for action recognition. An end-to-end 3-layer Bi-LSTM network is designed as Base-Net to learn robust representations. We propose two subnets based on the Base-Net to extract discriminative spatio temporal features. Specifically, the first subnet (SQuadNet) uses four spatial features and the second one (TQuadNet) uses two temporal features. The empirical results on two benchmark datasets, NTU RGB+D and UTD MHAD, show how our method achieves state of the art performance when compared to recent methods in the literature.
引用
收藏
页码:47 / 54
页数:8
相关论文
共 33 条
[1]  
[Anonymous], IEEE C COMP VIS PATT
[2]  
[Anonymous], 2013, 23 INT JOINT C ART I, DOI DOI 10.5555/2540128.2540483
[3]  
[Anonymous], 2012, MEX INT C ART INT
[4]   Bio-inspired Dynamic 3D Discriminative Skeletal Features for Human Action Recognition [J].
Chaudhry, Rizwan ;
Ofli, Ferda ;
Kurillo, Gregorij ;
Bajcsy, Ruzena ;
Vidal, Rene .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, :471-478
[5]  
Chen C, 2015, IEEE IMAGE PROC, P168, DOI 10.1109/ICIP.2015.7350781
[6]   REConvertor: Transforming Textual Use Cases to High-Level Message Sequence Chart [J].
Ding, Zuohua ;
Shuai, Tiantian ;
Jiang, Mingyue .
2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2017, :610-611
[7]  
Du Y, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P579, DOI 10.1109/ACPR.2015.7486569
[8]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714
[9]   Skeletal Quads: Human Action Recognition Using Joint Quadruples [J].
Evangelidis, Georgios ;
Singh, Gurkirt ;
Horaud, Radu .
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, :4513-4518
[10]   Attention-Based Multiview Re-Observation Fusion Network for Skeletal Action Recognition [J].
Fan, Zhaoxuan ;
Zhao, Xu ;
Lin, Tianwei ;
Su, Haisheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (02) :363-374