Action recognition and tracking via deep representation extraction and motion bases learning

Cited by: 0
Authors
Hao-Ting Li
Yung-Pin Liu
Yun-Kai Chang
Chen-Kuo Chiang
Affiliation
[1] National Chung Cheng University, Department of Computer Science and Information Engineering, Advanced Institute of Manufacturing with High
Source
Multimedia Tools and Applications | 2022 / Vol. 81
Keywords
Action recognition; Motion bases; Action tracking; Deep learning;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Action recognition and positional tracking are critical issues in many Virtual Reality (VR) applications. In this paper, a novel feature representation method is proposed to recognize actions from sensor signals. Features are extracted by jointly learning a Convolutional Auto-Encoder (CAE) and a representation of motion bases obtained via clustering, called the Sequence of Cluster Centroids (SoCC). The learned features are then used to train the action recognition classifier. We have collected a new dataset of limb actions recorded as sensor signals. In addition, a novel action tracking method is proposed for the VR environment. It extends the sensor signals from three Degrees of Freedom (3DoF) of rotation to 6DoF of position plus rotation. Experimental results demonstrate that the CAE-SoCC feature is effective for action recognition and for accurate prediction of position displacement.
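The abstract does not give implementation details, so the following is only a rough sketch of what a "sequence of cluster centroids" could look like, not the authors' method. It assumes a synthetic 3-axis signal, uses flattened sliding windows in place of CAE latent codes, and runs plain k-means separately rather than learning it jointly with an encoder. All function names and parameters (`sliding_windows`, `kmeans`, `sequence_of_centroids`, `k`, `width`, `step`) are hypothetical.

```python
import numpy as np

def sliding_windows(signal, width, step):
    """Cut a (T, C) signal into flattened windows of shape (N, width * C)."""
    starts = range(0, signal.shape[0] - width + 1, step)
    return np.stack([signal[s:s + width].ravel() for s in starts])

def nearest_centroid(points, centroids):
    """Index of the closest centroid (Euclidean distance) for each point."""
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points (indexing copies them).
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        labels = nearest_centroid(points, centroids)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # leave an empty cluster's centroid unchanged
                centroids[j] = members.mean(axis=0)
    return centroids, nearest_centroid(points, centroids)

def sequence_of_centroids(signal, k=4, width=8, step=4):
    """Replace each window by the centroid of its cluster (assumed 'motion base')."""
    # Assumption: raw flattened windows stand in for learned CAE features.
    windows = sliding_windows(signal, width, step)
    centroids, labels = kmeans(windows, k)
    return centroids[labels], labels

# Synthetic stand-in for a 3-axis sensor recording (64 time steps).
rng = np.random.default_rng(0)
signal = rng.normal(size=(64, 3))
socc, labels = sequence_of_centroids(signal)
```

Here `socc` holds one centroid per window, in temporal order, and could be fed to any downstream classifier; how the paper actually couples clustering with the auto-encoder is not specified in this record.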
Pages: 11845-11864
Number of pages: 19