Skeleton-Guided Action Recognition with Multistream 3D Convolutional Neural Network for Elderly-Care Robot

Cited by: 1
Authors
Zhang, Dawei [1]
Zhang, Yanmin [2]
Zhou, Meng [1]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Henan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
action recognition; deep learning; service robots; 2-STREAM;
DOI
10.1002/aisy.202300326
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
As societies worldwide age, elderly-care robots are attracting growing interest, and action recognition allows them to provide better care. This article presents a skeleton-guided action recognition framework based on a multistream 3D convolutional neural network for elderly-care robots. Two parallel dual-stream lightweight networks (Light-SlowFast networks based on ResNet-18) are proposed to strengthen the extraction of human action features while reducing computation. Two different modes of skeleton input video are constructed, and their outputs are combined by decision fusion to improve recognition accuracy. A feature fusion layer and a sliding-window mechanism are designed, and two cross-entropy losses supervise training. A dataset named elder care action recognition (EC-AR), covering different categories of action, is built. Experimental results on the HMDB-51 and EC-AR datasets both show that the proposed framework outperforms existing methods. The method is also applied to a prototype elderly-care robot, and tests in home scenarios show that it retains high recognition accuracy and good real-time performance. © 2023 WILEY-VCH GmbH
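The abstract mentions decision fusion across two skeleton input modes and a sliding-window mechanism but does not give the fusion rule or the window parameters. The following is only a minimal sketch under stated assumptions: fusion is taken to be a weighted average of per-stream softmax scores, and the window is a fixed-length clip advanced by a fixed stride. The names `fuse_decisions`, `sliding_windows`, and `weight_a` are hypothetical and do not come from the article.

```python
import math


def softmax(logits):
    """Convert raw class scores to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_decisions(logits_a, logits_b, weight_a=0.5):
    """Assumed fusion rule: weighted average of the two streams' softmax scores.

    logits_a / logits_b stand in for the class scores produced from the two
    skeleton input modes; the article may use a different rule.
    """
    probs_a = softmax(logits_a)
    probs_b = softmax(logits_b)
    fused = [weight_a * a + (1.0 - weight_a) * b
             for a, b in zip(probs_a, probs_b)]
    label = max(range(len(fused)), key=fused.__getitem__)
    return label, fused


def sliding_windows(num_frames, window, stride):
    """Assumed windowing: (start, end) frame indices of fixed-length clips."""
    return [(start, start + window)
            for start in range(0, num_frames - window + 1, stride)]
```

In use, each window returned by `sliding_windows` would be passed through both networks, and `fuse_decisions` would combine the two score vectors into one predicted class per window.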
Pages: 11