Action Recognition Network Combining Spatio-Temporal Adaptive Graph Convolution and Transformer

被引：0

作者：

Han, Zongwang ^{[1
]}

Yang, Han ^{[1
]}

Wu, Shiqing ^{[1
]}

Chen, Long ^{[1
]}

机构：

[1] Univ Shanghai Sci & Technol, Sch Mech Engn, Shanghai 200093, Peoples R China

来源：

JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY | 2024年 / 46卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Intelligent manufacturing; Recognition of worker activity; Deep learning; Adaptive graph; Transformer;

D O I：

10.11999/JEIT230551

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In a human-centered smart factory, perceiving and understanding workers' behavior is crucial, as different job categories are often associated with work time and tasks. In this paper, the accuracy of the model's recognition is improved by combining two approaches, namely adaptive graphs and Transformers, to focus more on the spatiotemporal information of the skeletal structure. Firstly, an adaptive graph method is employed to capture the connectivity relationships beyond the human body skeleton. Furthermore, the Transformer framework is utilized to capture the dynamic temporal variations of the worker's skeleton. To evaluate the model's performance, six typical worker action datasets are created for intelligent production line assembly tasks and validated. The results indicate that the model proposed in this article has a Top-1 accuracy comparable to mainstream action recognition models. Finally, the proposed model is compared with several mainstream methods on the publicly available NTU-RGBD and Skeleton-Kinetics datasets, and the experimental results demonstrate the robustness of the model proposed in this paper.

引用

页码：2587 / 2595

页数：9

共 31 条

[1] A union of deep learning and swarm-based optimization for 3D human action recognition [J].

Basak, Hritam ;

Kundu, Rohit ;

Singh, Pawan Kumar ;

Ijaz, Muhammad Fazal ;

Wozniak, Marcin ;

Sarkar, Ram .

SCIENTIFIC REPORTS, 2022, 12 (01)

[2] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[3] Human Action Recognition Network Based on Improved Channel Attention Mechanism [J].

Chen Ying ;

Gong Suming .

JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) :3538-3545

[4] Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [J].

Chen, Yuxin ;

Zhang, Ziqi ;

Yuan, Chunfeng ;

Li, Bing ;

Deng, Ying ;

Hu, Weiming .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13339-13348

[5] Skeleton-Based Action Recognition with Shift Graph Convolutional Network [J].

Cheng, Ke ;

Zhang, Yifan ;

He, Xiangyu ;

Chen, Weihan ;

Cheng, Jian ;

Lu, Hanqing .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :180-189

[6]

Du Y, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P579, DOI 10.1109/ACPR.2015.7486569

[7] Revisiting Skeleton-based Action Recognition [J].

Duan, Haodong ;

Zhao, Yue ;

Chen, Kai ;

Lin, Dahua ;

Dai, Bo .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :2959-2968

[8] Relation-mining self-attention network for skeleton-based human action recognition [J].

Gedamu, Kumie ;

Ji, Yanli ;

Gao, LingLing ;

Yang, Yang ;

Shen, Heng Tao .

PATTERN RECOGNITION, 2023, 139

[9] Action Recognition Based on 3D Skeleton and LSTM for the Monitoring of Construction Workers' Safety Harness Usage [J].

Guo, Hongling ;

Zhang, Zhitian ;

Yu, Run ;

Sun, Yakang ;

Li, Heng .

JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2023, 149 (04)

[10] Continual spatio-temporal graph convolutional networks [J].

Hedegaard, Lukas ;

Heidari, Negar ;

Iosifidis, Alexandros .

PATTERN RECOGNITION, 2023, 140

← 1 2 3 4 →