Human Skeleton Feature Optimizer and Adaptive Structure Enhancement Graph Convolution Network for Action Recognition

Cited by: 19

Authors:

Xiong, Xin [1,2,3]

Min, Weidong [3,4]

Wang, Qi [5]

Zha, Cheng [6]

Affiliations:

[1] Nanchang Univ, Affiliated Hosp 1, Informat Dept, Nanchang, Peoples R China

[2] Nanchang Univ, Inst Metaverse, Nanchang 330031, Peoples R China

[3] Jiangxi Key Lab Smart City, Nanchang, Peoples R China

[4] Nanchang Univ, Inst Metaverse, Sch Math & Comp Sci, Nanchang 330047, Peoples R China

[5] Nanchang Univ, Sch Software, Nanchang 330047, Peoples R China

[6] Nanchang Univ, Sch Math & Comp Sci, Nanchang 330031, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023 / Vol. 33 / No. 01

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Skeleton; Convolution; Data mining; Directed graphs; Smart cities; Kernel; Action recognition; graph convolution network; skeleton feature optimizer; graph structure mask; directed graph mapping; adaptive pooling operation; KNOWLEDGE DISTILLATION; INTERNET;

DOI:

10.1109/TCSVT.2022.3201186

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Human action recognition based on the graph convolution network (GCN) is a hot topic in computer vision. Existing GCN-based methods fail to capture internal implicit information when extracting action features, thereby leading to over-smoothing in the training stage. These issues result in poor performance and inaccurate extraction of action features. To address these problems, this paper proposes a human skeleton feature optimizer (SFO) and an adaptive structure enhancement graph convolution network (ASE-GCN) for action recognition, trained in an end-to-end manner. To obtain discriminative features, the SFO constructs a new skeleton representation for action recognition through a connection criterion, which extracts the internal implicit information of an action. In the proposed ASE-GCN, the action features of the joint coordinates are extracted by a graph structure mask (GSM), directed graph mapping (DGM), and an adaptive pooling operation (APO). The GSM acts as a regularizer of skeleton structure information to strengthen the representation of the graph structure. The DGM correlates the directed graph with human motion information through kinematic principles, and the APO strengthens the global high-frequency features to alleviate over-smoothing. In experiments on two large-scale public datasets, NTU-RGB+D and Kinetics, the proposed method achieves comparable or superior results relative to state-of-the-art methods.


Pages: 342-353

Number of pages: 12

