Human Skeleton Feature Optimizer and Adaptive Structure Enhancement Graph Convolution Network for Action Recognition

被引:19
作者
Xiong, Xin [1 ,2 ,3 ]
Min, Weidong [3 ,4 ]
Wang, Qi [5 ]
Zha, Cheng [6 ]
机构
[1] Nanchang Univ, Affiliated Hosp 1, Informat Dept, Nanchang, Peoples R China
[2] Nanchang Univ, Inst Metaverse, Nanchang 330031, Peoples R China
[3] Jiangxi Key Lab Smart City, Nanchang, Peoples R China
[4] Nanchang Univ, Inst Metaverse, Sch Math & Comp Sci, Nanchang 330047, Peoples R China
[5] Nanchang Univ, Sch Software, Nanchang 330047, Peoples R China
[6] Nanchang Univ, Sch Math & Comp Sci, Nanchang 330031, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Skeleton; Convolution; Data mining; Directed graphs; Smart cities; Kernel; Action recognition; graph convolution network; skeleton feature optimizer; graph structure mask; directed graph mapping; adaptive pooling operation; KNOWLEDGE DISTILLATION; INTERNET;
D O I
10.1109/TCSVT.2022.3201186
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human action recognition based on the graph convolution network (GCN) is a hot topic in computer vision. Existing GCN-based methods fail to capture internal implicit information when extracting action features, thereby leading to over-smoothing in the training stage. These issues result in poor performance and inaccurate extraction of action features. To address these problems, a new GCN is constructed. In this paper, a human skeleton feature optimizer (SFO) and adaptive structure enhancement graph convolution network (ASE-GCN) for action recognition are proposed in an end-to-end manner. To obtain discriminative features, the SFO is proposed to construct a new skeleton representation for action recognition through the connection criterion, which extracts the internal implicit information of action. The action feature of the joint coordinates is extracted by graph structure mask (GSM), directed graph mapping (DGM), and adaptive pooling operation (APO) in the proposed ASE-GCN network. The GSM acts as the regularizer of skeleton structure information to strengthen the representation of the graph structure. The DGM correlates the directed graph with human motion information through kinematic principle, and the APO strengthens the global high-frequency features to alleviate over-smoothing. The proposed method achieves comparable or superior results over state-of-the-art methods when used in experiments on two large public-scale datasets, NTU-RGB+D and Kinetics.
引用
收藏
页码:342 / 353
页数:12
相关论文
共 72 条
  • [61] Yang Y., 2020, P ADV NEUR INF PROC, P1
  • [62] Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking
    Yang, Yiding
    Ren, Zhou
    Li, Haoxiang
    Zhou, Chunluan
    Wang, Xinchao
    Hua, Gang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8070 - 8080
  • [63] Action Recognition With Spatio-Temporal Visual Attention on Skeleton Image Sequences
    Yang, Zhengyuan
    Li, Yuncheng
    Yang, Jianchao
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2405 - 2415
  • [64] Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition
    Ye, Fanfan
    Pu, Shiliang
    Zhong, Qiaoyong
    Li, Chao
    Xie, Di
    Tang, Huiming
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 55 - 63
  • [65] Neural-Network-Based Root Mean Delay Spread Model for Ubiquitous Indoor Internet-of-Things Scenarios
    Yu, Yu
    Lu, Wen-Jun
    Liu, Yang
    Zhu, Hong-Bo
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (06): : 5580 - 5589
  • [66] View Adaptive Neural Networks for High Performance Skeleton-Based Human Action Recognition
    Zhang, Pengfei
    Lan, Cuiling
    Xing, Junliang
    Zeng, Wenjun
    Xue, Jianru
    Zheng, Nanning
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (08) : 1963 - 1978
  • [67] Graph Edge Convolutional Neural Networks for Skeleton-Based Action Recognition
    Zhang, Xikun
    Xu, Chang
    Tian, Xinmei
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 3047 - 3060
  • [68] Zhao L., 2019, ARXIV190912223
  • [69] Zheng W, 2019, Arxiv, DOI [arXiv:1805.02556, DOI 10.48550/ARXIV.1805.02556]
  • [70] Detecting Motion Blurred Vehicle Logo in IoV Using Filter-DeblurGAN and VL-YOLO
    Zhou, Linghua
    Min, Weidong
    Lin, Deyu
    Han, Qing
    Liu, Ruikang
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (04) : 3604 - 3614