Attentional weighting strategy-based dynamic GCN for skeleton-based action recognition

被引:14
作者
Hu, Kai [1 ,2 ]
Jin, Junlan [1 ]
Shen, Chaowen [1 ]
Xia, Min [1 ,2 ]
Weng, Liguo [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Automat, 219 Ningliu Rd, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, 219 Ningliu Rd, Nanjing 210044, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Skeleton-based action recognition; Graph topology; Position feature; GRAPH NEURAL-NETWORK;
D O I
10.1007/s00530-023-01082-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph Convolutional Networks (GCNs) have become the standard skeleton-based human action recognition research paradigm. As a core component in graph convolutional networks, the construction of graph topology often significantly impacts the accuracy of classification. Considering that the fixed physical graph topology cannot capture the non-physical connection relationship of the human body, existing methods capture more flexible node relationships by constructing dynamic graph structures. This paper proposes a novel attentional weighting strategy-based dynamic GCN (AWD-GCN). We construct a new dynamic adjacency matrix, which uses the attention weighting mechanism to simultaneously capture the dynamic relationships among the three partitions of the human skeleton under multiple actions to extract the discriminative action features fully. In addition, considering the importance of skeletal node position features for action differentiation, we propose new multi-scale position attention and multi-level attention. We use a multi-scale modelling method to capture the complex relationship between skeletal node position features, which is helpful in distinguishing human action in different spatial scales. Extensive experiments on two challenging datasets, NTU-RGB+D and Skeleton-Kinetics, demonstrate the effectiveness and superiority of our method.
引用
收藏
页码:1941 / 1954
页数:14
相关论文
共 56 条
[1]   All about VLAD [J].
Arandjelovic, Relja ;
Zisserman, Andrew .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :1578-1585
[2]   Dynamic Convolution: Attention over Convolution Kernels [J].
Chen, Yinpeng ;
Dai, Xiyang ;
Liu, Mengchen ;
Chen, Dongdong ;
Yuan, Lu ;
Liu, Zicheng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11027-11036
[3]   Skeleton-Based Action Recognition with Shift Graph Convolutional Network [J].
Cheng, Ke ;
Zhang, Yifan ;
He, Xiangyu ;
Chen, Weihan ;
Cheng, Jian ;
Lu, Hanqing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :180-189
[4]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714
[5]   Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos [J].
Duta, Ionut C. ;
Ionescu, Bogdan ;
Aizawa, Kiyoharu ;
Sebe, Nicu .
MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :365-378
[6]   Two Stream LSTM : A Deep Fusion Framework for Human Action Recognition [J].
Gammulle, Harshala ;
Denman, Simon ;
Sridharan, Sridha ;
Fookes, Clinton .
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :177-186
[7]   Res2Net: A New Multi-Scale Backbone Architecture [J].
Gao, Shang-Hua ;
Cheng, Ming-Ming ;
Zhao, Kai ;
Zhang, Xin-Yu ;
Yang, Ming-Hsuan ;
Torr, Philip .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) :652-662
[8]   On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition [J].
Heidari, Negar ;
Iosifidis, Alexandros .
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[9]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[10]   MCANet: A Multi-Branch Network for Cloud/Snow Segmentation in High-Resolution Remote Sensing Images [J].
Hu, Kai ;
Zhang, Enwei ;
Xia, Min ;
Weng, Liguo ;
Lin, Haifeng .
REMOTE SENSING, 2023, 15 (04)