Multi-grained clip focus for skeleton-based action recognition

被引：11

作者：

Qiu, Helei ^{[1
]}

Hou, Biao ^{[1
]}

机构：

[1] Xidian Univ, Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding,, Xian 710071, Shaanxi, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 148卷

基金：

中国国家自然科学基金;

关键词：

Action recognition; Skeleton; Multi-grain; Self-attention;

D O I：

10.1016/j.patcog.2023.110188

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Joint-level and part-level information are crucial for modeling actions with different granularity. In addition, the relevant information on different joints between consecutive frames is very useful for skeleton-based action recognition. To effectively capture the action information, a new multi-grained clip focus network (MGCF-Net) is proposed. Firstly, the skeleton sequence is divided into multiple clips, each containing several consecutive frames. According to the structure of the human body, each clip is divided into several tuples. Then an intra-clip attention module is proposed to capture intra-clip action information. Specifically, multi-head self-attention is divided into two parts, obtaining relevant information at the joint and part levels, and integrating the information captured from these two parts to obtain multi-grained contextual features. In addition, an inter clip focus module is used to capture the key information of several consecutive sub-actions, which will help to distinguish similar actions. On two large-scale benchmarks for skeleton-based action recognition, our method achieves the most advanced performance, and its effectiveness has been verified.

引用

页数：9

共 37 条

[1] Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
Chen, Yuxin
Zhang, Ziqi
Yuan, Chunfeng
Li, Bing
Deng, Ying
Hu, Weiming
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13339 - 13348
[2] Locomotion speed capability analysis of six-legged robots: Optimization and application
Chen, Zhijun
Tian, Yuan
Gao, Feng
Liu, Jimu
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2021, 235 (21) : 5434 - 5449
[3] Skeleton-Based Action Recognition with Shift Graph Convolutional Network
Cheng, Ke
Zhang, Yifan
He, Xiangyu
Chen, Weihan
Cheng, Jian
Lu, Hanqing
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 180 - 189
[4] Global spatio-temporal synergistic topology learning for skeleton-based action recognition
Dai, Meng
Sun, Zhonghua
Wang, Tianyi
Feng, Jinchao
Jia, Kebin
[J]. PATTERN RECOGNITION, 2023, 140
[5] Dosovitskiy Alexey, 2021, ICLR
[6] Revisiting Skeleton-based Action Recognition
Duan, Haodong
Zhao, Yue
Chen, Kai
Lin, Dahua
Dai, Bo
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2959 - 2968
[7] Relation-mining self-attention network for skeleton-based human action recognition
Gedamu, Kumie
Ji, Yanli
Gao, LingLing
Yang, Yang
Shen, Heng Tao
[J]. PATTERN RECOGNITION, 2023, 139
[8] Jointly Learning Heterogeneous Features for RGB-D Activity Recognition
Hu, Jian-Fang
Zheng, Wei-Shi
Lai, Jianhuang
Zhang, Jianguo
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (11) : 2186 - 2200
[9] Huang LJ, 2020, AAAI CONF ARTIF INTE, V34, P11045
[10] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
Lee, Jungho
Lee, Minhyeok
Lee, Dogyoon
Lee, Sangyoun
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10410 - 10419

← 1 2 3 4 →