Multi-grained clip focus for skeleton-based action recognition

Citations: 11
Authors
Qiu, Helei [1]
Hou, Biao [1]
Affiliations
[1] Xidian Univ, Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Action recognition; Skeleton; Multi-grain; Self-attention;
DOI
10.1016/j.patcog.2023.110188
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Joint-level and part-level information are both crucial for modeling actions at different granularities. In addition, the correlations among different joints across consecutive frames are very useful for skeleton-based action recognition. To capture this action information effectively, a new multi-grained clip focus network (MGCF-Net) is proposed. First, the skeleton sequence is divided into multiple clips, each containing several consecutive frames. According to the structure of the human body, each clip is then divided into several tuples. An intra-clip attention module is proposed to capture intra-clip action information. Specifically, multi-head self-attention is split into two parts that obtain relevant information at the joint level and the part level, respectively, and the information captured by the two parts is integrated to obtain multi-grained contextual features. In addition, an inter-clip focus module is used to capture the key information of several consecutive sub-actions, which helps to distinguish similar actions. On two large-scale benchmarks for skeleton-based action recognition, our method achieves state-of-the-art performance, which verifies its effectiveness.
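The clip and tuple partitioning described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' MGCF-Net implementation: the 25-joint layout, the `PARTS` grouping, the clip length, the mean pooling used to form part tokens, and the projection-free single-head attention are all assumptions chosen for illustration, and the fusion of joint-level and part-level features is left out.

```python
import numpy as np

# Hypothetical body-part grouping for a 25-joint skeleton (indices are illustrative only).
PARTS = {
    "torso": [0, 1, 2, 3, 20],
    "left_arm": [4, 5, 6, 7, 21, 22],
    "right_arm": [8, 9, 10, 11, 23, 24],
    "left_leg": [12, 13, 14, 15],
    "right_leg": [16, 17, 18, 19],
}


def split_into_clips(seq, clip_len):
    """Split a (T, J, C) sequence into (T // clip_len, clip_len, J, C) clips.

    Leftover frames at the end of the sequence are dropped.
    """
    t, j, c = seq.shape
    n = t // clip_len
    return seq[: n * clip_len].reshape(n, clip_len, j, c)


def clip_to_tuples(clip, parts):
    """Group a (clip_len, J, C) clip into one (clip_len, len(idx), C) array per body part."""
    return {name: clip[:, idx, :] for name, idx in parts.items()}


def self_attention(tokens):
    """Scaled dot-product self-attention over (N, D) tokens, without learned projections."""
    d = tokens.shape[-1]
    scores = tokens @ tokens.T / np.sqrt(d)
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ tokens


rng = np.random.default_rng(0)
seq = rng.standard_normal((64, 25, 3))      # 64 frames, 25 joints, 3D coordinates
clips = split_into_clips(seq, clip_len=4)   # 16 clips of 4 consecutive frames
tuples = clip_to_tuples(clips[0], PARTS)

# Joint level: every joint in every frame of the first clip is one token.
joint_tokens = clips[0].reshape(-1, 3)
joint_ctx = self_attention(joint_tokens)

# Part level: one token per body part, mean-pooled over frames and joints (an assumption).
part_tokens = np.stack([part.mean(axis=(0, 1)) for part in tuples.values()])
part_ctx = self_attention(part_tokens)
```

Attending over all joints of all frames in a clip is what lets a joint in one frame relate to a different joint in a neighboring frame, which is the cross-frame correlation the abstract highlights.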
Pages: 9