Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition

Cited by: 50
Authors
Huang, Zhen [1]
Shen, Xu [2]
Tian, Xinmei [1]
Li, Houqiang [1]
Huang, Jianqiang [2]
Hua, Xian-Sheng [2]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Alibaba Grp, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
graph convolutional networks; skeleton-based classification; FORM;
DOI
10.1145/3394171.3413666
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Skeleton-based human action recognition has attracted much attention with the prevalence of accessible depth sensors. Recently, graph convolutional networks (GCNs) have been widely used for this task due to their powerful capability to model graph data. The topology of the adjacency graph is a key factor for modeling the correlations of the input skeletons. Thus, previous methods mainly focus on the design/learning of the graph topology. But once the topology is learned, only a single-scale feature and one transformation exist in each layer of the networks. Many insights, such as multi-scale information and multiple sets of transformations, that have been proven to be very effective in convolutional neural networks (CNNs), have not been investigated in GCNs. The reason is that, due to the gap between graph-structured skeleton data and conventional image/video data, it is very challenging to embed these insights into GCNs. To overcome this gap, we reinvent the split-transform-merge strategy in GCNs for skeleton sequence processing. Specifically, we design a simple and highly modularized graph convolutional network architecture for skeleton-based action recognition. Our network is constructed by repeating a building block that aggregates multi-granularity information from both the spatial and temporal paths. Extensive experiments demonstrate that our network outperforms state-of-the-art methods by a significant margin with only 1/5 of the parameters and 1/10 of the FLOPs.
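The split-transform-merge idea in the abstract can be illustrated with a toy block. The following is a minimal sketch in plain Python, not the authors' implementation: the function names, the choice of adjacency powers as spatial branches, moving averages as temporal branches, uniform branch weights, and the merge by summation are all assumptions made for illustration. Input `x[t][v]` holds one scalar feature for joint `v` at frame `t`.

```python
# Hypothetical sketch: every name and design choice here is an assumption,
# not the architecture published in the paper.

def mat_vec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def normalized_adjacency(edges, n):
    """Row-normalised adjacency matrix with self-loops for n joints."""
    a = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    return [[v / sum(row) for v in row] for row in a]

def spatial_path(x, adj, orders, weights):
    """One branch per hop count k: apply adj k times, scale, then sum branches."""
    out = [[0.0] * len(frame) for frame in x]
    for k, w in zip(orders, weights):
        for t, frame in enumerate(x):
            h = frame
            for _ in range(k):
                h = mat_vec(adj, h)
            for v, val in enumerate(h):
                out[t][v] += w * val
    return out

def temporal_path(x, windows, weights):
    """One branch per window size: centred moving average over frames."""
    n_frames = len(x)
    out = [[0.0] * len(frame) for frame in x]
    for win, w in zip(windows, weights):
        half = win // 2
        for t in range(n_frames):
            lo, hi = max(0, t - half), min(n_frames, t + half + 1)
            for v in range(len(x[t])):
                avg = sum(x[s][v] for s in range(lo, hi)) / (hi - lo)
                out[t][v] += w * avg
    return out

def inception_block(x, adj, orders=(1, 2, 3), windows=(1, 3, 5)):
    """Split into spatial and temporal paths, transform each, merge by summation."""
    sw = [1.0 / len(orders)] * len(orders)    # uniform branch weights (assumed)
    tw = [1.0 / len(windows)] * len(windows)
    s = spatial_path(x, adj, orders, sw)
    t = temporal_path(x, windows, tw)
    return [[a + b for a, b in zip(fs, ft)] for fs, ft in zip(s, t)]
```

In a trained network each branch would carry learned weight matrices over many channels; the scalar weights here only stand in for those transformations so that the data flow stays visible.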
Pages: 2122 - 2130
Number of pages: 9
Related papers
50 in total
  • [1] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    Proceedings of the International Joint Conference on Neural Networks, 2021, 2021-July
  • [2] Position-aware spatio-temporal graph convolutional networks for skeleton-based action recognition
    Yang, Ping
    Wang, Qin
    Chen, Hao
    Wu, Zizhao
    IET COMPUTER VISION, 2023, 17 (07) : 844 - 854
  • [3] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition
    Li, Bin
    Li, Xi
    Zhang, Zhongfei
    Wu, Fei
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8561 - 8568
  • [5] Lightweight Multiscale Spatio-Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
    Zheng, Zhiyun
    Yuan, Qilong
    Zhang, Huaizhu
    Wang, Yizhou
    Wang, Junfeng
    BIG DATA MINING AND ANALYTICS, 2025, 8 (02) : 310 - 325
  • [6] PROGRESSIVE SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION
    Heidari, Negar
    Iosifidis, Alexandros
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3220 - 3224
  • [7] Skeleton-based action recognition using spatio-temporal features with convolutional neural networks
    Rostami, Zahra
    Afrasiabi, Mahlagha
    Khotanlou, Hassan
    2017 IEEE 4TH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2017, : 583 - 587
  • [8] Spatio-Temporal Motion Topology Aware Graph Convolutional Network for Skeleton-Based Action Recognition
    Ma, Ji
    Liu, Wei
    Ding, Linlin
    Luo, Hao
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 549 - 560
  • [9] Skeleton-based action recognition based on spatio-temporal adaptive graph convolutional neural-network
    Cao Y.
    Liu C.
    Huang Z.
    Sheng Y.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (11) : 5 - 10
  • [10] Temporal segment graph convolutional networks for skeleton-based action recognition
    Ding, Chongyang
    Wen, Shan
    Ding, Wenwen
    Liu, Kai
    Belyaev, Evgeny
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110