Mix Dimension in Poincare Geometry for 3D Skeleton-based Action Recognition

被引:38
|
作者
Peng, Wei [1 ]
Shi, Jingang [2 ]
Xia, Zhaoqiang [3 ]
Zhao, Guoying [1 ]
机构
[1] Univ Oulu, CMVS, Oulu, Finland
[2] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[3] Northwestern Polytech Univ, Xian, Peoples R China
基金
芬兰科学院; 中国国家自然科学基金;
关键词
Skeleton-based Action Recognition; Graph Topology Analysis; Rie-mann Manifold; Graph Convolutional Networks;
D O I
10.1145/3394171.3413910
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Convolutional Networks (GCNs) have already demonstrated their powerful ability to model the irregular data, e.g., skeletal data in human action recognition, providing an exciting new way to fuse rich structural information for nodes residing in different parts of a graph. In human action recognition, current works introduce a dynamic graph generation mechanism to better capture the underlying semantic skeleton connections and thus improves the performance. In this paper, we provide an orthogonal way to explore the underlying connections. Instead of introducing an expensive dynamic graph generation paradigm, we build a more efficient GCN on a Riemann manifold, which we think is a more suitable space to model the graph data, to make the extracted representations fit the embedding matrix. Specifically, we present a novel spatial-temporal GCN (ST-GCN) architecture which is defined via the Poincare geometry such that it is able to better model the latent anatomy of the structure data. To further explore the optimal projection dimension in the Riemann space, we mix different dimensions on the manifold and provide an efficient way to explore the dimension for each ST-GCN layer. With the final resulted architecture, we evaluate our method on two current largest scale 3D datasets, i.e., NTU RGB+D and NTU RGB+D 120. The comparison results show that the model could achieve a superior performance under any given evaluation metrics with only 40% model size when compared with the previous best GCN method, which proves the effectiveness of our model.
引用
收藏
页码:1432 / 1440
页数:9
相关论文
共 50 条
  • [1] 3D skeleton-based action recognition with convolutional neural networks
    Van-Nam Hoang
    Thi-Lan Le
    Thanh-Hai Tran
    Hai-Vu
    Van-Toi Nguyen
    2019 INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2019,
  • [2] Learning Clip Representations for Skeleton-Based 3D Action Recognition
    Ke, Qiuhong
    Bennamoun, Mohammed
    An, Senjian
    Sohel, Ferdous
    Boussaid, Farid
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) : 2842 - 2855
  • [3] A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
    Ren, Bin
    Liu, Mengyuan
    Ding, Runwei
    Liu, Hong
    CYBORG AND BIONIC SYSTEMS, 2024, 5
  • [4] Tripool: Graph triplet pooling for 3D skeleton-based action recognition
    Peng, Wei
    Hong, Xiaopeng
    Zhao, Guoying
    PATTERN RECOGNITION, 2021, 115
  • [5] AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
    Guan, Shannan
    Lu, Haiyan
    Zhu, Linchao
    Fang, Gengfa
    NEUROCOMPUTING, 2022, 514 : 256 - 267
  • [6] Understanding the Gap between 2D and 3D Skeleton-Based Action Recognition
    Elias, Petr
    Sedmidubsky, Jan
    Zezula, Pavel
    2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019), 2019, : 192 - 195
  • [7] HIF3D: Handwriting -Inspired Features for 3D skeleton-based action recognition
    Boulahia, Said Yacine
    Anquetil, Eric
    Kulpa, Richard
    Multon, Franck
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 985 - 990
  • [8] Enhancing Robustness of Viewpoint Changes in 3D Skeleton-Based Human Action Recognition
    Park, Jinyoon
    Kim, Chulwoong
    Kim, Seung-Chan
    MATHEMATICS, 2023, 11 (15)
  • [9] A three-stream fusion network for 3D skeleton-based action recognition
    Fang, Ming
    Liu, Qi
    Ren, Jianping
    Li, Jie
    Du, Xinning
    Liu, Shuhua
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [10] Spatiotemporal decoupling attention transformer for 3D skeleton-based driver action recognition
    Xu, Zhuoyan
    Xu, Jingke
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)