Constructing Stronger and Faster Baselines for Skeleton-Based Action Recognition

被引:250
|
作者
Song, Yi-Fan [1 ,2 ]
Zhang, Zhang [1 ,2 ]
Shan, Caifeng [3 ,4 ]
Wang, Liang [1 ,2 ]
机构
[1] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[2] Chinese Acad Sci CASIA, Inst Automat, Ctr Res Intelligent Percept & Comp CRIPAC, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
[3] Shandong Univ Sci & Technol SDUST, Coll Elect Engn & Automation, Qingdao 266590, Peoples R China
[4] Chinese Acad Sci CAS AIR, Artificial Intelligence Res, Beijing 100190, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Action recognition; skeleton sequence; graph convolutional network; EfficientNet; separable convolution; PERSON REIDENTIFICATION;
D O I
10.1109/TPAMI.2022.3157033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One essential problem in skeleton-based action recognition is how to extract discriminative features over all skeleton joints. However, the complexity of the recent State-Of-The-Art (SOTA) models for this task tends to be exceedingly sophisticated and over-parameterized. The low efficiency in model training and inference has increased the validation costs of model architectures in large-scale datasets. To address the above issue, recent advanced separable convolutional layers are embedded into an early fused Multiple Input Branches (MIB) network, constructing an efficient Graph Convolutional Network (GCN) baseline for skeleton-based action recognition. In addition, based on such the baseline, we design a compound scaling strategy to expand the model's width and depth synchronously, and eventually obtain a family of efficient GCN baselines with high accuracies and small amounts of trainable parameters, termed EfficientGCN-Bx, where "x " denotes the scaling coefficient. On two large-scale datasets, i.e., NTU RGB+D 60 and 120, the proposed EfficientGCN-B4 baseline outperforms other SOTA methods, e.g., achieving 92.1% accuracy on the cross-subject benchmark of NTU 60 dataset, while being 5.82x smaller and 5.85x faster than MS-G3D, which is one of the SOTA methods. The source code in PyTorch version and the pretrained models are available at https://github.com/yfsong0709/EfficientGCNv1.
引用
收藏
页码:1474 / 1488
页数:15
相关论文
共 50 条
  • [41] Graph-aware transformer for skeleton-based action recognition
    Zhang, Jiaxu
    Xie, Wei
    Wang, Chao
    Tu, Ruide
    Tu, Zhigang
    VISUAL COMPUTER, 2023, 39 (10): : 4501 - 4512
  • [42] Ghost Graph Convolutional Network for Skeleton-based Action Recognition
    Jang, Sungjun
    Lee, Heansung
    Cho, Suhwan
    Woo, Sungmin
    Lee, Sangyoun
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,
  • [43] Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition
    Zhu, Yiran
    Huang, Guangji
    Xu, Xing
    Ji, Yanli
    Shen, Fumin
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 518 - 526
  • [44] MOTION-LET CLUSTERING FOR SKELETON-BASED ACTION RECOGNITION
    Yang, Jianyu
    Zhu, Chen
    Yuan, Junsong
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 150 - 155
  • [45] Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition
    Tang, Yansong
    Tian, Yi
    Lu, Jiwen
    Li, Peiyang
    Zhou, Jie
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5323 - 5332
  • [46] A High Invariance Motion Representation for Skeleton-Based Action Recognition
    Guo, Songrui
    Pan, Huawei
    Tan, Guanghua
    Chen, Lin
    Gao, Chunming
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (08)
  • [47] A Cross View Learning Approach for Skeleton-Based Action Recognition
    Zheng, Hui
    Zhang, Xinming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3061 - 3072
  • [48] STAR: An STGCN ARchitecture for Skeleton-Based Human Action Recognition
    Wu, Weiwei
    Tu, Fengbin
    Niu, Mengqi
    Yue, Zhiheng
    Liu, Leibo
    Wei, Shaojun
    Li, Xiangyu
    Hu, Yang
    Yin, Shouyi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (06) : 2370 - 2383
  • [49] Skeleton-Based Human Action Recognition via Screw Matrices
    DING Wenwen
    LIU Kai
    XU Biao
    CHENG Fei
    ChineseJournalofElectronics, 2017, 26 (04) : 790 - 796
  • [50] Recurrent graph convolutional networks for skeleton-based action recognition
    Zhu, Guangming
    Yang, Lu
    Zhang, Liang
    Shen, Peiyi
    Song, Juan
    Proceedings - International Conference on Pattern Recognition, 2020, : 1352 - 1359