Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-Based Human Action Recognition

被引:1
作者
Liu, Yanan [1 ]
Li, Yanqiu [2 ]
Zhang, Hao [1 ]
Zhang, Xuejie [1 ]
Xu, Dan [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650504, Peoples R China
[2] Univ Melbourne, Sch Comp & Informat Syst, Parkville, Vic 3010, Australia
基金
中国国家自然科学基金;
关键词
Skeleton; Knowledge engineering; Feature extraction; Computational modeling; Topology; Computational efficiency; Convolution; Action recognition; skeleton-based data; graph convolution; knowledge distillation;
D O I
10.1109/TCSVT.2024.3399126
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Skeleton-based action recognition has broad prospects owing to the fact that skeleton data is more robust to scene noise and camera view changes. Recently, researchers mainly aim to explore deep-learning feature engineering with competitive recognition accuracy for skeleton actions. However, a high-performance recognition network is usually stacked by complex feature extraction modules introducing massive computational costs. In this work, we designed a powerful and universal action knowledge distillation paradigm based on decoupled knowledge distillation for transferring action knowledge from heavy teachers to lightweight students more robustly. We constructed a network architecture space consisting of the shrinking versions of outdated 2s-AGCN and searched for several robust students. On this basis, this paradigm is further developed into a powerful decoupled knowledge embedded graph convolutional network (DKE-GCN), which outperforms the teacher significantly on three public datasets and achieves the state-of-the-art. In addition, a light-DKE-GCN is designed to achieve comparable performance with teacher with 16x less parameters, 26x less FLOPs and 8x FPS.
引用
收藏
页码:9445 / 9457
页数:13
相关论文
共 58 条
[1]   Structural Knowledge Distillation for Efficient Skeleton-Based Action Recognition [J].
Bian, Cunling ;
Feng, Wei ;
Wan, Liang ;
Wang, Song .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2963-2976
[2]   Skeleton-Based Action Recognition With Gated Convolutional Neural Networks [J].
Cao, Congqi ;
Lan, Cuiling ;
Zhang, Yifan ;
Zeng, Wenjun ;
Lu, Hanqing ;
Zhang, Yanning .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) :3247-3257
[3]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[4]   Distilling Knowledge via Knowledge Review [J].
Chen, Pengguang ;
Liu, Shu ;
Zhao, Hengshuang ;
Jia, Jiaya .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5006-5015
[5]   AGPN: Action Granularity Pyramid Network for Video Action Recognition [J].
Chen, Yatong ;
Ge, Hongwei ;
Liu, Yuxuan ;
Cai, Xinye ;
Sun, Liang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) :3912-3923
[6]   Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [J].
Chen, Yuxin ;
Zhang, Ziqi ;
Yuan, Chunfeng ;
Li, Bing ;
Deng, Ying ;
Hu, Weiming .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13339-13348
[7]   Extremely Lightweight Skeleton-Based Action Recognition With ShiftGCN plus [J].
Cheng, Ke ;
Zhang, Yifan ;
He, Xiangyu ;
Cheng, Jian ;
Lu, Hanqing .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :7333-7348
[8]   Skeleton-Based Action Recognition with Shift Graph Convolutional Network [J].
Cheng, Ke ;
Zhang, Yifan ;
He, Xiangyu ;
Chen, Weihan ;
Cheng, Jian ;
Lu, Hanqing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :180-189
[9]   InfoGCN: Representation Learning for Human Skeleton-based Action Recognition [J].
Chi, Hyung-gun ;
Ha, Myoung Hoon ;
Chi, Seunggeun ;
Lee, Sang Wan ;
Huang, Qixing ;
Ramani, Karthik .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :20154-20164
[10]   Exploring the Trade-off Between Accuracy and Observational Latency in Action Recognition [J].
Ellis, Chris ;
Masood, Syed Zain ;
Tappen, Marshall F. ;
LaViola, Joseph J., Jr. ;
Sukthankar, Rahul .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (03) :420-436