GRA: Graph Representation Alignment for Semi-Supervised Action Recognition

被引:1
作者
Huang, Kuan-Hung [1 ]
Huang, Yao-Bang [1 ]
Lin, Yong-Xiang [1 ]
Hua, Kai-Lung [1 ]
Tanveer, M. [2 ]
Lu, Xuequan [3 ]
Razzak, Imran [4 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 10607, Taiwan
[2] Indian Inst Technol Indore, Dept Math, Indore 453552, India
[3] La Trobe Univ, Dept Comp Sci & IT, Melbourne, Vic 3086, Australia
[4] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
关键词
Action recognition; consistency regularization; graph convolutional networks (GCNs); graph representation learning; self-training (ST); semi-supervised learning; skeleton action recognition; NETWORKS;
D O I
10.1109/TNNLS.2023.3347593
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional networks (GCNs) have emerged as a powerful tool for action recognition, leveraging skeletal graphs to encapsulate human motion. Despite their efficacy, a significant challenge remains the dependency on huge labeled datasets. Acquiring such datasets is often prohibitive, and the frequent occurrence of incomplete skeleton data, typified by absent joints and frames, complicates the testing phase. To tackle these issues, we present graph representation alignment (GRA), a novel approach with two main contributions: 1) a self-training (ST) paradigm that substantially reduces the need for labeled data by generating high-quality pseudo-labels, ensuring model stability even with minimal labeled inputs and 2) a representation alignment (RA) technique that utilizes consistency regularization to effectively reduce the impact of missing data components. Our extensive evaluations on the NTU RGB+D and Northwestern-UCLA (N-UCLA) benchmarks demonstrate that GRA not only improves GCN performance in data-constrained environments but also retains impressive performance in the face of data incompleteness.
引用
收藏
页码:11896 / 11905
页数:10
相关论文
共 56 条
[1]   Graph Convolutional Neural Network for Human Action Recognition: A Comprehensive Survey [J].
Ahmad T. ;
Jin L. ;
Zhang X. ;
Lai S. ;
Tang G. ;
Lin L. .
IEEE Transactions on Artificial Intelligence, 2021, 2 (02) :128-145
[2]  
[Anonymous], 2011, CVPR, DOI DOI 10.1109/CVPR.2011.5995316
[3]  
Arazo D., 2020, ARXIV
[4]  
Bengio Y., 2009, P 26 ANN INT C MACH, P41
[5]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[6]  
Cascante-Bonilla P, 2021, AAAI CONF ARTIF INTE, V35, P6912
[7]  
Chapelle O., 2009, IEEE Trans. Neural Netw., V20, P542, DOI [DOI 10.1109/TNN.2009.2015974, 10.1109/TNN.2009.2015974]
[8]   Perspective Preserving Style Transfer for Interior Portraits [J].
Chen, Wen-Yin ;
Ople, Jose Jaena Mari ;
Si, Maynard John ;
Tan, Daniel Stanley ;
Hua, Kai-Lung .
IEEE ACCESS, 2021, 9 (09) :7033-7042
[9]   Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [J].
Chen, Yuxin ;
Zhang, Ziqi ;
Yuan, Chunfeng ;
Li, Bing ;
Deng, Ying ;
Hu, Weiming .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13339-13348
[10]  
Cheng K, 2021, Disc Algorithms, P1, DOI 10.1017/pls.2021.19