Toward Complete-View and High-Level Pose-Based Gait Recognition

被引:16
作者
Pan, Honghu [1 ]
Chen, Yongyong [1 ]
Xu, Tingyang [2 ]
He, Yunqi [3 ]
He, Zhenyu [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Tencent AI Lab, Shenzhen 150000, Peoples R China
[3] Northeast Forestry Univ, Coll Informat & Comp Engn, Harbin 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Gait recognition; Convolutional neural networks; Generative adversarial networks; Training; Three-dimensional displays; Generators; Feature extraction; adversarial training; hypergraph convolution;
D O I
10.1109/TIFS.2023.3254449
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Model-based gait recognition methods usually adopt the pedestrian walking postures to identify human beings. However, existing methods did not explicitly resolve the large intra-class variance of human pose due to changes in camera view. In this paper, we propose a lower-upper generative adversarial network (LUGAN) to generate multi-view pose sequences for each single-view sample to reduce the cross-view variance. Based on the prior of camera imaging, we prove that the spatial coordinates between cross-view poses satisfy a linear transformation of a full-rank matrix. Hence, LUGAN employs the adversarial training to learn full-rank transformation matrices from the source pose and target views to obtain the target pose sequences. The generator of LUGAN is composed of graph convolutional (GCN) layers, fully connected (FC) layers and two-branch convolutional (CNN) layers: GCN layers and FC layers encode the source pose sequence and target view, then CNN layers take as input the encoded features to learn a lower triangular matrix and an upper one, finally the transformation matrix is formulated by multiplying the lower and upper triangular matrices. For the purpose of adversarial training, we develop a conditional discriminator that distinguishes whether the pose sequence is true or generated. Furthermore, to facilitate the high-level correlation learning, we propose a plug-and-play module, named multi-scale hypergraph convolution (HGC), to replace the spatial graph convolutional layer in baseline, which can simultaneously model the joint-level, part-level and body-level correlations. Extensive experiments on three large gait recognition datasets (i.e., CASIA-B, OUMVLP-Pose and NLPR) demonstrate that our method outperforms the baseline model by a large margin.
引用
收藏
页码:2104 / 2118
页数:15
相关论文
共 69 条
  • [1] Arjovsky Martin, 2017, arXiv, DOI 10.48550/arXiv.1701.04862
  • [2] Hypergraph convolution and hypergraph attention
    Bai, Song
    Zhang, Feihu
    Torr, Philip H. S.
    [J]. PATTERN RECOGNITION, 2021, 110
  • [3] Bhanu B, 2003, LECT NOTES COMPUT SC, V2688, P600
  • [4] Bruna J., 2013, ABS13126203 CORR, P1
  • [5] Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks
    Cai, Yujun
    Ge, Liuhao
    Liu, Jun
    Cai, Jianfei
    Cham, Tat-Jen
    Yuan, Junsong
    Thalmann, Nadia Magnenat
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2272 - 2281
  • [6] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [7] Multi-View Gait Image Generation for Cross-View Gait Recognition
    Chen, Xin
    Luo, Xizhao
    Weng, Jian
    Luo, Weiqi
    Li, Huiting
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3041 - 3055
  • [8] Skeleton-Based Action Recognition with Shift Graph Convolutional Network
    Cheng, Ke
    Zhang, Yifan
    He, Xiangyu
    Chen, Weihan
    Cheng, Jian
    Lu, Hanqing
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 180 - 189
  • [9] Skeleton-Based Gait Recognition via Robust Frame-Level Matching
    Choi, Seokeon
    Kim, Jonghee
    Kim, Wonjun
    Kim, Changick
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (10) : 2577 - 2592
  • [10] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
    Choi, Yunjey
    Choi, Minje
    Kim, Munyoung
    Ha, Jung-Woo
    Kim, Sunghun
    Choo, Jaegul
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8789 - 8797