Toward Complete-View and High-Level Pose-Based Gait Recognition

Cited by: 20
Authors
Pan, Honghu [1]
Chen, Yongyong [1]
Xu, Tingyang [2]
He, Yunqi [3]
He, Zhenyu [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Tencent AI Lab, Shenzhen 518000, Peoples R China
[3] Northeast Forestry Univ, Coll Informat & Comp Engn, Harbin 150000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Gait recognition; Convolutional neural networks; Generative adversarial networks; Training; Three-dimensional displays; Generators; Feature extraction; Adversarial training; Hypergraph convolution
DOI
10.1109/TIFS.2023.3254449
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Model-based gait recognition methods usually identify people from pedestrian walking postures. However, existing methods do not explicitly address the large intra-class variance of human pose caused by changes in camera view. In this paper, we propose a lower-upper generative adversarial network (LUGAN) that generates multi-view pose sequences for each single-view sample to reduce the cross-view variance. Based on the prior of camera imaging, we prove that the spatial coordinates of cross-view poses are related by a linear transformation with a full-rank matrix. Hence, LUGAN employs adversarial training to learn full-rank transformation matrices from the source pose and target views and thereby obtain the target pose sequences. The generator of LUGAN is composed of graph convolutional (GCN) layers, fully connected (FC) layers and two-branch convolutional (CNN) layers. The GCN and FC layers encode the source pose sequence and target view; the CNN layers then take the encoded features as input to learn a lower triangular matrix and an upper triangular matrix; finally, the transformation matrix is formed by multiplying the lower and upper triangular matrices. For adversarial training, we develop a conditional discriminator that distinguishes whether a pose sequence is real or generated. Furthermore, to facilitate high-level correlation learning, we propose a plug-and-play module, named multi-scale hypergraph convolution (HGC), to replace the spatial graph convolutional layer in the baseline; it can simultaneously model joint-level, part-level and body-level correlations. Extensive experiments on three large gait recognition datasets (i.e., CASIA-B, OUMVLP-Pose and NLPR) demonstrate that our method outperforms the baseline model by a large margin.
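The lower-upper construction described in the abstract can be illustrated with a small sketch. This is not the LUGAN implementation: the 3x3 matrix size, the per-joint application of the matrix, and all numeric values are assumptions made here for illustration, and in the paper the triangular factors are learned by a network rather than written by hand. The sketch only shows why the product of a lower and an upper triangular matrix with nonzero diagonals is full-rank: its determinant equals the product of the two diagonals.

```python
# Illustrative sketch only; NOT the authors' LUGAN code.
# Assumptions: joints are (x, y, z) triples and the transform is 3x3.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along row 0."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

# Hypothetical triangular factors with nonzero diagonals.
lower = [[1.0, 0.0, 0.0],
         [0.5, 2.0, 0.0],
         [-1.0, 0.25, 0.5]]
upper = [[2.0, 1.0, -0.5],
         [0.0, 1.0, 0.5],
         [0.0, 0.0, 4.0]]

# Transformation matrix formed as the product of the two factors.
transform = matmul(lower, upper)

# det(L U) = det(L) * det(U) = product of both diagonals.
diag_product = 1.0
for i in range(3):
    diag_product *= lower[i][i] * upper[i][i]

def transform_pose(pose, t):
    """Apply t to every joint, treating each joint as a column vector."""
    return [[sum(t[r][c] * joint[c] for c in range(3)) for r in range(3)]
            for joint in pose]

source_pose = [[0.0, 1.0, 0.0], [0.5, 0.5, 0.1]]  # two made-up joints
target_pose = transform_pose(source_pose, transform)
```

Because every diagonal entry of both factors is nonzero, `det3(transform)` is nonzero and the mapping from source to target coordinates is invertible, which is the full-rank property the abstract relies on.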
Pages: 2104-2118
Number of pages: 15