GaitRA: triple-branch multimodal gait recognition with larger effective receptive fields and mixed attention

被引:0
作者
Xue L. [1 ]
Tao Z. [1 ]
机构
[1] Software College, Hebei Normal University, Shijiazhuang
关键词
Effective receptive field; Gait recognition; Motion pattern; Multimodal;
D O I
10.1007/s11042-024-19596-9
中图分类号
学科分类号
摘要
Gait Recognition, as a long-distance biometric technique for identity recognition, has attracted widespread attention in recent years. Previous academia typically employs minor convolutional networks to extract single-modal features from the silhouette or the joint skeleton. Nevertheless, silhouette-based methods are susceptible to clothing variations, while skeleton-based methods encounter the issue of missing physique information. Therefore, we propose a novel multimodal triple-branch network dubbed GaitRA to comprehensively acquire gait features from two aspects of silhouette and skeleton simultaneously. GaitRA consists of three branches: a 3D-CNN branch to extract the primary features of silhouette sequences, a 2D-CNN branch to obtain the secondary features of silhouette sequences, and a branch based on Spatio-Temporal Graph Convolution (ST-GCN) to gain joint skeleton features. More importantly, we innovatively introduce the RepLK-ACTION Module, which combines the RepLK Block based on the Swin Transformer and the ACTION Module based on a mixed attention mechanism from the Action Recognition. RepLK-ACTION Module establishes larger Effective Receptive Fields (ERFs) through the large-kernel re-parameterized convolution to attain more discriminative multimodal gait information, thereby enhancing the model performance under complex walking conditions. Experiments have demonstrated that GaitRA significantly improves performance on both CASIA-B and Gait3D datasets. Especially, the proposed method achieves excellent results of 73.1% (Rank-1), 86.4% (Rank-5), 64.3% (mAP), and 36.2% (mINP) on the Gait3D dataset. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:80225 / 80259
页数:34
相关论文
共 59 条
[1]  
Parashar A., Parashar A., Ding W., Shabaz M., Rida I., Data preprocessing and feature selection techniques in gait recognition: a comparative study of machine learning and deep learning approaches, Pattern Recogn Lett, 172, pp. 65-73, (2023)
[2]  
Habib Z., Mughal M.A., Khan M.A., Hamza A., Alturki N., Jamel L., A novel deep dual self-attention and Bi-LSTM fusion framework for Parkinson’s disease prediction using freezing of gait: a biometric application, Multimed Tools Appl, (2024)
[3]  
Parashar A., Parashar A., Abate A.F., Shekhawat R.S., Rida I., Real-time gait biometrics for surveillance applications: a review, Image Vis Comput, 138, (2023)
[4]  
Hanif C.A., Alimughal M., Khan M.A., Almujally N.A., Kim T., Cha J.-H., Human gait recognition for biometrics application based on deep learning fusion assisted framework, Comp Mater Contin, 78, 1, pp. 357-374, (2024)
[5]  
Fan C., Liang J., Shen C., Hou S., Huang Y., Yu S., OpenGait: revisiting gait recognition towards better practicality, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9707-9716, (2023)
[6]  
Rida I., Almaadeed N., Almaadeed S., Robust gait recognition: a comprehensive survey, IET Biometrics, 8, 1, pp. 14-28, (2019)
[7]  
Parashar A., Parashar A., Ding W., Shekhawat R.S., Rida I., Deep learning pipelines for recognition of gait biometrics with covariates: a comprehensive review, Artif Intell Rev, 56, 8, pp. 8889-8953, (2023)
[8]  
Parashar A., Parashar A., Rida I., Journey into gait biometrics: integrating deep learning for enhanced pattern recognition, Digit Signal Process, 147, (2024)
[9]  
Wan C., Wang L., Phoha V.V., A survey on gait recognition, ACM Comput Surv (CSUR), 51, 5, pp. 1-35, (2018)
[10]  
Sepas-Moghaddam A., Etemad A., Deep gait recognition: a survey, IEEE Trans Pattern Anal Mach Intell, 45, 1, pp. 264-284, (2022)