TransGait: Multimodal-based gait recognition with set transformer

被引:30
作者
Li, Guodong [1 ]
Guo, Lijun [1 ]
Zhang, Rong [1 ]
Qian, Jiangbo [1 ]
Gao, Shangce [2 ]
机构
[1] NingBo Univ, Fac Elect Engn & Comp Sci, Ningbo, Peoples R China
[2] Univ Toyama, Toyama, Japan
关键词
Gait recognition; Multi-modal; Transformer;
D O I
10.1007/s10489-022-03543-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a biological feature that can be recognized from a distance, gait has a wide range of applications such as crime prevention, judicial identification, and social security. However, gait recognition is still a challenging task with two problems in the typical gait recognition methods. First, the existing gait recognition methods have weak robustness to the pedestrians' clothing and carryings. Second, the existing temporal modeling methods for gait recognition fail to fully exploit the temporal relationships of the sequence and require that the gait sequence maintain unnecessary sequential constraints. In this paper, we propose a new multi-modal gait recognition framework based on silhouette and pose features to overcome these problems. Joint features of silhouettes and poses provide high discriminability and robustness to the pedestrians' clothing and carryings. Furthermore, we propose a set transformer model with a temporal aggregation operation for obtaining set-level spatio-temporal features. The temporal modeling approach is unaffected by frame permutations and can seamlessly integrate frames from different videos acquired in different scenarios, such as diverse viewing angles. Experiments on two public datasets, CASIA-B and GREW, demonstrate that the proposed method provides state-of-the-art performance. Under the most challenging condition of walking in different clothes on CASIA-B, the proposed method achieves a rank-1 accuracy of 85.8%, outperforming other methods by a significant margin (> 4%).
引用
收藏
页码:1535 / 1547
页数:13
相关论文
共 50 条
  • [41] LLE based gait recognition
    Li, HG
    Shi, CP
    Li, XG
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4516 - 4521
  • [42] Multi-Level Fine-Tuned Transformer for Gait Recognition
    Wu, Huimin
    Zhao, Aite
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 83 - 89
  • [43] Gait recognition with global-local feature fusion based on swin transformer-3DCNN
    Wang, Ting
    Zhou, Guanghang
    Pu, Yanfeng
    Moreno, Ramon
    Yang, Guoping
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [44] Multiscenario Open-Set Gait Recognition Based on Radar Micro-Doppler Signatures
    Yang, Yang
    Ge, Yanyan
    Li, Beichen
    Wang, Qing
    Lang, Yue
    Li, Kaiming
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [45] Multimodal Adaptive Identity-Recognition Algorithm Fused with Gait Perception
    Wang, Changjie
    Li, Zhihua
    Sarpong, Benjamin
    BIG DATA MINING AND ANALYTICS, 2021, 4 (04) : 223 - 232
  • [46] TriGait: Hybrid Fusion Strategy for Multimodal Alignment and Integration in Gait Recognition
    Sun, Yan
    Feng, Xueling
    Liu, Xiaolei
    Ma, Liyan
    Hu, Long
    Nixon, Mark S.
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (01): : 82 - 94
  • [47] A survey of transformer-based multimodal pre-trained modals
    Han, Xue
    Wang, Yi-Tong
    Feng, Jun-Lan
    Deng, Chao
    Chen, Zhan-Heng
    Huang, Yu-An
    Su, Hui
    Hu, Lun
    Hu, Peng-Wei
    NEUROCOMPUTING, 2023, 515 : 89 - 106
  • [48] Research on Modelling Capability of English Multimodal File Search based on Transformer
    Li, Hongjuan
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2025, 22 (01) : 116 - 123
  • [49] Improving multimodal fusion with Main Modal Transformer for emotion recognition in conversation
    Zou, ShiHao
    Huang, Xianying
    Shen, XuDong
    Liu, Hankai
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [50] Cross-scale cascade transformer for multimodal human action recognition
    Liu, Zhen
    Cheng, Qin
    Song, Chengqun
    Cheng, Jun
    PATTERN RECOGNITION LETTERS, 2023, 168 : 17 - 23