TransGait: Multimodal-based gait recognition with set transformer

Cited by: 30
Authors
Li, Guodong [1]
Guo, Lijun [1]
Zhang, Rong [1]
Qian, Jiangbo [1]
Gao, Shangce [2]
Affiliations
[1] NingBo Univ, Fac Elect Engn & Comp Sci, Ningbo, Peoples R China
[2] Univ Toyama, Toyama, Japan
Keywords
Gait recognition; Multi-modal; Transformer
DOI
10.1007/s10489-022-03543-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As a biometric feature that can be recognized from a distance, gait has a wide range of applications, such as crime prevention, judicial identification, and social security. However, gait recognition remains challenging, and typical methods have two problems. First, existing methods are not robust to changes in pedestrians' clothing and carried items. Second, existing temporal modeling methods fail to fully exploit the temporal relationships within a sequence and impose unnecessary ordering constraints on the gait sequence. In this paper, we propose a new multi-modal gait recognition framework based on silhouette and pose features to overcome these problems. Joint silhouette and pose features provide high discriminability and robustness to pedestrians' clothing and carried items. Furthermore, we propose a set transformer model with a temporal aggregation operation to obtain set-level spatio-temporal features. This temporal modeling approach is unaffected by frame permutations and can seamlessly integrate frames from different videos acquired in different scenarios, such as diverse viewing angles. Experiments on two public datasets, CASIA-B and GREW, demonstrate that the proposed method achieves state-of-the-art performance. Under the most challenging condition on CASIA-B, walking in different clothes, the proposed method achieves a rank-1 accuracy of 85.8%, outperforming other methods by a significant margin (> 4%).
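
The permutation-invariance claim in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a single-head self-attention layer without positional encoding, followed by an element-wise max over frames as a stand-in for the temporal aggregation operation. All names here (`self_attention`, `set_level_feature`, `rand_matrix`) and the sizes are hypothetical.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def self_attention(frames, w_q, w_k, w_v):
    """Single-head self-attention over frame features, with no positional encoding."""
    q, k, v = matmul(frames, w_q), matmul(frames, w_k), matmul(frames, w_v)
    scale = math.sqrt(len(k[0]))
    attended = []
    for q_row in q:
        scores = [sum(a * b for a, b in zip(q_row, k_row)) / scale for k_row in k]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        attended.append([
            sum(e / total * v_row[j] for e, v_row in zip(exps, v))
            for j in range(len(v[0]))
        ])
    return attended


def set_level_feature(frames, w_q, w_k, w_v):
    """Assumed temporal aggregation: element-wise max over all attended frames."""
    attended = self_attention(frames, w_q, w_k, w_v)
    return [max(column) for column in zip(*attended)]


def rand_matrix(rows, cols, std=0.3):
    """Random matrix used only as placeholder data and weights."""
    return [[random.gauss(0.0, std) for _ in range(cols)] for _ in range(rows)]


random.seed(0)
NUM_FRAMES, DIM = 12, 8  # illustrative sizes, not taken from the paper
frames = rand_matrix(NUM_FRAMES, DIM)
w_q, w_k, w_v = (rand_matrix(DIM, DIM) for _ in range(3))

# Shuffle the frame order and recompute the set-level feature.
shuffled = frames[:]
random.shuffle(shuffled)

original = set_level_feature(frames, w_q, w_k, w_v)
permuted = set_level_feature(shuffled, w_q, w_k, w_v)

# The two features agree up to floating-point rounding.
same = all(
    math.isclose(a, b, rel_tol=1e-9, abs_tol=1e-12)
    for a, b in zip(original, permuted)
)
print(same)
```

Because attention without positional encoding only reorders its output rows when the input rows are reordered, and a max over rows ignores order, the aggregated feature does not depend on frame order. The same reasoning suggests why frames from different videos could be pooled into one set.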
Pages: 1535-1547
Page count: 13