Person re-identification transformer with patch attention and pruning

被引:0
|
作者
Ndayishimiye, Fabrice [1 ]
Yoon, Gang-Joon [2 ]
Lee, Joonjae [1 ]
Yoon, Sang Min [3 ]
机构
[1] Keimyung Univ, Fac Comp Engn, 1095 Dalgubeol Daero, Daegu 42601, South Korea
[2] Natl Inst Math Sci, 70 Yuseong Daero 1689 Beon Gil, Daejeon 34047, South Korea
[3] Kookmin Univ, Coll Comp Sci, HCI Lab, 77 Jeongneung Ro, Seoul 02707, South Korea
关键词
Person re-identification; Token pruning; Vision transformer;
D O I
10.1016/j.jvcir.2024.104348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person re-identification (Re-ID), which is widely used in surveillance and tracking systems, aims to search individuals as they move between different camera views by maintaining identity across various camera views. In the realm of person re-identification (Re-ID), recent advancements have introduced convolutional neural networks (CNNs) and vision transformers (ViTs) as promising solutions. While CNN-based methods excel in local feature extraction, ViTs have emerged as effective alternatives to CNN-based person Re-ID, offering the ability to capture long-range dependencies through multi-head self-attention without relying on convolution and downsampling. However, it still faces challenges such as changes in illumination, viewpoint, pose, low resolutions, and partial occlusions. To address the limitations of widely used person Re-ID datasets and improve the generalization, we present a novel person Re-ID method that enhances global and local information interactions using self-attention modules within a ViT network. It leverages dynamic pruning to extract and prioritize essential image patches effectively. The designed patch selection and pruning for person Re-ID model resulted in a robust feature extractor even in scenarios with partial occlusion, background clutter, and illumination variations. Empirical validation demonstrates its superior performance compared to previous approaches and its adaptability across various domains.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Occlusion-Aware Transformer With Second-Order Attention for Person Re-Identification
    Li, Yanping
    Liu, Yizhang
    Zhang, Hongyun
    Zhao, Cairong
    Wei, Zhihua
    Miao, Duoqian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3200 - 3211
  • [22] Transformer-based Cross attention and Feature Diversity for Occluded Person Re-identification
    Kang S.
    Kim S.
    Seo K.
    Transactions of the Korean Institute of Electrical Engineers, 2023, 72 (01): : 108 - 113
  • [23] Multi-patch matching for Person Re-identification
    Labidi, Hocine
    Luo, Sen-Lin
    Boubekeur, Mohamed Bachir
    Benlefki, Tarek
    2015 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING AND PROCESSING TECHNOLOGY, 2015, 9622
  • [24] Constraint Patch Matching for Faster Person Re-identification
    Lejbolle, Aske R.
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    2017 IEEE INTERNATIONAL CONFERENCE ON IDENTITY, SECURITY AND BEHAVIOR ANALYSIS (ISBA), 2017,
  • [25] Feature Completion Transformer for Occluded Person Re-Identification
    Wang, Tao
    Liu, Mengyuan
    Liu, Hong
    Li, Wenhao
    Ban, Miaoju
    Guo, Tianyu
    Li, Yidi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8529 - 8542
  • [26] Vision transformer with multiple granularities for person re-identification
    Bingcai Chen
    Fansheng Zhang
    Xin Yang
    Qian Ning
    Victor C. M. Leung
    Neural Computing and Applications, 2023, 35 : 23213 - 23223
  • [27] Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification
    Ye, YuTeng
    Zhou, Hang
    Cai, Jiale
    Gao, Chenxing
    Zhang, Youjia
    Wang, Junle
    Hu, Qiang
    Yu, Junqing
    Yang, Wei
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6684 - 6692
  • [28] Filter pruning based on evolutionary algorithms for person re-identification
    Jiaqi Zhao
    Ying Chen
    Yufeng Zhong
    Yong Zhou
    Rui Yao
    Lixu Zhang
    Shixiong Xia
    Multimedia Tools and Applications, 2024, 83 : 32569 - 32586
  • [29] Vision transformer with multiple granularities for person re-identification
    Chen, Bingcai
    Zhang, Fansheng
    Yang, Xin
    Ning, Qian
    Leung, Victor C. M.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (31): : 23213 - 23223
  • [30] NFormer: Robust Person Re-identification with Neighbor Transformer
    Wang, Haochen
    Shen, Jiayi
    Liu, Yongtuo
    Gao, Yan
    Gavves, Efstratios
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7287 - 7297