Transformer-based neural architecture search for effective visible-infrared person re-identification

被引:0
|
作者
Sarker, Prodip Kumar [1 ]
机构
[1] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur 5400, Bangladesh
关键词
Transformer; Neural architecture search; Attention mechanism; Feature extraction; Cross-modality;
D O I
10.1016/j.neucom.2024.129257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visible-infrared person re-identification (VI-reID) is a complex task insecurity and video surveillance that aims to identify and match a person captured by various non-overlapping cameras. In recent years, there has been a notable advancement in reID owing to the development of transformer-based architectures. Although many existing methods emphasize on learning both modality-specific and shared features, challenges remain in fully exploiting the complementary information between infrared and visible modalities. Consequently, there is still opportunity to increase retrieval performance by effectively comprehending and integrating cross- modality semantic information. These designs often have problems with model complexity and time-consuming processes. To tackle these issues, we employ a novel transformer-based neural architecture search (TNAS) deep learning approach for effective VI-reID. To alleviate modality gaps, we first introduce a global-local transformer (GLT) module that captures features at both global and local levels across different modalities, contributing to better feature representation and matching. Then, an efficient neural architecture search (NAS) module is developed to search for the optimal transformer-based architecture, which further enhances the performance of VI-reID. Additionally, we introduce distillation loss and modality discriminative (MD) loss to examine the potential consistency between different modalities to promote intermodality separation between classes and intramodality compactness within classes. Experimental results on two challenging benchmark datasets illustrate that our developed model achieves state-of-the-art results, outperforming existing VI-reID methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A simple but effective vision transformer framework for visible-infrared person re-identification
    Li, Yudong
    Zhao, Sanyuan
    Shen, Jianbing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [2] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
    Jiang, Kongzhu
    Zhang, Tianzhu
    Liu, Xiang
    Qian, Bingqiao
    Zhang, Yongdong
    Wu, Feng
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
  • [3] Attributes Based Visible-Infrared Person Re-identification
    Zheng, Aihua
    Feng, Mengya
    Pan, Peng
    Jiang, Bo
    Luo, Bin
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 254 - 266
  • [4] Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification
    Chen, Cuiqun
    Ye, Mang
    Qi, Meibin
    Wu, Jingjing
    Jiang, Jianguo
    Lin, Chia-Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2352 - 2364
  • [5] A guidance and alignment transformer model for visible-infrared person re-identification
    Huang, Linyu
    Xue, Zijie
    Ning, Qian
    Guo, Yong
    Li, Yongsheng
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [6] Occluded Visible-Infrared Person Re-Identification
    Feng, Yujian
    Ji, Yimu
    Wu, Fei
    Gao, Guangwei
    Gao, Yang
    Liu, Tianliang
    Liu, Shangdong
    Jing, Xiao-Yuan
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1401 - 1413
  • [7] Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification
    Liang, Tengfei
    Jin, Yi
    Liu, Wu
    Li, Yidong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8432 - 8444
  • [8] Learning multi-granularity representation with transformer for visible-infrared person re-identification
    Feng, Yujian
    Chen, Feng
    Sun, Guozi
    Wu, Fei
    Ji, Yimu
    Liu, Tianliang
    Liu, Shangdong
    Jing, Xiao-Yuan
    Luo, Jiebo
    PATTERN RECOGNITION, 2025, 164
  • [9] Dual-Stream Transformer With Distribution Alignment for Visible-Infrared Person Re-Identification
    Chai, Zehua
    Ling, Yongguo
    Luo, Zhiming
    Lin, Dazhen
    Jiang, Min
    Li, Shaozi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6764 - 6776
  • [10] Grayscale Enhancement Colorization Network for Visible-Infrared Person Re-Identification
    Zhong, Xian
    Lu, Tianyou
    Huang, Wenxin
    Ye, Mang
    Jia, Xuemei
    Lin, Chia-Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1418 - 1430