Transformer-based neural architecture search for effective visible-infrared person re-identification

被引:0
|
作者
Sarker, Prodip Kumar [1 ]
机构
[1] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur 5400, Bangladesh
关键词
Transformer; Neural architecture search; Attention mechanism; Feature extraction; Cross-modality;
D O I
10.1016/j.neucom.2024.129257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visible-infrared person re-identification (VI-reID) is a complex task insecurity and video surveillance that aims to identify and match a person captured by various non-overlapping cameras. In recent years, there has been a notable advancement in reID owing to the development of transformer-based architectures. Although many existing methods emphasize on learning both modality-specific and shared features, challenges remain in fully exploiting the complementary information between infrared and visible modalities. Consequently, there is still opportunity to increase retrieval performance by effectively comprehending and integrating cross- modality semantic information. These designs often have problems with model complexity and time-consuming processes. To tackle these issues, we employ a novel transformer-based neural architecture search (TNAS) deep learning approach for effective VI-reID. To alleviate modality gaps, we first introduce a global-local transformer (GLT) module that captures features at both global and local levels across different modalities, contributing to better feature representation and matching. Then, an efficient neural architecture search (NAS) module is developed to search for the optimal transformer-based architecture, which further enhances the performance of VI-reID. Additionally, we introduce distillation loss and modality discriminative (MD) loss to examine the potential consistency between different modalities to promote intermodality separation between classes and intramodality compactness within classes. Experimental results on two challenging benchmark datasets illustrate that our developed model achieves state-of-the-art results, outperforming existing VI-reID methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Cross-Modality Spatial-Temporal Transformer for Video-Based Visible-Infrared Person Re-Identification
    Feng, Yujian
    Chen, Feng
    Yu, Jian
    Ji, Yimu
    Wu, Fei
    Liu, Tianliang
    Liu, Shangdong
    Jing, Xiao-Yuan
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6582 - 6594
  • [32] Cross-Modality Semantic Consistency Learning for Visible-Infrared Person Re-Identification
    Liu, Min
    Zhang, Zhu
    Bian, Yuan
    Wang, Xueping
    Sun, Yeqing
    Zhang, Baida
    Wang, Yaonan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 568 - 580
  • [33] Dual Consistency-Constrained Learning for Unsupervised Visible-Infrared Person Re-Identification
    Yang, Bin
    Chen, Jun
    Chen, Cuiqun
    Ye, Mang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 1767 - 1779
  • [34] Dual-attentive cascade clustering learning for visible-infrared person re-identification
    Xianju Wang
    Cuiqun Chen
    Yong Zhu
    Shuguang Chen
    Multimedia Tools and Applications, 2024, 83 : 19729 - 19746
  • [35] Attention-Based Neural Architecture Search for Person Re-Identification
    Zhou, Qinqin
    Zhong, Bineng
    Liu, Xin
    Ji, Rongrong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6627 - 6639
  • [36] Adaptive Generation of Privileged Intermediate Information for Visible-Infrared Person Re-Identification
    Alehdaghi, Mahdi
    Josi, Arthur
    Cruz, Rafael M. O.
    Shamsolmoali, Pourya
    Granger, Eric
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 3400 - 3413
  • [37] Visible-infrared person re-identification via specific and shared representations learning
    Aihua Zheng
    Juncong Liu
    Zi Wang
    Lili Huang
    Chenglong Li
    Bing Yin
    Visual Intelligence, 1 (1):
  • [38] Context-aware and part alignment for visible-infrared person re-identification
    Zhao, Jiaqi
    Wang, Hanzheng
    Zhou, Yong
    Yao, Rui
    Zhang, Lixu
    El Saddik, Abdulmotaleb
    IMAGE AND VISION COMPUTING, 2023, 138
  • [39] Dual-Semantic Consistency Learning for Visible-Infrared Person Re-Identification
    Zhang, Yiyuan
    Kang, Yuhao
    Zhao, Sanyuan
    Shen, Jianbing
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1554 - 1565
  • [40] SSRR: Structural Semantic Representation Reconstruction for Visible-Infrared Person Re-Identification
    Yang, Xi
    Tian, Menghui
    Li, Meijie
    Wei, Ziyu
    Yuan, Liu
    Wang, Nannan
    Gao, Xinbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6273 - 6284