Transformer-based neural architecture search for effective visible-infrared person re-identification

Cited by: 0

Authors:

Sarker, Prodip Kumar [1]

Affiliation:

[1] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur 5400, Bangladesh

Source:

NEUROCOMPUTING | 2025 / Vol. 620

Keywords:

Transformer; Neural architecture search; Attention mechanism; Feature extraction; Cross-modality;

DOI:

10.1016/j.neucom.2024.129257

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Visible-infrared person re-identification (VI-reID) is a complex task in security and video surveillance that aims to identify and match a person captured by various non-overlapping cameras. In recent years, there has been a notable advancement in reID owing to the development of transformer-based architectures. Although many existing methods emphasize learning both modality-specific and shared features, challenges remain in fully exploiting the complementary information between infrared and visible modalities. Consequently, there is still opportunity to increase retrieval performance by effectively comprehending and integrating cross-modality semantic information. In addition, these designs often suffer from high model complexity and time-consuming processes. To tackle these issues, we employ a novel transformer-based neural architecture search (TNAS) deep learning approach for effective VI-reID. To alleviate modality gaps, we first introduce a global-local transformer (GLT) module that captures features at both global and local levels across different modalities, contributing to better feature representation and matching. Then, an efficient neural architecture search (NAS) module is developed to search for the optimal transformer-based architecture, which further enhances the performance of VI-reID. Additionally, we introduce distillation loss and modality discriminative (MD) loss to examine the potential consistency between different modalities and to promote intermodality separation between classes and intramodality compactness within classes. Experimental results on two challenging benchmark datasets show that our developed model achieves state-of-the-art results, outperforming existing VI-reID methods.


Pages: 10
