Combining information augmentation aggregation and dual-granularity feature fusion for visible-infrared person re-identification

被引:1
作者
Tao, Mengzhu [1 ]
Long, Huiyun [1 ]
Kong, Guangqian [1 ]
Duan, Xun [1 ]
机构
[1] Guizhou Univ, Coll Comp Sci & Technol, State Key Lab Publ Big Data, Guiyang 550025, Guizhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Multi-granularity features; Modality discrepancy; Two-stream network;
D O I
10.1007/s11760-024-03800-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Visible-Infrared Person Re-Identification (VI-ReID), which aims to retrieve pedestrian images captured by visible and infrared cameras, presents a significant challenge in intelligent surveillance systems. VI-ReID should not only tackle the modality discrepancies between visible and infrared images, but also address intra-modality discrepancies caused by factors such as image occlusion, lighting changes, and background complexity. In this paper, we propose a novel VI-ReID network (IADGN) that effectively combines an Information Augmentation Aggregation Module (IAAM) with a Dual-Granularity Feature Module (DGFM) to balance the processing of cross-modality and intra-modality discrepancies. First, during the data processing stage, a random grayscale strategy is employed for both visible and infrared images to effectively minimize the interference of color information. Second, to address common intra-modality discrepancies such as occlusion, lighting changes, and background complexity, we design an Information Augmentation Aggregation Module (IAAM) based on a self-attention mechanism. This module accurately focuses on pedestrian features and aggregates them with the original features in a channel-adaptive manner, effectively ignoring the cluttered background and enhancing feature discriminability. Additionally, we propose a Dual-Granularity Feature Module (DGFM) that integrates global and local features, overcoming the limitations of single-granularity feature learning, and significantly enhancing the network's recognition accuracy. Finally, we propose the Center Distribution Consistency Loss function (CDCL), which reduces modality discrepancies and enhances the modality consistency of feature representation by aligning the inter-class distributions of visible and infrared images. Extensive experimental results on three publicly available datasets-SYSU-MM01, RegDB, and LLCM-demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
页数:12
相关论文
共 47 条
[1]   Dual-granularity feature fusion in visible-infrared person re-identification [J].
Cai, Shuang ;
Yang, Shanmin ;
Hu, Jing ;
Wu, Xi .
IET IMAGE PROCESSING, 2024, 18 (04) :972-980
[2]  
Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677
[3]   MSO: Multi-Feature Space Joint Optimization Network for RGB-Infrared Person Re-Identification [J].
Gao, Yajun ;
Liang, Tengfei ;
Jin, Yi ;
Gu, Xiaoyan ;
Liu, Wu ;
Li, Yidong ;
Lang, Congyan .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :5257-5265
[4]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[5]   IGIE-net: Cross-modality person re-identification via intermediate modality image generation and discriminative information enhancement [J].
Guo, Jiangtao ;
Du, Haishun ;
Hao, Xinxin ;
Zhang, Minghao .
IMAGE AND VISION COMPUTING, 2024, 147
[6]   Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation [J].
Hao, Xin ;
Zhao, Sanyuan ;
Ye, Mang ;
Shen, Jianbing .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :16383-16392
[7]   Dual-alignment Feature Embedding for Cross-modality Person Re-identification [J].
Hao, Yi ;
Wang, Nannan ;
Gao, Xinbo ;
Li, Jie ;
Wang, Xiaoyu .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :57-65
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]  
Huang Biwei, 2024, Advances in Neural Information Processing Systems, V36
[10]   Exploring modality-shared appearance features and modality-invariant relation features for cross-modality person Re-IDentification [J].
Huang, Nianchang ;
Liu, Jianan ;
Luo, Yongjiang ;
Zhang, Qiang ;
Han, Jungong .
PATTERN RECOGNITION, 2023, 135