Correlation-Guided Semantic Consistency Network for Visible-Infrared Person Re-Identification

被引:12
作者
Li, Haojie [1 ]
Li, Mingxuan [2 ]
Peng, Qijie [3 ]
Wang, Shijie [2 ]
Yu, Hong [3 ]
Wang, Zhihui [2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Dalian Univ Technol, Int Sch Informat Sci & Engn, Dalian 116024, Peoples R China
[3] Dalian Univ Technol, Sch Software Technol, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Correlation; Semantics; Feature extraction; Task analysis; Pedestrians; Cameras; Robustness; Person re-identification; visible infrared; intra-modality and inter-modality correlation;
D O I
10.1109/TCSVT.2023.3340225
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Visible-infrared person re-identification (VI-ReID) has raised more attention in night-time surveillance applications due to the struggle to capture valid appearance information under poor illumination conditions via visible cameras. Existing works usually separate the modality-specific and modality-irrelevant information in visible and infrared features, or project features of two modalities into a unified embedding feature space directly, which aims to eliminate huge modality discrepancies. However, these methods neglect the intra-modality and inter-modality correlations. We argue that the correlations can implicitly guide the network to discover the modality-irrelevant information, thus more beneficial for eliminating huge modality discrepancies and preserving individual differences. To this end, we propose a novel framework, termed as correlation-guided semantic consistency network (CSC-Net), to explore and exploit the intra-modality and inter-modality correlations. Specifically, CSC-Net consists of a cross-modality semantic alignment (CSA) module, a cross-granularity discrepancy awareness (CDA) module, and a probability consistency constraint (PCC) module. CSA mines the inter-modality correlation by calculating the semantic similarity between modalities to explore modality-irrelevant features, and then transfers the learned features to the backbone network to face the input of only single modality images. To preserve the individual differences, CDA sufficiently utilizes the intra-modality correlation via exploring the multi-granularity discriminative information. Finally, PCC constrains the network at the probability level, cooperating with the CSA which constrains at the feature level, to further alleviate the modality discrepancy. Extensive experiments on two public VI-ReID datasets SYSU-MM01 and RegDB have verified the effectiveness of our approach.
引用
收藏
页码:4503 / 4515
页数:13
相关论文
共 70 条
  • [1] A general tensor representation framework for cross-view gait recognition
    Ben, Xianye
    Zhang, Peng
    Lai, Zhihui
    Yan, Rui
    Zhai, Xinliang
    Meng, Weixiao
    [J]. PATTERN RECOGNITION, 2019, 90 : 87 - 98
  • [2] Coupled Patch Alignment for Matching Cross-View Gaits
    Ben, Xianye
    Gong, Chen
    Zhang, Peng
    Jia, Xitong
    Wu, Qiang
    Meng, Weixiao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) : 3142 - 3157
  • [3] Hierarchical Shot Detector
    Cao, Jiale
    Pang, Yanwei
    Han, Jungong
    Li, Xuelong
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9704 - 9713
  • [4] ABD-Net: Attentive but Diverse Person Re-Identification
    Chen, Tianlong
    Ding, Shaojin
    Xie, Jingyi
    Yuan, Ye
    Chen, Wuyang
    Yang, Yang
    Ren, Zhou
    Wang, Zhangyang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8350 - 8360
  • [5] Beyond triplet loss: a deep quadruplet network for person re-identification
    Chen, Weihua
    Chen, Xiaotang
    Zhang, Jianguo
    Huang, Kaiqi
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1320 - 1329
  • [6] Neural Feature Search for RGB-Infrared Person Re-Identification
    Chen, Yehansen
    Wan, Lin
    Li, Zhihang
    Jing, Qianyan
    Sun, Zongyuan
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 587 - 597
  • [7] Choi S, 2020, PROC CVPR IEEE, P10254, DOI 10.1109/CVPR42600.2020.01027
  • [8] Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677
  • [9] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [10] Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification
    Feng, Zhanxiang
    Lai, Jianhuang
    Xie, Xiaohua
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 579 - 590