Correlation-Guided Semantic Consistency Network for Visible-Infrared Person Re-Identification

Cited by: 22

Authors:

Li, Haojie [1]

Li, Mingxuan [2]

Peng, Qijie [3]

Wang, Shijie [2]

Yu, Hong [3]

Wang, Zhihui [2]

Affiliations:

[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

[2] Dalian Univ Technol, Int Sch Informat Sci & Engn, Dalian 116024, Peoples R China

[3] Dalian Univ Technol, Sch Software Technol, Dalian 116024, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024 / Vol. 34 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Correlation; Semantics; Feature extraction; Task analysis; Pedestrians; Cameras; Robustness; Person re-identification; visible infrared; intra-modality and inter-modality correlation;

DOI:

10.1109/TCSVT.2023.3340225

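Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code: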

0808 ; 0809 ;

Abstract:

Visible-infrared person re-identification (VI-ReID) has attracted growing attention in night-time surveillance applications because visible cameras struggle to capture valid appearance information under poor illumination. Existing works usually separate the modality-specific and modality-irrelevant information in visible and infrared features, or directly project the features of the two modalities into a unified embedding space, aiming to eliminate the large modality discrepancy. However, these methods neglect the intra-modality and inter-modality correlations. We argue that these correlations can implicitly guide the network to discover modality-irrelevant information, and are thus more beneficial for eliminating the modality discrepancy while preserving individual differences. To this end, we propose a novel framework, termed correlation-guided semantic consistency network (CSC-Net), to explore and exploit the intra-modality and inter-modality correlations. Specifically, CSC-Net consists of a cross-modality semantic alignment (CSA) module, a cross-granularity discrepancy awareness (CDA) module, and a probability consistency constraint (PCC) module. CSA mines the inter-modality correlation by calculating the semantic similarity between modalities to explore modality-irrelevant features, and then transfers the learned features to the backbone network so that it can handle inputs from only a single modality. To preserve individual differences, CDA exploits the intra-modality correlation by exploring multi-granularity discriminative information. Finally, PCC constrains the network at the probability level, complementing CSA, which constrains it at the feature level, to further alleviate the modality discrepancy. Extensive experiments on two public VI-ReID datasets, SYSU-MM01 and RegDB, verify the effectiveness of our approach.
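
The abstract names two ideas that a small example may help make concrete: a similarity computed between visible and infrared features, and a consistency constraint applied at the probability level. The sketch below is only an illustration under stated assumptions and is not the CSC-Net implementation. The abstract does not specify the similarity measure or the loss, so cosine similarity and a symmetric KL divergence are assumed here, and all function names are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two feature vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cross_modality_similarity(visible_feats, infrared_feats):
    """Similarity matrix: rows index visible features, columns infrared ones."""
    return [[cosine_similarity(v, r) for r in infrared_feats]
            for v in visible_feats]


def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions; eps avoids log(0)."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def symmetric_consistency(logits_visible, logits_infrared):
    """Symmetric KL between the two modalities' class probabilities.

    An assumed stand-in for a probability-level consistency term; it is
    zero when both modalities predict the same distribution.
    """
    p = softmax(logits_visible)
    q = softmax(logits_infrared)
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))
```

For example, `cross_modality_similarity([[1.0, 0.0]], [[1.0, 0.0], [0.0, 1.0]])` yields one row with a high score for the aligned infrared feature and a zero score for the orthogonal one, while `symmetric_consistency` grows as the two modalities' predictions diverge.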

Citation

Pages: 4503-4515

Page count: 13
