Visible-thermal person re-identification via multiple center-based constraints

被引：1

作者：

Song, Wanru ^{[1
]}

Wang, Xinyi ^{[1
]}

Chen, Changhong ^{[1
]}

Liu, Feng ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommunicat, Nanjing 210013, Jiangsu, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 82卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Visible-thermal person re-identification; Center-constrained; Grayscale transformation; Loss function;

D O I：

10.1007/s11042-022-14113-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Person re-identification is an important part of the intelligent video analysis and processing system. A new type of surveillance camera can switch to the thermal infrared mode for 24-hour video surveillance. It is necessary to research the visible-thermal cross-modality person re-identification. However, there is a large modal discrepancy in the visible-thermal task. Therefore, the research focus on how to build a bridge to narrow the cross-modality gap and fully exploit the shared information. In this paper, we first employ grayscale transformation of the visible image to generate an intermediate modality, thereby reducing the distance between the original two domains. On this basis, a novel three-branch multiple center-constrained network (TMCC-Net) is built for the visible-thermal task. More specifically, TMCC-Net is a three-branch network that mines the shared information of pedestrians in three modalities through special feature learning and shared feature embedding. In order to obtain better performance, our work introduces two heterogeneous center-constrained losses to constrain the feature embedding. On the one hand, the proposed losses limit the distribution of features at the modality edge; on the other hand, they can strengthen the role of grayscale modality in the cross-modality task. Finally, an end-to-end model for visible-thermal person re-identification is built, which is effective for shared information mining. Extensive experiments are conducted on the two cross-modality datasets, including the SYSU-MM01 and RegDB datasets. The experimental results demonstrate the effectiveness and superiority of the proposed method compared to the state-of-the-art approaches.

引用

页码：18459 / 18481

页数：23

共 46 条

[1] [Anonymous], 2014, ADV COMPUT VIS PATT, DOI DOI 10.1007/978-1-4471-6296-4
[2] Chen W., 2017, IEEE C COMPUTER VISI
[3] TWO-PHASE FEATURE FUSION NETWORK FOR VISIBLE-INFRARED PERSON RE-IDENTIFICATION
Cheng, Yunzhou
Xiao, Guoqiang
Tang, Xiaoqin
Ma, Wenzhuo
Gou, Xinye
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1149 - 1153
[4] Content-based image retrieval system using ORB and SIFT features
Chhabra, Payal
Garg, Naresh Kumar
Kumar, Munish
[J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07) : 2725 - 2733
[5] Choi S, 2020, PROC CVPR IEEE, P10254, DOI 10.1109/CVPR42600.2020.01027
[6] Video Person Re-Identification by Temporal Residual Learning
Dai, Ju
Zhang, Pingping
Wang, Dong
Lu, Huchuan
Wang, Hongyu
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (03) : 1366 - 1377
[7] Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677
[8] Hao Y, 2019, AAAI CONF ARTIF INTE, P8385
[9] Kaur A, 2022, IEEEACM T COMPUTATIO, P1
[10] D-CAD: Deep and Crowded Anomaly Detection
Kumar, Krishan
Kumar, Anurag
Bahuguna, Ayush
[J]. 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT - 2017), 2017, : 100 - 105

← 1 2 3 4 5 →