A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification

被引：4

作者：

Du, Haishun ^{[1
]}

Hao, Xinxin ^{[1
]}

Ye, Yanfang ^{[1
]}

He, Linbing ^{[1
]}

Guo, Jiangtao ^{[1
]}

机构：

[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China

来源：

MACHINE VISION AND APPLICATIONS | 2023年 / 34卷 / 06期

关键词：

Cross-modality visible-infrared person re-identification; Channel interaction enhancement fusion; Camera style-invariant learning; Feature-level adversarial learning strategy; AUGMENTATION; ATTENTION;

D O I：

10.1007/s00138-023-01473-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-modality visible-infrared person re-identification (VI-ReID) aims to match visible and infrared pedestrian images from different cameras in various scenarios. However, most existing VI-ReID methods only focus on eliminating the modality discrepancy while ignoring the intra-class discrepancy caused by different camera styles. In addition, some feature fusion-based VI-ReID methods try to improve the discriminative capability of pedestrian representations by fusing pedestrian features from different convolutional layers or branches. However, most of them only implement feature fusion by simple operations, such as summation or concatenation, and ignore the interaction between different feature maps. To this end, we propose a camera style-invariant learning and channel interaction enhancement fusion network for VI-ReID. In particular, we design a channel interaction enhancement fusion module. It first computes and utilizes the channel-level similarity matrix of two feature maps to obtain two corresponding weighted feature maps that enhance the common concern information of the original two feature maps. Then, it obtains more discriminative pedestrian features by fusing the two weighted feature maps and mining their complementary information. Furthermore, in order to weaken the impact of camera style discrepancy of pedestrian images, we design a camera style-invariant feature-level adversarial learning strategy to ensure that the feature extraction network can extract camera style-invariant pedestrian features by the adversarial learning between the feature extraction network and the camera style classifier. Extensive experimental results on the two benchmark datasets, SYSU-MM01 and RegDB, demonstrate that the performance of CC-Net achieves the recent advanced level.

引用

页数：14

共 71 条

[1] Re-ranking via Metric Fusion for Object Retrieval and Person Re-identification [J].

Bai, Song ;

Tang, Peng ;

Torr, Philip H. S. ;

Latecki, Longin Jan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :740-749

[2] An efficient framework for visible-infrared cross modality person re-identification [J].

Basaran, Emrah ;

Gokmen, Muhittin ;

Kamasak, Mustafa E. .

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 87

[3] Dual-modality hard mining triplet-center loss for visible infrared person re-identification [J].

Cai, Xin ;

Liu, Li ;

Zhu, Lei ;

Zhang, Huaxiang .

KNOWLEDGE-BASED SYSTEMS, 2021, 215

[4] Neural Feature Search for RGB-Infrared Person Re-Identification [J].

Chen, Yehansen ;

Wan, Lin ;

Li, Zhihang ;

Jing, Qianyan ;

Sun, Zongyuan .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :587-597

[5] Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function [J].

Cheng, De ;

Gong, Yihong ;

Zhou, Sanping ;

Wang, Jinjun ;

Zheng, Nanning .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1335-1344

[6] Exploring Cross-Modality Commonalities via Dual-Stream Multi-Branch Network for Infrared-Visible Person Re-Identification [J].

Cheng, Ding ;

Li, Xiaohong ;

Qi, Meibin ;

Liu, Xueliang ;

Chen, Cuiqun ;

Niu, Dawei .

IEEE ACCESS, 2020, 8 :12824-12834

[7] Improving Person Re-identification via Pose-aware Multi-shot Matching [J].

Cho, Yeong-Jun ;

Yoon, Kuk-Jin .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1354-1362

[8] Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification [J].

Choi, Seokeon ;

Lee, Sumin ;

Kim, Youngeun ;

Kim, Taekyung ;

Kim, Changick .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10254-10263

[9]

Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677

[10]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 7 8 →