Shape-Erased Feature Learning for Visible-Infrared Person Re-Identification

被引：74

作者：

Feng, Jiawei ^{[1
]}

Wu, Ancong ^{[1
]}

Zhen, Wei-Shi ^{[1
,2
,3
]}

机构：

[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China

[2] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China

[3] Guangdong Key Lab Informat Secur Technol, Guangzhou, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/CVPR52729.2023.02179

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to the modality gap between visible and infrared images with high visual ambiguity, learning diverse modality-shared semantic concepts for visible-infrared person reidentification (VI-ReID) remains a challenging problem. Body shape is one of the significant modality-shared cues for VI-ReID. To dig more diverse modality-shared cues, we expect that erasing body-shape-related semantic concepts in the learned features can force the ReID model to extract more and other modality-shared features for identification. To this end, we propose shape-erased feature learning paradigm that decorrelates modality-shared features in two orthogonal subspaces. Jointly learning shape-related feature in one subspace and shape-erased features in the orthogonal complement achieves a conditional mutual information maximization between shape-erased feature and identity discarding body shape information, thus enhancing the diversity of the learned representation explicitly. Extensive experiments on SYSU-MM01, RegDB, and HITSZ-VCM datasets demonstrate the effectiveness of our method.

引用

页码：22752 / 22761

页数：10

共 40 条

[1]

Achille A, 2018, J MACH LEARN RES, V19

[2] Information Dropout: Learning Optimal Representations Through Noisy Computation [J].

Achille, Alessandro ;

Soatto, Stefano .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (12) :2897-2905

[3] Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].

Chattopadhay, Aditya ;

Sarkar, Anirban ;

Howlader, Prantik ;

Balasubramanian, Vineeth N. .

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847

[4]

Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677

[5] Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification [J].

Feng, Zhanxiang ;

Lai, Jianhuang ;

Xie, Xiaohua .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :579-590

[6] CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification [J].

Fu, Chaoyou ;

Hu, Yibo ;

Wu, Xiang ;

Shi, Hailin ;

Mei, Tao ;

He, Ran .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :11803-11812

[7]

Ge Y., 2020, INT C LEARN REPR ICL

[8]

Guo Jianyuan, 2019, P IEEE CVF INT C COM, P3

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10]

Hermans A., 2017, DEFENSE TRIPLET LOSS

← 1 2 3 4 →