Identity-Seeking Self-Supervised Representation Learning for Generalizable Person Re-identification

被引：20

作者：

Dou, Zhaopeng ^{[1
,2
]}

Wang, Zhongdao ^{[1
,2
]}

Li, Yali ^{[1
,2
]}

Wang, Shengjin ^{[1
,2
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[2] Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

关键词：

UNSUPERVISED DOMAIN ADAPTATION; NETWORK;

D O I：

10.1109/ICCV51070.2023.01452

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper aims to learn a domain-generalizable (DG) person re-identification (ReID) representation from largescale videos without any annotation. Prior DG ReID methods employ limited labeled data for training due to the high cost of annotation, which restricts further advances. To overcome the barriers of data and annotation, we propose to utilize large-scale unsupervised data for training. The key issue lies in how to mine identity information. To this end, we propose an Identity-seeking Self-supervised Representation learning (ISR) method. ISR constructs positive pairs from inter-frame images by modeling the instance association as a maximum-weight bipartite matching problem. A reliability-guided contrastive loss is further presented to suppress the adverse impact of noisy positive pairs, ensuring that reliable positive pairs dominate the learning process. The training cost of ISR scales approximately linearly with the data size, making it feasible to utilize large-scale data for training. The learned representation exhibits superior generalization ability. Without human annotation and fine-tuning, ISR achieves 87.0% Rank-1 on Market-1501 and 56.4% Rank1 on MSMT17, outperforming the best supervised domaingeneralizable method by 5.0% and 19.5%, respectively. In the pre-training.fine-tuning scenario, ISR achieves stateof-the-art performance, with 88.4% Rank-1 on MSMT17. The code is at https://github.com/dcp15/ISR_ ICCV2023_Oral.

引用

页码：15801 / 15812

页数：12

共 84 条

[1] Hierarchical Connectivity-Centered Clustering for Unsupervised Domain Adaptation on Person Re-Identification [J].

Bai, Yan ;

Wang, Ce ;

Lou, Yihang ;

Liu, Jun ;

Duan, Ling-Yu .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :6715-6729

[2] Mixed High-Order Attention Network for Person Re-Identification [J].

Chen, Binghui ;

Deng, Weihong ;

Hu, Jiani .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :371-381

[3] Self-Critical Attention Learning for Person Re-Identification [J].

Chen, Guangyi ;

Lin, Chunze ;

Ren, Liangliang ;

Lu, Jiwen ;

Zhou, Jie .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9636-9645

[4]

Chen PX, 2021, AAAI CONF ARTIF INTE, V35, P1054

[5] ABD-Net: Attentive but Diverse Person Re-Identification [J].

Chen, Tianlong ;

Ding, Shaojin ;

Xie, Jingyi ;

Yuan, Ye ;

Chen, Wuyang ;

Yang, Yang ;

Ren, Zhou ;

Wang, Zhangyang .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8350-8360

[6]

Chen T, 2020, PR MACH LEARN RES, V119

[7]

Chen Weihua, 2017, CVPR, P2

[8] Exploring Simple Siamese Representation Learning [J].

Chen, Xinlei ;

He, Kaiming .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15745-15753

[9] Meta Batch-Instance Normalization for Generalizable Person Re-Identification [J].

Choi, Seokeon ;

Kim, Taekyung ;

Jeong, Minki ;

Park, Hyoungseob ;

Kim, Changick .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3424-3434

[10] Generalizable Person Re-identification with Relevance-aware Mixture of Experts [J].

Dai, Yongxing ;

Li, Xiaotong ;

Liu, Jun ;

Tong, Zekun ;

Duan, Ling-Yu .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16140-16149

← 1 2 3 4 5 6 7 8 9 →