Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification

被引：136

作者：

Zhao, Yuyang ^{[1
,5
]}

Zhong, Zhun ^{[2
]}

Yang, Fengxiang ^{[1
]}

Luo, Zhiming ^{[1
]}

Lin, Yaojin ^{[4
]}

Li, Shaozi ^{[1
,3
]}

Sebe, Nicu ^{[2
]}

机构：

[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Fujian, Peoples R China

[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy

[3] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Fujian, Peoples R China

[4] Minnan Normal Univ, Zhangzhou, Fujian, Peoples R China

[5] Xiamen Univ, Xiamen, Fujian, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

基金：

中国博士后科学基金;

关键词：

D O I：

10.1109/CVPR46437.2021.00621

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advances in person re-identification (ReID) obtain impressive accuracy in the supervised and unsupervised learning settings. However, most of the existing methods need to train a new model for a new domain by accessing data. Due to public privacy, the new domain data are not always accessible, leading to a limited applicability of these methods. In this paper, we study the problem of multi-source domain generalization in ReID, which aims to learn a model that can perform well on unseen domains with only several labeled source domains. To address this problem, we propose the Memory-based Multi-Source Meta-Learning ((ML)-L-3) framework to train a generalizable model for unseen domains. Specifically, a meta-learning strategy is introduced to simulate the train-test process of domain generalization for learning more generalizable models. To overcome the unstable meta-optimization caused by the parametric classifier, we propose a memory-based identification loss that is non-parametric and harmonizes with meta-learning. We also present a meta batch normalization layer (MetaBN) to diversify meta-test features, further establishing the advantage of meta-learning. Experiments demonstrate that our (ML)-L-3 can effectively enhance the generalization ability of the model for unseen domains and can outperform the state-of-the-art methods on four large-scale ReID datasets.

引用

页码：6273 / 6282

页数：10

共 57 条

[1] Andrychowicz M, 2016, ADV NEUR IN, V29
[2] [Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00632
[3] [Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00345
[4] [Anonymous], 2020, Computer Vision and Pattern Recognition, DOI DOI 10.1109/CVPR42600.2020.00401
[5] [Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00016
[6] [Anonymous], 2018, TOMM, DOI DOI 10.1145/3243316
[7] Balaji Yogesh, 2018, NEURIPS
[8] Chattopadhyay Prithvijit, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12354), P301, DOI 10.1007/978-3-030-58545-7_18
[9] Self-supervised Learning with Geometric Constraints in Monocular Video Connecting Flow, Depth, and Camera
Chen, Yuhua
Schmid, Cordelia
Sminchisescu, Cristian
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7062 - 7071
[10] Finn C, 2017, PR MACH LEARN RES, V70

← 1 2 3 4 5 6 →