Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification

被引:136
作者
Zhao, Yuyang [1 ,5 ]
Zhong, Zhun [2 ]
Yang, Fengxiang [1 ]
Luo, Zhiming [1 ]
Lin, Yaojin [4 ]
Li, Shaozi [1 ,3 ]
Sebe, Nicu [2 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Fujian, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[3] Xiamen Univ, Inst Artificial Intelligence, Xiamen, Fujian, Peoples R China
[4] Minnan Normal Univ, Zhangzhou, Fujian, Peoples R China
[5] Xiamen Univ, Xiamen, Fujian, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
中国博士后科学基金;
关键词
D O I
10.1109/CVPR46437.2021.00621
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in person re-identification (ReID) obtain impressive accuracy in the supervised and unsupervised learning settings. However, most of the existing methods need to train a new model for a new domain by accessing data. Due to public privacy, the new domain data are not always accessible, leading to a limited applicability of these methods. In this paper, we study the problem of multi-source domain generalization in ReID, which aims to learn a model that can perform well on unseen domains with only several labeled source domains. To address this problem, we propose the Memory-based Multi-Source Meta-Learning ((ML)-L-3) framework to train a generalizable model for unseen domains. Specifically, a meta-learning strategy is introduced to simulate the train-test process of domain generalization for learning more generalizable models. To overcome the unstable meta-optimization caused by the parametric classifier, we propose a memory-based identification loss that is non-parametric and harmonizes with meta-learning. We also present a meta batch normalization layer (MetaBN) to diversify meta-test features, further establishing the advantage of meta-learning. Experiments demonstrate that our (ML)-L-3 can effectively enhance the generalization ability of the model for unseen domains and can outperform the state-of-the-art methods on four large-scale ReID datasets.
引用
收藏
页码:6273 / 6282
页数:10
相关论文
共 57 条
  • [1] Andrychowicz M, 2016, ADV NEUR IN, V29
  • [2] [Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00632
  • [3] [Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00345
  • [4] [Anonymous], 2020, Computer Vision and Pattern Recognition, DOI DOI 10.1109/CVPR42600.2020.00401
  • [5] [Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00016
  • [6] [Anonymous], 2018, TOMM, DOI DOI 10.1145/3243316
  • [7] Balaji Yogesh, 2018, NEURIPS
  • [8] Chattopadhyay Prithvijit, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12354), P301, DOI 10.1007/978-3-030-58545-7_18
  • [9] Self-supervised Learning with Geometric Constraints in Monocular Video Connecting Flow, Depth, and Camera
    Chen, Yuhua
    Schmid, Cordelia
    Sminchisescu, Cristian
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7062 - 7071
  • [10] Finn C, 2017, PR MACH LEARN RES, V70