Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive Person Re-identification

Cited by: 126
Authors
Li, Jianing [1]
Zhang, Shiliang [1]
Affiliations
[1] Peking Univ, Sch EE&CS, Dept Comp Sci, Beijing 100871, Peoples R China
Source
COMPUTER VISION - ECCV 2020, PT XXIV | 2020 / Vol. 12369
Funding
Beijing Natural Science Foundation;
Keywords
Domain adaption; Person re-identification; Convolution neural networks;
DOI
10.1007/978-3-030-58586-0_29
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Unsupervised domain adaptive person Re-IDentification (ReID) is challenging because of the large gap between source and target domains and the lack of labeled data on the target domain. This paper tackles this challenge by jointly enforcing visual and temporal consistency through the combination of a local one-hot classification and a global multi-class classification. The local one-hot classification assigns a different person ID to each image in a training batch, then adopts a Self-Adaptive Classification (SAC) model to classify them. The global multi-class classification is achieved by predicting labels on the entire unlabeled training set with the Memory-based Temporal-guided Cluster (MTC). MTC predicts multi-class labels by considering both visual similarity and temporal consistency to ensure the quality of label prediction. The two classification models are combined in a unified framework, which effectively leverages the unlabeled data for discriminative feature learning. Experimental results on three large-scale ReID datasets demonstrate the superiority of the proposed method in both unsupervised and unsupervised domain adaptive ReID tasks. For example, under the unsupervised setting, our method outperforms recent unsupervised domain adaptive methods, which leverage more labels for training.
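The local one-hot classification described in the abstract can be illustrated with a minimal sketch. This is not the authors' SAC model: the abstract does not specify how the classifier is built, so the sketch assumes the classifier weights are simply a second set of features for the same batch (for example, from an augmented view). The function name `local_one_hot_loss`, the `classifier_feats` argument, and the `temperature` value are all hypothetical.

```python
import numpy as np

def local_one_hot_loss(features, classifier_feats, temperature=0.1):
    """Cross-entropy for a batch in which image i receives its own label i.

    Assumption (not from the source): the classifier weights are a second
    set of features for the same images, standing in for the SAC model.
    """
    # L2-normalise both sets so dot products are cosine similarities.
    feats = features / np.linalg.norm(features, axis=1, keepdims=True)
    weights = classifier_feats / np.linalg.norm(
        classifier_feats, axis=1, keepdims=True
    )

    # One distinct person ID per image in the batch (one-hot targets).
    labels = np.arange(feats.shape[0])

    # Similarity of every image to every per-image classifier.
    logits = feats @ weights.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # Average negative log-likelihood of each image's own label.
    loss = -log_probs[labels, labels].mean()
    return labels, loss
```

Calling `local_one_hot_loss(f, f_aug)` on two feature matrices of shape `(batch, dim)` returns the per-image labels and a scalar loss that shrinks as each image becomes most similar to its own classifier.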
Pages: 483-499
Page count: 17