Multi-Pseudo Regularized Label for Generated Data in Person Re-Identification

被引:99
作者
Huang, Yan [1 ]
Xu, Jingsong [1 ]
Wu, Qiang [1 ]
Zheng, Zhedong [2 ]
Zhang, Zhaoxiang [3 ]
Zhang, Jian [1 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Global Big Data Technol Ctr, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, Sch Software, Ctr Artificial Intelligence, Ultimo, NSW 2007, Australia
[3] Chinese Acad Sci, Inst Automat, Res Ctr Brain Inspired Intelligence, CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 100190, Peoples R China
关键词
Person re-identification; generated data; virtual label; semi-supervised learning;
D O I
10.1109/TIP.2018.2874715
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sufficient training data normally is required to train deeply learned models. However, due to the expensive manual process for a labeling large number of images (i.e., annotation), the amount of available training data (i.e., real data) is always limited. To produce more data for training a deep network, generative adversarial network can be used to generate artificial sample data (i.e., generated data). However, the generated data usually does not have annotation labels. To solve this problem, in this paper, we propose a virtual label called Multi-pseudo Regularized Label (MpRL) and assign it to the generated data. With MpRL, the generated data will be used as the supplementary of real training data to train a deep neural network in a semi-supervised learning fashion. To build the corresponding relationship between the real data and generated data, MpRL assigns each generated data a proper virtual label which reflects the likelihood of the affiliation of the generated data to pre-defined training classes in the real data domain. Unlike the traditional label which usually is a single integral number, the virtual label proposed in this paper is a set of weight-based values each individual of which is a number in (0,1] called multi-pseudo label and reflects the degree of relation between each generated data to every pre-defined class of real data. A comprehensive evaluation is carried out by adopting two state-of-the-art convolutional neural networks (CNNs) in our experiments to verify the effectiveness of MpRL. Experiments demonstrate that by assigning MpRL to generated data, we can further improve the person re-ID performance on five re-ID datasets, i.e., Market-1501, DukeMTMC-reID, CUHK03, VIPeR, and CUHK01. The proposed method obtains +6.29%, +6.30%, +5.58%, +5.84%, and +3.48% improvements in rank-1 accuracy over a strong CNN baseline on the five datasets, respectively, and outperforms state-of-the-art methods.
引用
收藏
页码:1391 / 1403
页数:13
相关论文
共 64 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Alec R., 2016, 4 INT C LEARN REPR I, P1
[3]  
Ali S., 2010, ACM Multimedia Conference (ACM MM), P895
[4]  
[Anonymous], PROC CVPR IEEE
[5]  
[Anonymous], 2017, CVPR
[6]  
[Anonymous], 2016, DEEP TRANSFER LEARNI
[7]  
[Anonymous], 2017, ARXIV170107875
[8]  
[Anonymous], P COMP VIS PATT REC
[9]  
[Anonymous], 2016, SEMISUPERVISED LEARN
[10]  
[Anonymous], 2017, CVPR