Multi-Pseudo Regularized Label for Generated Data in Person Re-Identification

被引：99

作者：

Huang, Yan ^{[1
]}

Xu, Jingsong ^{[1
]}

Wu, Qiang ^{[1
]}

Zheng, Zhedong ^{[2
]}

Zhang, Zhaoxiang ^{[3
]}

Zhang, Jian ^{[1
]}

机构：

[1] Univ Technol Sydney, Sch Elect & Data Engn, Global Big Data Technol Ctr, Ultimo, NSW 2007, Australia

[2] Univ Technol Sydney, Sch Software, Ctr Artificial Intelligence, Ultimo, NSW 2007, Australia

[3] Chinese Acad Sci, Inst Automat, Res Ctr Brain Inspired Intelligence, CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 100190, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2019年 / 28卷 / 03期

关键词：

Person re-identification; generated data; virtual label; semi-supervised learning;

D O I：

10.1109/TIP.2018.2874715

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Sufficient training data normally is required to train deeply learned models. However, due to the expensive manual process for a labeling large number of images (i.e., annotation), the amount of available training data (i.e., real data) is always limited. To produce more data for training a deep network, generative adversarial network can be used to generate artificial sample data (i.e., generated data). However, the generated data usually does not have annotation labels. To solve this problem, in this paper, we propose a virtual label called Multi-pseudo Regularized Label (MpRL) and assign it to the generated data. With MpRL, the generated data will be used as the supplementary of real training data to train a deep neural network in a semi-supervised learning fashion. To build the corresponding relationship between the real data and generated data, MpRL assigns each generated data a proper virtual label which reflects the likelihood of the affiliation of the generated data to pre-defined training classes in the real data domain. Unlike the traditional label which usually is a single integral number, the virtual label proposed in this paper is a set of weight-based values each individual of which is a number in (0,1] called multi-pseudo label and reflects the degree of relation between each generated data to every pre-defined class of real data. A comprehensive evaluation is carried out by adopting two state-of-the-art convolutional neural networks (CNNs) in our experiments to verify the effectiveness of MpRL. Experiments demonstrate that by assigning MpRL to generated data, we can further improve the person re-ID performance on five re-ID datasets, i.e., Market-1501, DukeMTMC-reID, CUHK03, VIPeR, and CUHK01. The proposed method obtains +6.29%, +6.30%, +5.58%, +5.84%, and +3.48% improvements in rank-1 accuracy over a strong CNN baseline on the five datasets, respectively, and outperforms state-of-the-art methods.

引用

页码：1391 / 1403

页数：13

共 64 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

Alec R., 2016, 4 INT C LEARN REPR I, P1

[3]

Ali S., 2010, ACM Multimedia Conference (ACM MM), P895

[4]

[Anonymous], PROC CVPR IEEE

[5]

[Anonymous], 2017, CVPR

[6]

[Anonymous], 2016, DEEP TRANSFER LEARNI

[7]

[Anonymous], 2017, ARXIV170107875

[8]

[Anonymous], P COMP VIS PATT REC

[9]

[Anonymous], 2016, SEMISUPERVISED LEARN

[10]

[Anonymous], 2017, CVPR

← 1 2 3 4 5 6 7 →