Recurrent matching networks of spatial alignment learning for person re-identification

被引：0

作者：

Lan Lin

Dan Zhang

Xin Zheng

Mao Ye

Jiuxia Guo

机构：

[1] University of Electronic Science and Technology of China,School of Computer Science and Engineering, Center for Robotics, Key Laboratory for NeuroInformation of Ministry of Education

[2] China West Normal University,undefined

[3] Civil Aviation Flight University of China,undefined

来源：

Multimedia Tools and Applications | 2020年 / 79卷

关键词：

Person re-identification; Uncontrolled spatial misalignment; Local correspondence; Multi-view learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Person re-identification (re-id) usually refers to matching people across disjoint camera views. Many existing methods focus on extracting discriminative features or learning distance metrics to make the intraclass distance smaller than interclass distances. These methods subconsciously assume that pedestrian images are well aligned. However, one major challenge in person re-id is the unconstrained spatial misalignment between image pairs due to view angle changes and pedestrian pose variations. To address this problem, in this paper, we propose Recurrent Matching Network of Spatial Alignment Learning (RMN-SAL) to simulate the human vision perception. Reinforcement learning is introduced to locate attention regions, since it provides a flexible learning strategy for sequential decision-making. A linear mapping is employed to convert the environment state into spatial constraint, comprising spatial alignment into feature learning. And recurrent models are used to extract information from a sequence of corresponding regions. Finally, person re-id is performed based on the global features and the features from the learned alignment regions. Our contributions are: 1) the recurrent matching network, which can subtly combine local feature learning and sequential spatial correspondence learning into an end-to-end framework; 2) the design of a location network, which is based on reinforcement learning and aims to learn task-specific sequential spatial correspondences for different image pairs through the local pairwise internal representation interactions. The proposed model is evaluated on three benchmarks, including Market-1501, DukeMTMC-reID and CUHK03, and achieves better performances than other methods.

引用

页码：33735 / 33755

页数：20

共 107 条

[1] An L(2017)Person re-identification by multi-hypergraph fusion IEEE Trans Neural Netw Learn Syst 28 2763-2774
[2] Chen X(2018)Person re-identification by camera correlation aware feature augmentation IEEE Trans Pattern Anal Mach Intell 40 392-408
[3] Yang S(2016)Combined salience based person re-identification Multimed Tools Appl 75 11,447-11,468
[4] Li X(2012)Learning where to attend with deep architectures for image tracking Neural Comput 24 2151-2184
[5] Chen Y(2010)Object detection with discriminatively trained part-based models IEEE Trans Pattern Anal Mach Intell 32 1627-1645
[6] Zhu X(2017)A person re-identification algorithm based on pyramid color topology feature Multimed Tools Appl 76 26,633-26,646
[7] Zheng W(2017)Super-resolution person re-identification with semi-coupled low-rank discriminant dictionary learning IEEE Trans Image Process 26 1363-1378
[8] Lai J(1996)Reinforcement learning: a survey J Artif Intell Res 4 237-285
[9] Choe G(2017)Structured domain adaptation IEEE Trans Circ Syst Video Technol 27 1700-1713
[10] Yuan C(2018)Transfer independently together: a generalized framework for domain adaptation IEEE Trans Cybern 1 1-12

← 1 2 3 4 5 6 7 8 9 10 →