Multi-scale Deep Learning Architectures for Person Re-identification

被引:219
作者
Qian, Xuelin [1 ]
Fu, Yanwei [2 ,5 ]
Jiang, Yu-Gang [1 ,3 ]
Xiang, Tao [4 ]
Xue, Xiangyang [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Info Proc, Shanghai, Peoples R China
[2] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[3] Tencent AI Lab, Shenzhen, Peoples R China
[4] Queen Marys Univ London, London, England
[5] Univ Technol Sydney, Sydney, NSW, Australia
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.577
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person Re-identification (re-id) aims to match people across non-overlapping camera views in a public space. It is a challenging problem because many people captured in surveillance videos wear similar clothes. Consequently, the differences in their appearance are often subtle and only detectable at the right location and scales. Existing re-id models, particularly the recently proposed deep learning based ones match people at a single scale. In contrast, in this paper, a novel multi-scale deep learning model is proposed. Our model is able to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for matching. The importance of different spatial locations for extracting discriminative features is also learned explicitly. Experiments are carried out to demonstrate that the proposed model outperforms the state-of-the art on a number of benchmarks.
引用
收藏
页码:5409 / 5418
页数:10
相关论文
共 61 条
[1]  
[Anonymous], IEEE TCSVT
[2]  
[Anonymous], 2016, CORR
[3]  
[Anonymous], 2015, CORR
[4]  
[Anonymous], P IEEE INT C COMP VI
[5]  
[Anonymous], IEEE TIP
[6]  
[Anonymous], 2015, ELECT J DIFFERENTIAL, DOI [DOI 10.1155/2015/786396, DOI 10.1186/S40529-015-0086-6]
[7]  
[Anonymous], IEEE TPAMI
[8]  
[Anonymous], 2015, CVPR
[9]  
[Anonymous], Rethinking the Inception Architecture for Computer Vision
[10]  
[Anonymous], PATTERN RECOGNITION