RGB-Infrared Cross-Modality Person Re-Identification

被引:598
作者
Wu, Ancong [1 ]
Zheng, Wei-Shi [2 ,5 ,6 ]
Yu, Hong-Xing [2 ]
Gong, Shaogang [4 ]
Lai, Jianhuang [2 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[3] Guangdong Prov Key Lab Informat Secur, Guangzhou, Guangdong, Peoples R China
[4] Queen Mary Univ London, London, England
[5] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Beijing, Peoples R China
[6] NUDT, Collaborat Innovat Ctr High Performance Comp, Changsha, Hunan, Peoples R China
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person re-identification (Re-ID) is an important problem in video surveillance, aiming to match pedestrian images across camera views. Currently, most works focus on RGB-based Re-ID. However, in some applications, RGB images are not suitable, e.g. in a dark environment or at night. Infrared (IR) imaging becomes necessary in many visual systems. To that end, matching RGB images with infrared images is required, which are heterogeneous with very different visual characteristics. For person Re-ID, this is a very challenging cross-modality problem that has not been studied so far. In this work, we address the RGB-IR cross-modality Re-ID problem and contribute a new multiple modality Re-ID dataset named SYSU-MM01, including RGB and IR images of 491 identities from 6 cameras, giving in total 287,628 RGB images and 15,792 IR images. To explore the RGB-IR Re-ID problem, we evaluate existing popular cross-domain models, including three commonly used neural network structures (one-stream, two-stream and asymmetric FC layer) and analyse the relation between them. We further propose deep zero-padding for training one-stream network towards automatically evolving domain-specific nodes in the network for cross-modality matching. Our experiments show that RGB-IR cross-modality matching is very challenging but still feasible using the proposed model with deep zero-padding, giving the best performance. Our dataset is available at http://isee.sysu.edu.cn/project/RGBIRReID.htm.
引用
收藏
页码:5390 / 5399
页数:10
相关论文
共 60 条
[1]  
[Anonymous], 2013, CVPR
[2]  
[Anonymous], 2016, CVPR
[3]  
[Anonymous], BMVC
[4]  
[Anonymous], 2012, AS C COMP VIS ACCV
[5]  
[Anonymous], ICPR
[6]  
[Anonymous], IEEE TIFS
[7]  
[Anonymous], 2013, P NIPS
[8]  
[Anonymous], 2016, CVPR
[9]  
[Anonymous], IEEE TPAMI
[10]  
[Anonymous], 2014, P 28 INT C NEUR INF