Multi-Level Common Space Learning for Person Re-Identification

被引:11
作者
An, Le [1 ]
Qin, Zhen [2 ]
Chen, Xiaojing [2 ]
Yang, Songfan [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, Natl Key Lab Sci & Technol Multispectral Informat, Wuhan 430074, Hubei, Peoples R China
[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[3] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610064, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; surveillance; multi-level common space; group sparse representation; PEDESTRIAN RECOGNITION; FACE RECOGNITION; CLASSIFICATION; NETWORK;
D O I
10.1109/TCSVT.2017.2680118
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Matching people in different cameras, commonly referred to as person re-identification, is a challenging task. The challenges come from the drastic appearance variation across different camera views caused by changes in pose, lighting condition, occlusion, background, and so on. Instead of matching images in the original feature space, many existing methods learn distance metrics or feature transformations to improve the matching accuracy. For example, data from different camera views can be projected onto their common space to mitigate the feature gap caused by view discrepancy. However, a single-step mapping may not be sufficient to close this gap as the view difference can be significant between different cameras. To overcome this limitation, we propose a multi-level common space learning framework, which can gradually minimize the data discrepancy between different views in an iterative manner. At each intermediate level, synthetic data are generated using the automatically discovered grouping information, and these synthetic data can be viewed as a transitional state between the original camera views. The matching is performed by mapping the probe and gallery onto their common space in multiple steps. We evaluate the proposed method on two widely used person re-identification data sets. The results show that the proposed multi-level scheme significantly outperforms single-level mapping. In addition, competitive results are achieved as compared with over 20 state-of-the-art techniques.
引用
收藏
页码:1777 / 1787
页数:11
相关论文
共 60 条
  • [1] An L., 2013, DISTR SMART CAM ICDS, P1
  • [2] Sparse representation matching for person re-identification
    An, Le
    Chen, Xiaojing
    Yang, Songfan
    Bhanu, Bir
    [J]. INFORMATION SCIENCES, 2016, 355 : 74 - 89
  • [3] Person Reidentification With Reference Descriptor
    An, Le
    Kafai, Mehran
    Yang, Songfan
    Bhanu, Bir
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 776 - 787
  • [4] Person Re-Identification by Robust Canonical Correlation Analysis
    An, Le
    Yang, Songfan
    Bhanu, Bir
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (08) : 1103 - 1107
  • [5] Dynamic Bayesian Network for Unconstrained Face Recognition in Surveillance Camera Networks
    An, Le
    Kafai, Mehran
    Bhanu, Bir
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2013, 3 (02) : 155 - 164
  • [6] [Anonymous], 2013, COMPUTER VISION ACCV, DOI 10.1007/978-3-642-37331-23
  • [7] [Anonymous], 2012, 2012 IEEE COMPUTER S
  • [8] [Anonymous], 2007, P IEEE INT WORKSH PE
  • [9] Bak S, 2014, 2014 11TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P175, DOI 10.1109/AVSS.2014.6918664
  • [10] Bak S, 2012, LECT NOTES COMPUT SC, V7574, P806, DOI 10.1007/978-3-642-33712-3_58