Multi-Level Factorisation Net for Person Re-Identification

Cited by: 408
Authors
Chang, Xiaobin [1]
Hospedales, Timothy M. [2]
Xiang, Tao [1]
Affiliations
[1] Queen Mary Univ London, London, England
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
Source
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | 2018
Keywords
DOI
10.1109/CVPR.2018.00225
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Key to effective person re-identification (Re-ID) is modelling discriminative and view-invariant factors of person appearance at both high and low semantic levels. Recently developed deep Re-ID models learn a holistic feature representation at a single semantic level and/or require laborious human annotation of these factors as attributes. We propose Multi-Level Factorisation Net (MLFN), a novel network architecture that factorises the visual appearance of a person into latent discriminative factors at multiple semantic levels without manual annotation. MLFN is composed of multiple stacked blocks. Each block contains multiple factor modules that model latent factors at a specific level, and factor selection modules that dynamically select the factor modules used to interpret the content of each input image. The outputs of the factor selection modules also provide a compact latent factor descriptor that is complementary to the conventional deeply learned features. MLFN achieves state-of-the-art results on three Re-ID datasets, as well as compelling results on the CIFAR-100 dataset for general object categorisation.
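The block structure described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the abstract does not say what a factor module computes, how the selection module scores the modules, or how their outputs are combined. The sketch assumes linear factor modules with ReLU, sigmoid gates, and a gate-weighted sum; the names `Block` and `forward_stack` are hypothetical.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def linear(weights, x):
    """Multiply a weight matrix (list of rows) by the vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


class Block:
    """Hypothetical block: several factor modules plus one selection module."""

    def __init__(self, dim, num_factors, rng):
        def rand(rows, cols):
            return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

        # Assumption: each factor module is a single dim x dim linear map.
        self.factors = [rand(dim, dim) for _ in range(num_factors)]
        # Assumption: the selection module maps the input to one score per module.
        self.selector = rand(num_factors, dim)

    def forward(self, x):
        # Input-dependent gates decide how much each factor module contributes.
        gates = [sigmoid(s) for s in linear(self.selector, x)]
        out = [0.0] * len(x)
        for gate, weights in zip(gates, self.factors):
            for i, v in enumerate(linear(weights, x)):
                out[i] += gate * max(v, 0.0)  # ReLU, then weight by the gate
        return out, gates


def forward_stack(blocks, x):
    """Pass x through stacked blocks and collect all gates as a descriptor."""
    descriptor = []
    for block in blocks:
        x, gates = block.forward(x)
        descriptor.extend(gates)
    return x, descriptor


rng = random.Random(0)
blocks = [Block(dim=4, num_factors=3, rng=rng) for _ in range(2)]
features, descriptor = forward_stack(blocks, [0.5, -0.2, 0.1, 0.9])
print(len(features), len(descriptor))  # prints "4 6"
```

Here `features` stands in for the conventional deep feature and `descriptor` for the compact latent factor descriptor built from the selection outputs: two blocks with three factor modules each give six gate values.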
Pages: 2109-2118
Number of pages: 10