Multi-Level Factorisation Net for Person Re-Identification

被引：408

作者：

Chang, Xiaobin ^{[1
]}

Hospedales, Timothy M. ^{[2
]}

Xiang, Tao ^{[1
]}

机构：

[1] Queen Mary Univ London, London, England

[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00225

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Key to effective person re-identification (Re-ID) is modelling discriminative and view-invariant factors of person appearance at both high and low semantic levels. Recently developed deep Re-ID models either learn a holistic single semantic level feature representation and/or require laborious human annotation of these factors as attributes. We propose Multi-Level Factorisation Net (MLFN), a novel network architecture that factorises the visual appearance of a person into latent discriminative factors at multiple semantic levels without manual annotation. MLFN is composed of multiple stacked blocks. Each block contains multiple factor modules to model latent factors at a specific level, and factor selection modules that dynamically select the factor modules to interpret the content of each input image. The outputs of the factor selection modules also provide a compact latent factor descriptor that is complementary to the conventional deeply learned features. MLFN achieves state-of-the-art results on three Re-ID datasets, as well as compelling results on the general object categorisation CIFAR-100 dataset.

引用

页码：2109 / 2118

页数：10

共 57 条

[41]

Szegedy C, 2014, Arxiv, DOI arXiv:1312.6199

[42] Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples [J].

Wang, Jun ;

Kothalkar, Prasanna V. ;

Kim, Myungjong ;

Bandini, Andrea ;

Cao, Beiming ;

Yunusova, Yana ;

Campbell, Thomas F. ;

Heitzman, Daragh ;

Green, Jordan R. .

INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2018, 20 (06) :669-679

[43] Joint Detection and Identification Feature Learning for Person Search [J].

Xiao, Tong ;

Li, Shuang ;

Wang, Bochao ;

Lin, Liang ;

Wang, Xiaogang .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3376-3385

[44] Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification [J].

Xiao, Tong ;

Li, Hongsheng ;

Ouyang, Wanli ;

Wang, Xiaogang .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1249-1258

[45]

Xie S, 2016, CVPR

[46]

Xie SN, 2015, IEEE I CONF COMP VIS, P1395, DOI [10.1109/ICCV.2015.164, 10.1007/s11263-017-1004-z]

[47]

Yanbei C., 2017, ICCV WORKSH

[48] Multi-scale recognition with DAG-CNNs [J].

Yang, Songfan ;

Ramanan, Deva .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1215-1223

[49]

Yu R., 2017, BMVC

[50] Exploiting the complementary strengths of multi-layer CNN features for image retrieval [J].

Yu, Wei ;

Yang, Kuiyuan ;

Yao, Hongxun ;

Sun, Xiaoshuai ;

Xu, Pengfei .

NEUROCOMPUTING, 2017, 237 :235-241

← 1 2 3 4 5 6 →