Beyond Scalar Neuron: Adopting Vector-Neuron Capsules for Long-Term Person Re-Identification

被引：85

作者：

Huang, Yan ^{[1
]}

Xu, Jingsong ^{[1
]}

Wu, Qiang ^{[1
]}

Zhong, Yi ^{[2
]}

Zhang, Peng ^{[1
]}

Zhang, Zhaoxiang ^{[3
]}

机构：

[1] Univ Technol Sydney, Sch Elect & Data Engn, Global Big Data Technol Ctr GBDTC, Ultimo, NSW 2007, Australia

[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

[3] Chinese Acad Sci, Res Ctr Brain Inspired Intelligence, CAS Ctr Excellence Brain Sci & Intelligence Techn, Inst Automat, Beijing 100190, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2020年 / 30卷 / 10期

关键词：

Cameras; Surveillance; Neurons; Internet; Security; Lighting; Face; Person re-identification; long-term scenario; cloth change; vector-neuron capsules; TRANSFORMATION; NETWORK;

D O I：

10.1109/TCSVT.2019.2948093

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Current person re-identification (re-ID) works mainly focus on the short-term scenario where a person is less likely to change clothes. However, in the long-term re-ID scenario, a person has a great chance to change clothes. A sophisticated re-ID system should take such changes into account. To facilitate the study of long-term re-ID, this paper introduces a large-scale re-ID dataset called "Celeb-reID" to the community. Unlike previous datasets, the same person can change clothes in the proposed Celeb-reID dataset. Images of Celeb-reID are acquired from the Internet using street snap-shots of celebrities. There is a total of 1,052 IDs with 34,186 images making Celeb-reID being the largest long-term re-ID dataset so far. To tackle the challenge of cloth changes, we propose to use vector-neuron (VN) capsules instead of the traditional scalar neurons (SN) to design our network. Compared with SN, one extra-dimensional information in VN can perceive cloth changes of the same person. We introduce a well-designed ReIDCaps network and integrate capsules to deal with the person re-ID task. Soft Embedding Attention (SEA) and Feature Sparse Representation (FSR) mechanisms are adopted in our network for performance boosting. Experiments are conducted on the proposed long-term re-ID dataset and two common short-term re-ID datasets. Comprehensive analyses are given to demonstrate the challenge exposed in our datasets. Experimental results show that our ReIDCaps can outperform existing state-of-the-art methods by a large margin in the long-term scenario. The new dataset and code will be released to facilitate future researches.

引用

页码：3459 / 3471

页数：13

共 34 条

[1]

[Anonymous], 2018, ARXIV180404241

[2]

[Anonymous], 2018, ROUTL HANDBK

[3]

Barbosa IB, 2012, LECT NOTES COMPUT SC, V7583, P433, DOI 10.1007/978-3-642-33863-2_43

[4] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[5] Multi-Level Factorisation Net for Person Re-Identification [J].

Chang, Xiaobin ;

Hospedales, Timothy M. ;

Xiang, Tao .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2109-2118

[6]

Gong S, 2014, ADV COMPUT VIS PATT, P1, DOI 10.1007/978-1-4471-6296-4

[7] Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features [J].

Gray, Douglas ;

Tao, Hai .

COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 :262-275

[8] Recurrent Attention Models for Depth-Based Person Identification [J].

Haque, Albert ;

Alahi, Alexandre ;

Li Fei-Fei .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1229-1238

[9]

He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]

[10]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]

← 1 2 3 4 →