Beyond Scalar Neuron: Adopting Vector-Neuron Capsules for Long-Term Person Re-Identification

被引:72
作者
Huang, Yan [1 ]
Xu, Jingsong [1 ]
Wu, Qiang [1 ]
Zhong, Yi [2 ]
Zhang, Peng [1 ]
Zhang, Zhaoxiang [3 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Global Big Data Technol Ctr GBDTC, Ultimo, NSW 2007, Australia
[2] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[3] Chinese Acad Sci, Res Ctr Brain Inspired Intelligence, CAS Ctr Excellence Brain Sci & Intelligence Techn, Inst Automat, Beijing 100190, Peoples R China
关键词
Cameras; Surveillance; Neurons; Internet; Security; Lighting; Face; Person re-identification; long-term scenario; cloth change; vector-neuron capsules; TRANSFORMATION; NETWORK;
D O I
10.1109/TCSVT.2019.2948093
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Current person re-identification (re-ID) works mainly focus on the short-term scenario where a person is less likely to change clothes. However, in the long-term re-ID scenario, a person has a great chance to change clothes. A sophisticated re-ID system should take such changes into account. To facilitate the study of long-term re-ID, this paper introduces a large-scale re-ID dataset called "Celeb-reID" to the community. Unlike previous datasets, the same person can change clothes in the proposed Celeb-reID dataset. Images of Celeb-reID are acquired from the Internet using street snap-shots of celebrities. There is a total of 1,052 IDs with 34,186 images making Celeb-reID being the largest long-term re-ID dataset so far. To tackle the challenge of cloth changes, we propose to use vector-neuron (VN) capsules instead of the traditional scalar neurons (SN) to design our network. Compared with SN, one extra-dimensional information in VN can perceive cloth changes of the same person. We introduce a well-designed ReIDCaps network and integrate capsules to deal with the person re-ID task. Soft Embedding Attention (SEA) and Feature Sparse Representation (FSR) mechanisms are adopted in our network for performance boosting. Experiments are conducted on the proposed long-term re-ID dataset and two common short-term re-ID datasets. Comprehensive analyses are given to demonstrate the challenge exposed in our datasets. Experimental results show that our ReIDCaps can outperform existing state-of-the-art methods by a large margin in the long-term scenario. The new dataset and code will be released to facilitate future researches.
引用
收藏
页码:3459 / 3471
页数:13
相关论文
共 34 条
[1]  
[Anonymous], 2017, ARXIV171108106
[2]  
[Anonymous], 2018, ROUTL HANDBK
[3]  
Barbosa IB, 2012, LECT NOTES COMPUT SC, V7583, P433, DOI 10.1007/978-3-642-33863-2_43
[4]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[5]   Multi-Level Factorisation Net for Person Re-Identification [J].
Chang, Xiaobin ;
Hospedales, Timothy M. ;
Xiang, Tao .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2109-2118
[6]  
Gong S, 2014, ADV COMPUT VIS PATT, P1, DOI 10.1007/978-1-4471-6296-4
[7]   Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features [J].
Gray, Douglas ;
Tao, Hai .
COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 :262-275
[8]   Recurrent Attention Models for Depth-Based Person Identification [J].
Haque, Albert ;
Alahi, Alexandre ;
Li Fei-Fei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1229-1238
[9]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[10]  
Hermans A., 2017, ARXIV170307737CS