Spatiotemporal Feature Extraction for Pedestrian Re-identification

被引：3

作者：

Li, Ye ^{[1
]}

Yin, Guangqiang ^{[1
]}

Hou, Shaoqi ^{[1
]}

Cui, Jianhai ^{[2
]}

Huang, Zicheng ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[2] Peoples Publ Secur Univ China, Beijing, Peoples R China

来源：

WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2019 | 2019年 / 11604卷

关键词：

ReID; Spatiotemporal feature; Mixed convolution; Non-local block; SELECTION;

D O I：

10.1007/978-3-030-23597-0_15

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video-based person re-identification (ReID) is a problem of person retrieval that aims to match the same person in two different videos, which has gradually entered the arena of public security. The system generally involve three important parts: feature extraction, feature aggregation and loss function. Pedestrian feature extraction and aggregation are critical steps in this field. Most of the previous studies concentrate on designing various feature extractors. However, these extractors cannot effectively extract spatiotemporal information. In this paper, several spatiotemporal convolution blocks were proposed to optimize the feature extraction model of person Re-identification. Firstly, 2D convolution and 3D convolution are simultaneously used on video volume to extract spatiotemporal feature. Secondly, non-local block is embedded into ResNet3D-50 to capture long-range dependencies. As a result, the proposed model could learn the inner link of pedestrian action in a video. Experimental results on MARS dataset show that our model has achieved significant progress compared to state-of-the-art methods.

引用

页码：188 / 200

页数：13

共 36 条

[1] [Anonymous], 2017, DEEPLY LEARNED PART
[2] [Anonymous], 2016, CoRR abs/1512.00567, DOI DOI 10.1109/CVPR.2016.308
[3] [Anonymous], 2016, P INT C ID INF KNOWL
[4] Bazzani L., 2012, DECENTRALIZED PARTIC
[5] A non-local algorithm for image denoising
Buades, A
Coll, B
Morel, JM
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 60 - 65
[6] Selecting dissimilar genes for multi-class classification, an application in cancer subtyping
Cai, Zhipeng
Goebel, Randy
Salavatipour, Mohammad R.
Lin, Guohui
[J]. BMC BIOINFORMATICS, 2007, 8 (1)
[7] Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function
Cheng, De
Gong, Yihong
Zhou, Sanping
Wang, Jinjun
Zheng, Nanning
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1335 - 1344
[8] A Two Stream Siamese Convolutional Neural Network For Person Re-Identification
Chung, Dahjung
Tahboub, Khalid
Delp, Edward J.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1992 - 2000
[9] Courtney PG, 2015, IEEE COMP SEMICON
[10] Gao Jinyang., 2018, CoRR

← 1 2 3 4 →