Spatiotemporal Feature Extraction for Pedestrian Re-identification

被引:3
作者
Li, Ye [1 ]
Yin, Guangqiang [1 ]
Hou, Shaoqi [1 ]
Cui, Jianhai [2 ]
Huang, Zicheng [2 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Peoples Publ Secur Univ China, Beijing, Peoples R China
来源
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2019 | 2019年 / 11604卷
关键词
ReID; Spatiotemporal feature; Mixed convolution; Non-local block; SELECTION;
D O I
10.1007/978-3-030-23597-0_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video-based person re-identification (ReID) is a problem of person retrieval that aims to match the same person in two different videos, which has gradually entered the arena of public security. The system generally involve three important parts: feature extraction, feature aggregation and loss function. Pedestrian feature extraction and aggregation are critical steps in this field. Most of the previous studies concentrate on designing various feature extractors. However, these extractors cannot effectively extract spatiotemporal information. In this paper, several spatiotemporal convolution blocks were proposed to optimize the feature extraction model of person Re-identification. Firstly, 2D convolution and 3D convolution are simultaneously used on video volume to extract spatiotemporal feature. Secondly, non-local block is embedded into ResNet3D-50 to capture long-range dependencies. As a result, the proposed model could learn the inner link of pedestrian action in a video. Experimental results on MARS dataset show that our model has achieved significant progress compared to state-of-the-art methods.
引用
收藏
页码:188 / 200
页数:13
相关论文
共 36 条
  • [1] [Anonymous], 2017, DEEPLY LEARNED PART
  • [2] [Anonymous], 2016, CoRR abs/1512.00567, DOI DOI 10.1109/CVPR.2016.308
  • [3] [Anonymous], 2016, P INT C ID INF KNOWL
  • [4] Bazzani L., 2012, DECENTRALIZED PARTIC
  • [5] A non-local algorithm for image denoising
    Buades, A
    Coll, B
    Morel, JM
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 60 - 65
  • [6] Selecting dissimilar genes for multi-class classification, an application in cancer subtyping
    Cai, Zhipeng
    Goebel, Randy
    Salavatipour, Mohammad R.
    Lin, Guohui
    [J]. BMC BIOINFORMATICS, 2007, 8 (1)
  • [7] Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function
    Cheng, De
    Gong, Yihong
    Zhou, Sanping
    Wang, Jinjun
    Zheng, Nanning
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1335 - 1344
  • [8] A Two Stream Siamese Convolutional Neural Network For Person Re-Identification
    Chung, Dahjung
    Tahboub, Khalid
    Delp, Edward J.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1992 - 2000
  • [9] Courtney PG, 2015, IEEE COMP SEMICON
  • [10] Gao Jinyang., 2018, CoRR