Weighted bilinear coding over salient body parts for person re-identification

被引:10
作者
Chang, Zhigang [1 ]
Qin, Zhou [1 ,5 ]
Fan, Heng [2 ]
Su, Hang [4 ]
Yang, Hua [1 ]
Zheng, Shibao [1 ]
Ling, Haibin [2 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Proc & Network Engn, Shanghai 200240, Peoples R China
[2] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
[3] South China Univ Technol, Comp Sci & Engn, Guangzhou 510006, Peoples R China
[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[5] Alibaba Cloud, Artificial Intelligence Ctr City Brain, Hangzhou 311100, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Feature aggregation; Bilinear coding; NETWORK;
D O I
10.1016/j.neucom.2020.05.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks (CNNs) have demonstrated dominant performance in person re -identification (Re-ID). Existing CNN based methods utilize global average pooling (GAP) to aggregate intermediate convolutional features for Re-ID. However, this strategy only considers the first-order statis-tics of local features and treats local features at different locations equally important, leading to sub-optimal feature representation. To deal with these issues, we propose a novel weighted bilinear coding (WBC) framework for local feature aggregation in CNN networks to pursue more representative and dis-criminative feature representations, which can adapt to other advanced methods and improve their per-formance. In specific, bilinear coding is used to encode the channel-wise feature correlations to capture richer feature interactions. Meanwhile, a weighting scheme is applied on the bilinear coding to adaptively adjust the weights of local features at different locations based on their importance in recognition, further improving the discriminability of feature aggregation. To handle the spatial misalignment issue, we use a salient part net to derive salient body parts, and apply the WBC model on each part. The final represen-tation, formed by concatenating the WBC encoded features of each part, is both discriminative and resis-tant to spatial misalignment. Experiments on three benchmarks including Market-1501, DukeMTMC-reID and CUHK03 evidence the favorable performance of our method against other outstanding methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:454 / 464
页数:11
相关论文
共 56 条
  • [1] [Anonymous], 2017, ICCV
  • [2] [Anonymous], 2017, CVPR
  • [3] [Anonymous], 2017, ICCV
  • [4] [Anonymous], 2015, 3 INT C LEARN REPR I
  • [5] [Anonymous], 2017, CVPR
  • [6] [Anonymous], 2014, ECCV
  • [7] Carreira J, 2012, LECT NOTES COMPUT SC, V7578, P430, DOI 10.1007/978-3-642-33786-4_32
  • [8] Multi-Level Factorisation Net for Person Re-Identification
    Chang, Xiaobin
    Hospedales, Timothy M.
    Xiang, Tao
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2109 - 2118
  • [9] Similarity Learning with Spatial Constraints for Person Re-identification
    Chen, Dapeng
    Yuan, Zejian
    Chen, Badong
    Zheng, Nanning
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1268 - 1277
  • [10] Person Re-Identification by Deep Learning Multi-Scale Representations
    Chen, Yanbei
    Zhu, Xiatian
    Gong, Shaogang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2590 - 2600