Scalable Person Re-Identification by Harmonious Attention

被引:24
作者
Li, Wei [1 ]
Zhu, Xiatian [2 ]
Gong, Shaogang [1 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Vis Semant Ltd, London E1 4NS, England
基金
“创新英国”项目;
关键词
Person re-identification; Scalable search; Compact model; Attention learning; Local and global representation learning; GLOBAL FEATURES;
D O I
10.1007/s11263-019-01274-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing person re-identification (re-id) deep learning methods rely heavily on the utilisation of large and computationally expensive convolutional neural networks. They are therefore not scalable to large scale re-id deployment scenarios with the need of processing a large amount of surveillance video data, due to the lengthy inference process with high computing costs. In this work, we address this limitation via jointly learning re-id attention selection. Specifically, we formulate a novel harmonious attention network (HAN) framework to jointly learn soft pixel attention and hard region attention alongside simultaneous deep feature representation learning, particularly enabling more discriminative re-id matching by efficient networks with more scalable model inference and feature matching. Extensive evaluations validate the cost-effectiveness superiority of the proposed HAN approach for person re-id against a wide variety of state-of-the-art methods on four large benchmark datasets: CUHK03, Market-1501, DukeMTMC, and MSMT17.
引用
收藏
页码:1635 / 1653
页数:19
相关论文
共 111 条
  • [11] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [12] Cormen T. H., 2009, Introduction to algorithms, VThird
  • [13] Courbariaux M, 2015, ADV NEUR IN, V28
  • [14] Denil M., 2013, Advances in Neural Information Processing Systems (NIPS), V26
  • [15] Dong XW, 2018, IEEE CONF COMPUT
  • [16] Representation is representation of similarities
    Edelman, S
    [J]. BEHAVIORAL AND BRAIN SCIENCES, 1998, 21 (04) : 449 - +
  • [17] Evgeniou Theodoros, 2004, P 10 ACM SIGKDD INT, P109, DOI DOI 10.1145/1014052.1014067
  • [18] SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
    Faraone, Julian
    Fraser, Nicholas
    Blott, Michaela
    Leong, Philip H. W.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4300 - 4309
  • [19] Spatially Adaptive Computation Time for Residual Networks
    Figurnov, Michael
    Collins, Maxwell D.
    Zhu, Yukun
    Zhang, Li
    Huang, Jonathan
    Vetrov, Dmitry
    Salakhutdinov, Ruslan
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1790 - 1799
  • [20] Geng CH, 2018, IEEE INT CONF COMM