Multi-scale local-global architecture for person re-identification

被引:2
作者
Liu, Jing [1 ,2 ]
Tiwari, Prayag [3 ]
Tri Gia Nguyen [4 ]
Gupta, Deepak [5 ]
Band, Shahab S. [6 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Weinan Normal Univ, Sch Comp Sci, Weinan 714099, Peoples R China
[3] Aalto Univ, Dept Comp Sci, Espoo, Finland
[4] FPT Univ, Danang 50509, Vietnam
[5] Maharaja Agrasen Inst Technol, Delhi, India
[6] Natl Yunlin Univ Sci & Technol, Future Technol Res Ctr, 123 Univ Rd,Sect 3, Touliu 64002, Yunlin, Taiwan
基金
欧盟地平线“2020”; 芬兰科学院;
关键词
Person re-identification; Multi-scale local-global architecture; Attention mechanism; Deep learning; ATTENTION; NETWORK;
D O I
10.1007/s00500-022-06859-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the emergence of deep learning method, which has been driven a great success for the field of person re-identification (re-ID). However, the existing works mainly focus on first-order attention (i.e., spatial and channels attention) statistics to model the valuable information for person re-ID. On the other hand, most existing methods operate data points respectively, which ignores discriminative patterns to some extent. In this paper, we present an automated framework named multi-scale local-global for person re-ID. The framework consists of two components. The first component is that a high-order attention module is adopted to learn high-order attention patterns to model the subtle differences among pedestrians and to generate the informative attention features. On the other hand, a novel architecture named spectral feature transformation is designed to make for the optimization of group wise similarities. Furthermore, we fuse the components together to form an ensemble model for person re-ID. Extensive experiments were conducted on the three benchmark datasets, i.e., Market-1501, DukeMTMC-reID, CUHK03, showing the superiority of the proposed method.
引用
收藏
页码:7967 / 7977
页数:11
相关论文
共 61 条
[1]  
[Anonymous], 2018, Spectralnet: Spectral clustering using deep neural networks
[2]  
[Anonymous], 2014, NEURAL INFORM PROCES
[3]   Multi-Level Factorisation Net for Person Re-Identification [J].
Chang, Xiaobin ;
Hospedales, Timothy M. ;
Xiang, Tao .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2109-2118
[4]   Mixed High-Order Attention Network for Person Re-Identification [J].
Chen, Binghui ;
Deng, Weihong ;
Hu, Jiani .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :371-381
[5]   Group Consistent Similarity Learning via Deep CRF for Person Re-Identification [J].
Chen, Dapeng ;
Xu, Dan ;
Li, Hongsheng ;
Sebe, Nicu ;
Wang, Xiaogang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8649-8658
[6]   SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J].
Chen, Long ;
Zhang, Hanwang ;
Xiao, Jun ;
Nie, Liqiang ;
Shao, Jian ;
Liu, Wei ;
Chua, Tat-Seng .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6298-6306
[7]  
Ding F, 2021, IEEE T MULTIMED
[8]   Perceptual Enhancement for Autonomous Vehicles: Restoring Visually Degraded Images for Context Prediction via Adversarial Training [J].
Ding, Feng ;
Yu, Keping ;
Gu, Zonghua ;
Li, Xiangjun ;
Shi, Yunqing .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) :9430-9441
[9]   Deep-Learning-Empowered Digital Forensics for Edge Consumer Electronics in 5G HetNets [J].
Ding, Feng ;
Zhu, Guopu ;
Alazab, Mamoun ;
Li, Xiangjun ;
Yu, Keping .
IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (02) :42-50
[10]  
Donath WE, 2003, LOWER BOUNDS PARTITI, P437