GPAN-PS: Global-Response Pedestrian Attention Network for End-to-End Person Search

Cited by: 0

Authors:

Zheng, Linlin [1]

Han, Dezhi [1]

Xin, Xiaoqi [1]

Affiliations:

[1] Shanghai Maritime Univ, Sch Informat Engn, Shanghai 201306, Peoples R China

Source:

IEEE ACCESS | 2024 / Vol. 12

Funding:

National Natural Science Foundation of China; Natural Science Foundation of Shanghai;

Keywords:

Feature extraction; Pedestrians; Transformers; Head; Accuracy; Deformable models; Attention mechanisms; Residual neural networks; Adaptation models; Search problems; Person Search; pedestrian attention; person detection; person re-identification; EXTRACTION;

DOI:

10.1109/ACCESS.2024.3487235

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Person search, which involves identifying target pedestrians in extensive galleries through person detection and re-identification, has experienced significant advancements across various applications. However, it remains a challenging research area due to factors such as appearance changes, lighting variations, background interference, and pedestrian occlusion. This paper proposes an end-to-end person search framework, termed the Global-Response Pedestrian Attention Network (GPAN-PS), designed to tackle these challenges. Specifically, GPAN-PS includes a novel Global Response Pedestrian Attention (GRPA) module that samples pedestrian features using three shared-weight convolutional layers with distinct dilation rates. This enables the network to adaptively select the optimal receptive field through the Squeeze-and-Excitation (SE) module and the Global Response Normalization (GRN) module, enhancing feature stability. Furthermore, we design a GsConvNeXt Head module to bolster feature expressiveness and facilitate inter-channel information interaction. Rather than employing the ConvNeXt (conv5) module as the Box Head for generating refined proposals, our approach employs the GsConvNeXt Head module. This module is also integrated into the Re-ID Head for the extraction of pedestrian features. Both the GRPA and GsConvNeXt Head modules are flexible and adaptable, allowing for seamless integration into other models. Extensive experiments conducted on two benchmark datasets, CUHK-SYSU and PRW, underscore the superior performance of our proposed method. Notably, on the challenging PRW dataset, our approach achieves a mean Average Precision (mAP) of 59.2% and a Top-1 accuracy of 92.2%.
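Two ingredients named in the abstract, weight sharing across dilation rates and Global Response Normalization, can be illustrated with a small 1-D toy in plain Python. This is only a sketch under stated assumptions, not the authors' GRPA module: the dilation rates (1, 2, 3), the kernel values, the scalar `gamma`/`beta`, and the helper names `dilated_conv1d` and `grn` are all hypothetical, and the SE-based selection step is omitted because the abstract does not specify it. The GRN arithmetic follows the commonly published form (per-channel L2 norm divided by the mean norm across channels, then a scaled residual).

```python
import math


def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded, same-length 1-D cross-correlation with a dilated kernel."""
    k = len(kernel)
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            # Taps are spaced `dilation` samples apart around position i.
            idx = i + (j - k // 2) * dilation
            if 0 <= idx < len(signal):
                acc += w * signal[idx]
        out.append(acc)
    return out


def grn(channels, gamma=1.0, beta=0.0, eps=1e-6):
    """Global Response Normalization over a list of channels.

    Each channel is a list of floats. gamma and beta are scalars here
    (an assumption for brevity; in practice they are learned per channel).
    """
    # Global aggregation: L2 norm of each channel over its positions.
    norms = [math.sqrt(sum(v * v for v in ch)) for ch in channels]
    # Divisive normalization by the mean norm across channels.
    mean_norm = sum(norms) / len(norms)
    scales = [n / (mean_norm + eps) for n in norms]
    # Calibrate the input and add the residual connection.
    return [[gamma * v * s + beta + v for v in ch]
            for ch, s in zip(channels, scales)]


# One kernel reused by every branch: the weights are shared,
# only the receptive field changes with the dilation rate.
shared_kernel = [1.0, 2.0, 3.0]
impulse = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
branches = {d: dilated_conv1d(impulse, shared_kernel, d) for d in (1, 2, 3)}

toy_features = [[3.0, 4.0], [0.0, 0.0]]
normalized = grn(toy_features)
```

In this toy, each branch responds to the same impulse with the same three weights, spread over a wider span as the dilation grows. With `gamma=0.0` and `beta=0.0`, `grn` reduces to the identity, which is why a GRN layer can be added to an existing network without changing its initial behavior.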


Pages: 157686 - 157698

Number of pages: 13

29 items in total

[1] Learning Scene-Pedestrian Graph for End-to-End Person Search
Song, Zifan
Zhao, Cairong
Hu, Guosheng
Miao, Duoqian
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2979 - 2990
[2] PS-ARM: An End-to-End Attention-Aware Relation Mixer Network for Person Search
Fiaz, Mustansar
Cholakkal, Hisham
Narayan, Sanath
Anwer, Rao Muhammad
Khan, Fahad Shahbaz
COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 234 - 250
[3] Multi-Attention-Guided Cascading Network for End-to-End Person Search
Yang, Jianxi
Wang, Xiaoyong
APPLIED SCIENCES-BASEL, 2023, 13 (09):
[4] Improved Instance Discrimination and Feature Compactness for End-to-End Person Search
Hou, Shaowei
Zhao, Cairong
Chen, Zhicheng
Wu, Jun
Wei, Zhihua
Miao, Duoqian
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2079 - 2090
[5] Sequential Transformer for End-to-End Person Search
Chen, Long
Xu, Jinhua
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 226 - 238
[6] Joint discriminative representation learning for end-to-end person search
Zhang, Pengcheng
Yu, Xiaohan
Bai, Xiao
Wang, Chen
Zheng, Jin
Ning, Xin
PATTERN RECOGNITION, 2024, 147
[7] END-TO-END PERSON SEARCH SEQUENTIALLY TRAINED ON AGGREGATED DATASET
Loesch, Angelique
Rabarisoa, Jaonary
Audigier, Romaric
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019 : 4574 - 4578
[8] DTHN: Dual-Transformer Head End-to-End Person Search Network
Feng, Cheng
Han, Dezhi
Chen, Chongqing
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01) : 245 - 261
[9] Segmentation mask guided end-to-end person search
Zheng, Dingyuan
Xiao, Jimin
Huang, Kaizhu
Zhao, Yao
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 86
[10] End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification
Khatun, Amena
Denman, Simon
Sridharan, Sridha
Fookes, Clinton
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 3803 - 3813
