GPAN-PS: Global-Response Pedestrian Attention Network for End-to-End Person Search

被引：0

作者：

Zheng, Linlin ^{[1
]}

Han, Dezhi ^{[1
]}

Xin, Xiaoqi ^{[1
]}

机构：

[1] Shanghai Maritime Univ, Sch Informat Engn, Shanghai 201306, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

中国国家自然科学基金; 上海市自然科学基金;

关键词：

Feature extraction; Pedestrians; Transformers; Head; Accuracy; Deformable models; Attention mechanisms; Residual neural networks; Adaptation models; Search problems; Person Search; pedestrian attention; person detection; person re-identification; EXTRACTION;

D O I：

10.1109/ACCESS.2024.3487235

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Person search, which involves identifying target pedestrians in extensive galleries through person detection and re-identification, has experienced significant advancements across various applications. However, it remains a challenging research area due to factors such as appearance changes, lighting variations, background interference, and pedestrian occlusion. This paper proposes an end-to-end person search framework, termed the Global-Response Pedestrian Attention Network (GPAN-PS), designed to tackle these challenges. Specifically, GPAN-PS includes a novel Global Response Pedestrian Attention (GRPA) module that samples pedestrian features using three shared-weight convolutional layers with distinct dilation rates. This enables the network to adaptively select the optimal receptive field through the Squeeze-and-Excitation (SE) module and the Global Response Normalization (GRN) module, enhancing feature stability. Furthermore, we design a GsConvNeXt Head module to bolster feature expressiveness and facilitate inter-channel information interaction. Rather than employing the ConvNeXt (conv5) module as the Box Head for generating refined proposals, our approach employs the GsConvNeXt Head module. This module is also integrated into the Re-ID Head for the extraction of pedestrian features. Both the GRPA and GsConvNeXt Head modules are flexible and adaptable, allowing for seamless integration into other models. Extensive experiments conducted on two benchmark datasets, CUHK-SYSU and PRW, underscore the superior performance of our proposed method. Notably, on the challenging PRW dataset, our approach achieves a mean Average Precision (mAP) of 59.2% and a Top-1 accuracy of 92.2%.

引用

页码：157686 / 157698

页数：13

共 29 条

[21] End-to-End Pixel-Wisely Detection of Oceanic Eddy on SAR Images With Stacked Attention Network
Xu, Ming
Li, Hongping
Yun, Yuying
Yang, Fan
Li, Cuishu
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10138 - 10151
[22] Simultaneous End-to-End Vehicle and License Plate Detection With Multi-Branch Attention Neural Network
Chen, Song-Lu
Yang, Chun
Ma, Jia-Wei
Chen, Feng
Yin, Xu-Cheng
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (09) : 3686 - 3695
[23] A Convolutional Network With Multi-Scale and Attention Mechanisms for End-to-End Single-Channel Speech Enhancement
Xiang, Xiaoxiao
Zhang, Xiaojuan
Chen, Haozhe
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1455 - 1459
[24] A Hybrid End-to-End Spatiotemporal Attention Neural Network With Graph-Smooth Signals for EEG Emotion Recognition
Sartipi, Shadi
Torkamani-Azar, Mastaneh
Cetin, Mujdat
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 732 - 743
[25] Tracking Beyond Detection: Learning a Global Response Map for End-to-End Multi-Object Tracking
Wan, Xingyu
Cao, Jiakai
Zhou, Sanping
Wang, Jinjun
Zheng, Nanning
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8222 - 8235
[26] Cross-scale global attention feature pyramid network for person search
Li, Yang
Xu, Huahu
Bian, Minjie
Xiao, Junsheng
IMAGE AND VISION COMPUTING, 2021, 116
[27] FPJA-Net: A Lightweight End-to-End Network for Sleep Stage Prediction Based on Feature Pyramid and Joint Attention
Liu, Zhi
Zhang, Qinhan
Luo, Sixin
Qin, Meiqiao
INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (04) : 769 - 780
[28] Adjacent-Atrous Mechanism for Expanding Global Receptive Fields: An End-to-End Network for Multiattribute Scene Analysis in Remote Sensing Imagery
Li, Zhengpeng
Hu, Jun
Wu, Kunyang
Miao, Jiawei
Wu, Jiansheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
[29] DGECN plus plus : A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation via Attention Mechanism
Cao, Tuo
Zhang, Wenxiao
Fu, Yanping
Zheng, Shengjie
Luo, Fei
Xiao, Chunxia
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4214 - 4228

← 1 2 3 →