GPAN-PS: Global-Response Pedestrian Attention Network for End-to-End Person Search

被引:0
|
作者
Zheng, Linlin [1 ]
Han, Dezhi [1 ]
Xin, Xiaoqi [1 ]
机构
[1] Shanghai Maritime Univ, Sch Informat Engn, Shanghai 201306, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Feature extraction; Pedestrians; Transformers; Head; Accuracy; Deformable models; Attention mechanisms; Residual neural networks; Adaptation models; Search problems; Person Search; pedestrian attention; person detection; person re-identification; EXTRACTION;
D O I
10.1109/ACCESS.2024.3487235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person search, which involves identifying target pedestrians in extensive galleries through person detection and re-identification, has experienced significant advancements across various applications. However, it remains a challenging research area due to factors such as appearance changes, lighting variations, background interference, and pedestrian occlusion. This paper proposes an end-to-end person search framework, termed the Global-Response Pedestrian Attention Network (GPAN-PS), designed to tackle these challenges. Specifically, GPAN-PS includes a novel Global Response Pedestrian Attention (GRPA) module that samples pedestrian features using three shared-weight convolutional layers with distinct dilation rates. This enables the network to adaptively select the optimal receptive field through the Squeeze-and-Excitation (SE) module and the Global Response Normalization (GRN) module, enhancing feature stability. Furthermore, we design a GsConvNeXt Head module to bolster feature expressiveness and facilitate inter-channel information interaction. Rather than employing the ConvNeXt (conv5) module as the Box Head for generating refined proposals, our approach employs the GsConvNeXt Head module. This module is also integrated into the Re-ID Head for the extraction of pedestrian features. Both the GRPA and GsConvNeXt Head modules are flexible and adaptable, allowing for seamless integration into other models. Extensive experiments conducted on two benchmark datasets, CUHK-SYSU and PRW, underscore the superior performance of our proposed method. Notably, on the challenging PRW dataset, our approach achieves a mean Average Precision (mAP) of 59.2% and a Top-1 accuracy of 92.2%.
引用
收藏
页码:157686 / 157698
页数:13
相关论文
共 29 条
  • [21] End-to-End Pixel-Wisely Detection of Oceanic Eddy on SAR Images With Stacked Attention Network
    Xu, Ming
    Li, Hongping
    Yun, Yuying
    Yang, Fan
    Li, Cuishu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10138 - 10151
  • [22] Simultaneous End-to-End Vehicle and License Plate Detection With Multi-Branch Attention Neural Network
    Chen, Song-Lu
    Yang, Chun
    Ma, Jia-Wei
    Chen, Feng
    Yin, Xu-Cheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (09) : 3686 - 3695
  • [23] A Convolutional Network With Multi-Scale and Attention Mechanisms for End-to-End Single-Channel Speech Enhancement
    Xiang, Xiaoxiao
    Zhang, Xiaojuan
    Chen, Haozhe
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1455 - 1459
  • [24] A Hybrid End-to-End Spatiotemporal Attention Neural Network With Graph-Smooth Signals for EEG Emotion Recognition
    Sartipi, Shadi
    Torkamani-Azar, Mastaneh
    Cetin, Mujdat
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 732 - 743
  • [25] Tracking Beyond Detection: Learning a Global Response Map for End-to-End Multi-Object Tracking
    Wan, Xingyu
    Cao, Jiakai
    Zhou, Sanping
    Wang, Jinjun
    Zheng, Nanning
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8222 - 8235
  • [26] Cross-scale global attention feature pyramid network for person search
    Li, Yang
    Xu, Huahu
    Bian, Minjie
    Xiao, Junsheng
    IMAGE AND VISION COMPUTING, 2021, 116
  • [27] FPJA-Net: A Lightweight End-to-End Network for Sleep Stage Prediction Based on Feature Pyramid and Joint Attention
    Liu, Zhi
    Zhang, Qinhan
    Luo, Sixin
    Qin, Meiqiao
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (04) : 769 - 780
  • [28] Adjacent-Atrous Mechanism for Expanding Global Receptive Fields: An End-to-End Network for Multiattribute Scene Analysis in Remote Sensing Imagery
    Li, Zhengpeng
    Hu, Jun
    Wu, Kunyang
    Miao, Jiawei
    Wu, Jiansheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [29] DGECN plus plus : A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation via Attention Mechanism
    Cao, Tuo
    Zhang, Wenxiao
    Fu, Yanping
    Zheng, Shengjie
    Luo, Fei
    Xiao, Chunxia
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4214 - 4228