GPAN-PS: Global-Response Pedestrian Attention Network for End-to-End Person Search

被引:0
|
作者
Zheng, Linlin [1 ]
Han, Dezhi [1 ]
Xin, Xiaoqi [1 ]
机构
[1] Shanghai Maritime Univ, Sch Informat Engn, Shanghai 201306, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Feature extraction; Pedestrians; Transformers; Head; Accuracy; Deformable models; Attention mechanisms; Residual neural networks; Adaptation models; Search problems; Person Search; pedestrian attention; person detection; person re-identification; EXTRACTION;
D O I
10.1109/ACCESS.2024.3487235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person search, which involves identifying target pedestrians in extensive galleries through person detection and re-identification, has experienced significant advancements across various applications. However, it remains a challenging research area due to factors such as appearance changes, lighting variations, background interference, and pedestrian occlusion. This paper proposes an end-to-end person search framework, termed the Global-Response Pedestrian Attention Network (GPAN-PS), designed to tackle these challenges. Specifically, GPAN-PS includes a novel Global Response Pedestrian Attention (GRPA) module that samples pedestrian features using three shared-weight convolutional layers with distinct dilation rates. This enables the network to adaptively select the optimal receptive field through the Squeeze-and-Excitation (SE) module and the Global Response Normalization (GRN) module, enhancing feature stability. Furthermore, we design a GsConvNeXt Head module to bolster feature expressiveness and facilitate inter-channel information interaction. Rather than employing the ConvNeXt (conv5) module as the Box Head for generating refined proposals, our approach employs the GsConvNeXt Head module. This module is also integrated into the Re-ID Head for the extraction of pedestrian features. Both the GRPA and GsConvNeXt Head modules are flexible and adaptable, allowing for seamless integration into other models. Extensive experiments conducted on two benchmark datasets, CUHK-SYSU and PRW, underscore the superior performance of our proposed method. Notably, on the challenging PRW dataset, our approach achieves a mean Average Precision (mAP) of 59.2% and a Top-1 accuracy of 92.2%.
引用
收藏
页码:157686 / 157698
页数:13
相关论文
共 29 条
  • [1] Learning Scene-Pedestrian Graph for End-to-End Person Search
    Song, Zifan
    Zhao, Cairong
    Hu, Guosheng
    Miao, Duoqian
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2979 - 2990
  • [2] PS-ARM: An End-to-End Attention-Aware Relation Mixer Network for Person Search
    Fiaz, Mustansar
    Cholakkal, Hisham
    Narayan, Sanath
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    COMPUTER VISION - ACCV 2022, PT V, 2023, 13845 : 234 - 250
  • [3] Multi-Attention-Guided Cascading Network for End-to-End Person Search
    Yang, Jianxi
    Wang, Xiaoyong
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [4] Improved Instance Discrimination and Feature Compactness for End-to-End Person Search
    Hou, Shaowei
    Zhao, Cairong
    Chen, Zhicheng
    Wu, Jun
    Wei, Zhihua
    Miao, Duoqian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2079 - 2090
  • [5] Sequential Transformer for End-to-End Person Search
    Chen, Long
    Xu, Jinhua
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 226 - 238
  • [6] Joint discriminative representation learning for end-to-end person search
    Zhang, Pengcheng
    Yu, Xiaohan
    Bai, Xiao
    Wang, Chen
    Zheng, Jin
    Ning, Xin
    PATTERN RECOGNITION, 2024, 147
  • [7] END-TO-END PERSON SEARCH SEQUENTIALLY TRAINED ON AGGREGATED DATASET
    Loesch, Angelique
    Rabarisoa, Jaonary
    Audigier, Romaric
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4574 - 4578
  • [8] DTHN: Dual-Transformer Head End-to-End Person Search Network
    Feng, Cheng
    Han, Dezhi
    Chen, Chongqing
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 245 - 261
  • [9] Segmentation mask guided end-to-end person search
    Zheng, Dingyuan
    Xiao, Jimin
    Huang, Kaizhu
    Zhao, Yao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 86
  • [10] End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification
    Khatun, Amena
    Denman, Simon
    Sridharan, Sridha
    Fookes, Clinton
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 3803 - 3813