A Pedestrian Detection Network Based on an Attention Mechanism and Pose Information

被引:0
|
作者
Jiang, Zhaoyin [1 ]
Huang, Shucheng [2 ]
Li, Mingxing [3 ]
机构
[1] Yangzhou Polytech Coll, Sch Informat Engn, Yangzhou 225009, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Peoples R China
[3] Jiangsu Univ, Jingjiang Coll, Zhenjiang 212013, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 18期
关键词
pedestrian detection; attention mechanism; pose information; pedestrian recognition network;
D O I
10.3390/app14188214
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Pedestrian detection has recently attracted widespread attention as a challenging problem in computer vision. The accuracy of pedestrian detection is affected by differences in gestures, background clutter, local occlusion, differences in scales, pixel blur, and other factors occurring in real scenes. These problems lead to false and missed detections. In view of these visual description deficiencies, we leveraged pedestrian pose information as a supplementary resource to address the occlusion challenges that arise in pedestrian detection. An attention mechanism was integrated into the visual information as a supplement to the pose information, because the acquisition of pose information was limited by the pose estimation algorithm. We developed a pedestrian detection method that integrated an attention mechanism with visual and pose information, including pedestrian region generation and pedestrian recognition networks, effectively addressing occlusion and false detection issues. The pedestrian region proposal network was used to generate a series of candidate regions with possible pedestrian targets from the original image. Then, the pedestrian recognition network was used to judge whether each candidate region contained pedestrian targets. The pedestrian recognition network was composed of four parts: visual features, pedestrian poses, pedestrian attention, and classification modules. The visual feature module was responsible for extracting the visual feature descriptions of candidate regions. The pedestrian pose module was used to extract pose feature descriptions. The pedestrian attention module was used to extract attention information, and the classification module was responsible for fusing visual features and pedestrian pose descriptions with the attention mechanism. The experimental results on the Caltech and CityPersons datasets demonstrated that the proposed method could substantially more accurately identify pedestrians than current state-of-the-art methods.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] GCANet: A Cross-Modal Pedestrian Detection Method Based on Gaussian Cross Attention Network
    Peng, Peiran
    Mu, Feng
    Yan, Peilin
    Song, Liqiang
    Li, Hui
    Chen, Yu
    Li, Jianan
    Xu, Tingfa
    INTELLIGENT COMPUTING, VOL 2, 2022, 507 : 520 - 530
  • [32] Pedestrian object detection with fusion of visual attention mechanism and semantic computation
    Feng Xiao
    Baotong Liu
    Runa Li
    Multimedia Tools and Applications, 2020, 79 : 14593 - 14607
  • [33] Research on Anomaly Network Detection Based on Self-Attention Mechanism
    Hu, Wanting
    Cao, Lu
    Ruan, Qunsheng
    Wu, Qingfeng
    SENSORS, 2023, 23 (11)
  • [34] Wheat Spikes Detection Method Based on Pyramidal Network of Attention Mechanism
    Zhang Q.
    Hu S.
    Shu W.
    Cheng H.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2021, 52 (11): : 253 - 262
  • [35] Pavement crack detection network based on pyramid structure and attention mechanism
    Xiang, Xuezhi
    Zhang, Yuqi
    El Saddik, Abdulmotaleb
    IET IMAGE PROCESSING, 2020, 14 (08) : 1580 - 1586
  • [36] Pedestrian object detection with fusion of visual attention mechanism and semantic computation
    Xiao, Feng
    Liu, Baotong
    Li, Runa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14593 - 14607
  • [37] Intelligent crack detection based on attention mechanism in convolution neural network
    Cui, Xiaoning
    Wang, Qicai
    Dai, Jinpeng
    Xue, Yanjin
    Duan, Yun
    ADVANCES IN STRUCTURAL ENGINEERING, 2021, 24 (09) : 1859 - 1868
  • [38] Lightweight Deep Network With Context Information and Attention Mechanism for Vehicle Detection in Aerial Image
    Shen, Jiaquan
    Liu, Ningzhong
    Sun, Han
    Li, Deguang
    Zhang, Yongxin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [39] Cross-modality interactive attention network for multispectral pedestrian detection
    Zhang, Lu
    Liu, Zhiyong
    Zhang, Shifeng
    Yang, Xu
    Qiao, Hong
    Huang, Kaizhu
    Hussain, Amir
    INFORMATION FUSION, 2019, 50 : 20 - 29
  • [40] Accurate Pedestrian Detection by Human Pose Regression
    Zhao, Yun
    Yuan, Zejian
    Chen, Badong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1591 - 1605