Occluded Pedestrian Detection Algorithm Based on Attention Mechanism

被引:15
作者
Zou Ziyin [1 ,2 ]
Gai Shaoyan [1 ,2 ]
Da Feipeng [1 ,2 ,3 ]
Li Yu [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Guangdong, Peoples R China
关键词
machine vision; occluded pedestrian detection; attention mechanism; k-means clustering; intersection over union; NMS;
D O I
10.3788/AOS202141.1515001
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In light of the situation that it is difficult to accurately detect pedestrians in real scenes due to mutual occlusion, a feature extraction enhanced detection algorithm based on attention mechanism is proposed. Firstly, attention modules are added to learn the relationship between feature channels and the spatial information of feature maps, so as to enhance feature extraction in the visual area of pedestrian targets. Secondly, according to the actual size of pedestrian data, the k-means++ algorithm is used to cluster pedestrian labels, so as to determine the size and proportion of anchors. Distance-intersection over union loss function (DIOULoss) is used to design the loss function of the detector, so that the regression of the detection box pays more attention to the intersection over union between the candidate box and the real box, as well as the center distance between the two boxes. Finally, a new non-maximum suppression algorithm (DSoft-NMS) is presented to preserve more accurate prediction boxes. The proposed method has been tested on CityPersons and WiderPerson datasets, and the results show that the proposed method with a simple network structure has higher detection accuracy in occluded pedestrian detection, which is convenient for subsequent research.
引用
收藏
页数:9
相关论文
共 29 条
  • [1] Soft-NMS - Improving Object Detection With One Line of Code
    Bodla, Navaneeth
    Singh, Bharat
    Chellappa, Rama
    Davis, Larry S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
  • [2] Learning Pixel-Level and Instance-Level Context-Aware Features for Pedestrian Detection in Crowds
    Fei, Chi
    Liu, Bin
    Chen, Zhu
    Yu, Nenghai
    [J]. IEEE ACCESS, 2019, 7 : 94944 - 94953
  • [3] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [4] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [5] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [6] Jialian Wu, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P13427, DOI 10.1109/CVPR42600.2020.01344
  • [7] Multi-Scale Target Detection Algorithm Based on Attention Mechanism
    Ju Moran
    Luo Jiangning
    Wang Zhongbo
    Luo Haibo
    [J]. ACTA OPTICA SINICA, 2020, 40 (13)
  • [8] PedJointNet: Joint Head-Shoulder and Full body Deep Network for Pedestrian Detection
    Lin, Chih-Yang
    Xie, Hong-Xia
    Zheng, Hua
    [J]. IEEE ACCESS, 2019, 7 : 47687 - 47697
  • [9] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [10] Adaptive NMS: Refining Pedestrian Detection in a Crowd
    Liu, Songtao
    Huang, Di
    Wang, Yunhong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6452 - 6461