MAFA-net: pedestrian detection network based on multi-scale attention feature aggregation

被引:7
作者
Xia, Hao [1 ]
Wan, Honglin [1 ]
Ou, Jiayu [1 ]
Ma, Jun [1 ]
Lv, Xinyao [1 ]
Bai, Chengjie [1 ]
机构
[1] Shandong Normal Univ, Sch Phys & Elect Sci, Jinan 250358, Peoples R China
关键词
Pedestrian detection network; Deep dilate block; Pedestrian attention blocks; Feature aggregation module;
D O I
10.1007/s10489-021-02796-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With pedestrian detection algorithms, balancing the trade-off between accuracy and speed remains challenging. Following the central point-based one-stage object detection paradigm, a pedestrian detection algorithm based on multi-scale attention feature aggregation (MAFA) is proposed to improve accuracy while considering real-time performance. We refer to the proposed algorithm as MAFA-Net. Through the design of deep dilate blocks, deeper features are extracted. Pedestrian attention blocks are added to mine more relevant information between features from the perspective of spatial and passage-wise dimensions, and pedestrian features are enhanced. Feature aggregation modules are used to fuse different scale features, and combine the rich high-level semantic features with the accurate location features of the low-level features. Experiments were conducted on two challenging pedestrian detection datasets, i.e., CityPersons and Caltech, using MR-2 as the evaluation index. For Caltech, MR-2 is 4.58% under reasonable conditions. For CityPersons, MR-2 is 11.47% and 10.05% under reasonable and partial occlusion conditions, which is 0.43% and 1.35% better than the suboptimal comparison detection method. The results demonstrate that a good performance is obtained, and the effectiveness and feasibility of the algorithm are verified.
引用
收藏
页码:7686 / 7699
页数:14
相关论文
empty
未找到相关数据