MLFFCSP: a new anti-occlusion pedestrian detection network with multi-level feature fusion for small targets

被引:5
|
作者
Huan, Ruohong [1 ]
Zhang, Ji [1 ]
Xie, Chaojie [1 ]
Liang, Ronghua [1 ]
Chen, Peng [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Pedestrian detection; Multi-level; Feature; Fusion; Anti-occlusion; Small targets;
D O I
10.1007/s11042-023-14721-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian detection relying on deep convolution neural networks has achieved significant progress. However, the performance of current pedestrian detection algorithms remains unsatisfactory when it comes to small targets or heavily occluded pedestrians. In this paper, a new anti-occlusion video pedestrian detection network with multi-level feature fusion named MLFFCSP is proposed for small targets and heavily occluded pedestrians. In the proposed network, the pyramid convolutional neural network PyConvResNet101 is used as backbone to extract features. Then, the shallow and deep features are fused at multiple levels to fully obtain the shallow location information and deep semantic information. In order to improve the robustness of the model, data augmentation is also implemented via random erasing on the training data. Experiments are carried out on Caltech and Citypersons datasets, and the log-average miss rate is used to evaluate the performance of the model. The results show that the performance of MLFFCSP is better than other pedestrian detection algorithms in the case of small targets and serious occlusion.
引用
收藏
页码:29405 / 29430
页数:26
相关论文
empty
未找到相关数据