Multi-Modal Pedestrian Detection Algorithm Based on Illumination Perception Weight Fusion

被引:1
|
作者
Liu Keqi [1 ]
Dong Mianmian [1 ]
Gao Hui [1 ]
Zhigang Lu [1 ]
Guo Baoyi [2 ]
Pang Min [3 ]
机构
[1] Xian Technol Univ, Sch Elect & Informat Engn, Xian 710021, Shaanxi, Peoples R China
[2] Xian Technol Univ, Undergrad Coll, Xian 710021, Shaanxi, Peoples R China
[3] Beijing Inst Microelect Technol, Beijing 100000, Peoples R China
关键词
multi-modal image fusion; attention mechanism; illumination perception weight fusion; pedestrian detection;
D O I
10.3788/LOP222528
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing pedestrian target detection algorithm based on visible light and infrared modal fusion has a high missed detection rate in all-weather environment. In this paper, we propose a novel multi-modal pedestrian target detection algorithm based on illumination perception weight fusion to solve this problem. First, ResNet50, incorporating an efficient channel attention (ECA) mechanism module, was used as a feature extraction network to extract the features of both visible light and infrared modes, respectively. Second, the existing illumination weighted sensing fusion strategy was improved. A new illumination weighted sensing fusion mechanism was designed to attain the corresponding weights of the visible light and infrared modes, and weighted fusion was performed to achieve fusion features to reduce the missed detection rate of the algorithm. Finally, the multi-modal features extracted from the last layer of the feature network and the generated fusion features were fed into the detection network to accomplish the detection of pedestrian targets. Experimental results show that the proposed algorithm has an excellent detection performance on the KAIST dataset, and the missed detection rate for pedestrian targets in all-weather is 11.16%.
引用
收藏
页数:11
相关论文
共 24 条
  • [1] Pedestrian detection with unsupervised multispectral feature learning using deep neural networks
    Cao, Yanpeng
    Guan, Dayan
    Huang, Weilin
    Yang, Jiangxin
    Cao, Yanlong
    Qiao, Yu
    [J]. INFORMATION FUSION, 2019, 46 : 206 - 217
  • [2] [陈莹 Chen Ying], 2020, [光学精密工程, Optics and Precision Engineering], V28, P2700
  • [3] Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection
    Guan, Dayan
    Cao, Yanpeng
    Yang, Jiangxin
    Cao, Yanlong
    Yang, Michael Ying
    [J]. INFORMATION FUSION, 2019, 50 : 148 - 157
  • [4] Identity Mappings in Deep Residual Networks
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645
  • [5] He Z F, 2022, Chinese Journal of Lasers, V49
  • [6] Hwang S, 2015, PROC CVPR IEEE, P1037, DOI 10.1109/CVPR.2015.7298706
  • [7] Kailai Zhou, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12363), P787, DOI 10.1007/978-3-030-58523-5_46
  • [8] Fully Convolutional Region Proposal Networks for Multispectral Person Detection
    Koenig, Daniel
    Adam, Michael
    Jarvers, Christian
    Layher, Georg
    Neumann, Heiko
    Teutsch, Michael
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 243 - 250
  • [9] Lee Y, 2018, ASIAPAC SIGN INFO PR, P694, DOI 10.23919/APSIPA.2018.8659688
  • [10] Leng Y, 2019, Research on target recognition based on featurelevel fusion of visible/infrared images