Research on Pedestrian Detection Based on Multimodal Information Fusion

Cited by: 0
Authors
Yang, Xiaoping [1 ,2 ]
Li, Zhehong [1 ,2 ]
Liu, Yuan [3 ]
Huang, Ran [1 ,2 ]
Tan, Kai [1 ,2 ]
Huang, Lin [1 ,2 ]
Affiliations
[1] Guilin Univ Technol, Sch Informat Sci & Engn, Guilin 541004, Guangxi, Peoples R China
[2] Guilin Univ Technol, Guangxi Key Lab Embedded Technol & Intelligent Sy, Guilin 541004, Guangxi, Peoples R China
[3] Guilin Med Univ, Coll Intelligent Med & Biotechnol, Guilin 541004, Guangxi, Peoples R China
Source
INFORMATION TECHNOLOGY AND CONTROL | 2023 / Vol. 52 / No. 04
Funding
National Natural Science Foundation of China;
Keywords
Multimodal Pedestrian Detection; Faster R-CNN; Generalized Intersection Over Union; Feature Fusion;
DOI
10.5755/j01.itc.52.4.33766
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Autonomous driving systems that rely on a single-modality sensor are susceptible to the external environment when detecting pedestrians. This paper proposes a multimodal pedestrian detection method that fuses visible-light and thermal infrared images. First, 1 x 1 convolution and dilated convolution are introduced into the residual network, and ROIAlign replaces ROIPooling for mapping candidate boxes to the feature layer, in order to optimize Faster R-CNN. Second, the generalized intersection over union (GIoU) loss is used as the loss function for bounding-box localization regression. Finally, to explore how fusing at different stages affects multimodal pedestrian detection in the improved Faster R-CNN, four multimodal neural network structures are designed to fuse visible and thermal infrared images. Experimental results show that the proposed algorithm performs better on the KAIST dataset than current mainstream detection algorithms. Compared with the conventional ACF + T + THOG pedestrian detector, the AP is 8.38 percentage points higher. The miss rate is 5.34 percentage points lower than that of the visible-light pedestrian detector.
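The GIoU term named in the abstract can be illustrated with a minimal sketch. It follows the standard published definition of generalized intersection over union, not the paper's own implementation, which this record does not show. The function names `giou` and `giou_loss` and the `(x1, y1, x2, y2)` box layout are assumptions made for illustration.

```python
def giou(box_a, box_b):
    """Generalized IoU of two axis-aligned boxes.

    Assumed layout: (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)

    # Intersection rectangle; width/height clamp to 0 when boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    union = area_a + area_b - inter
    iou = inter / union

    # Smallest axis-aligned box enclosing both inputs.
    enclose_w = max(ax2, bx2) - min(ax1, bx1)
    enclose_h = max(ay2, by2) - min(ay1, by1)
    enclose = enclose_w * enclose_h

    # Penalize the empty space inside the enclosing box.
    return iou - (enclose - union) / enclose


def giou_loss(pred_box, target_box):
    """Regression loss 1 - GIoU; ranges from 0 (perfect match) up to 2."""
    return 1.0 - giou(pred_box, target_box)
```

Unlike plain IoU, this value keeps changing when the predicted and ground-truth boxes do not overlap at all, so the loss still gives a useful signal for distant predictions.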
Pages: 1045 - 1057
Page count: 13