Visible-Thermal Image Object Detection via the Combination of Illumination Conditions and Temperature Information

被引:25
作者
Zhou, Hang [1 ]
Sun, Min [1 ]
Ren, Xiang [1 ]
Wang, Xiuyuan [1 ]
机构
[1] Peking Univ, Inst Remote Sensing & GIS, Beijing 100871, Peoples R China
关键词
object detection; multi-spectral fusion; visible and thermal images; RetinaNet; illumination conditions; dynamic weight fusion; temperature information; a priori knowledge; FUSION; CNN;
D O I
10.3390/rs13183656
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Object detection plays an important role in autonomous driving, disaster rescue, robot navigation, intelligent video surveillance, and many other fields. Nonetheless, visible images are poor under weak illumination conditions, and thermal infrared images are noisy and have low resolution. Consequently, neither of these two data sources yields satisfactory results when used alone. While some scholars have combined visible and thermal images for object detection, most did not consider the illumination conditions and the different contributions of diverse data sources to the results. In addition, few studies have made use of the temperature characteristics of thermal images. Therefore, in the present study, visible and thermal images are utilized as the dataset, and RetinaNet is used as the baseline to fuse features from different data sources for object detection. Moreover, a dynamic weight fusion method, which is based on channel attention according to different illumination conditions, is used in the fusion component, and the channel attention and a priori temperature mask (CAPTM) module is proposed; the CAPTM can be applied to a deep learning network as a priori knowledge and maximizes the advantage of temperature information from thermal images. The main innovations of the present research include the following: (1) the consideration of different illumination conditions and the use of different fusion parameters for different conditions in the feature fusion of visible and thermal images; (2) the dynamic fusion of different data sources in the feature fusion of visible and thermal images; (3) the use of temperature information as a priori knowledge (CAPTM) in feature extraction. To a certain extent, the proposed methods improve the accuracy of object detection at night or under other weak illumination conditions and with a single data source. Compared with the state-of-the-art (SOTA) method, the proposed method is found to achieve superior detection accuracy with an overall mean average precision (mAP) improvement of 0.69%, including an AP improvement of 2.55% for the detection of the Person category. The results demonstrate the effectiveness of the research methods for object detection, especially temperature information-rich object detection.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Visible-Thermal Pedestrian Detection via Unsupervised Transfer Learning
    Lyu, Chengjin
    Heyer, Patrick
    Munir, Asad
    Platisa, Ljiljana
    Micheloni, Christian
    Goossens, Bart
    Philips, Wilfried
    2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 158 - 163
  • [2] An Unsupervised Transfer Learning Framework for Visible-Thermal Pedestrian Detection
    Lyu, Chengjin
    Heyer, Patrick
    Goossens, Bart
    Philips, Wilfried
    SENSORS, 2022, 22 (12)
  • [3] Potential evaluation of visible-thermal UAV image fusion for individual tree detection based on convolutional neural network
    Moradi, Fatemeh
    Javan, Farzaneh Dadrass
    Samadzadegan, Farhad
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 113
  • [4] UAV-Based Human Detection With Visible-Thermal Fused YOLOv5 Network
    Zou, Xiongxin
    Peng, Tangle
    Zhou, Yimin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (03) : 3814 - 3823
  • [5] Learning Discriminatory Information for Object Detection on Urine Sediment Image
    Chan, Sixian
    Wu, Binghui
    Zhang, Guodao
    Yao, Yuan
    Wang, Hongqiang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (01): : 411 - 428
  • [6] Object detection based on combination of visible and thermal videos using a joint sample consensus background model
    Han, Guang
    Cai, Xi
    Wang, Jinkuan
    Journal of Software, 2013, 8 (04) : 987 - 994
  • [7] Object detection and recognition via deformable illumination and deformable shape
    Zhou, Qiang
    Ma, Limin
    Celenk, Mehmet
    Chelberg, David
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2737 - +
  • [8] Infrared and Visible Image Object Detection via Focused Feature Enhancement and Cascaded Semantic Extension
    Xiao, Xiaowu
    Wang, Bo
    Miao, Lingjuan
    Li, Linhao
    Zhou, Zhiqiang
    Ma, Jinlei
    Dong, Dandan
    REMOTE SENSING, 2021, 13 (13)
  • [9] Eyeglasses removal of thermal image based on visible information
    Wong, W. K.
    Zhao, Haitao
    INFORMATION FUSION, 2013, 14 (02) : 163 - 176
  • [10] Object detection based on combination of local and spatial information
    Qinkun Xiao 1
    2.Fujitsu Research & Development Center Co.
    3.Department of Automation
    Journal of Systems Engineering and Electronics, 2011, 22 (04) : 715 - 720