Robust dual-modal image quality assessment aware deep learning network for traffic targets detection of autonomous vehicles

被引:0
作者
Keke Geng
Ge Dong
Wenhan Huang
机构
[1] Southeast University,School of Mechanical Engineering
[2] Tsinghua University,Institute of Aeronautics and Astronautics
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Autonomous driving; Deep neural network; Image quality assessment; Multi-spectral image;
D O I
暂无
中图分类号
学科分类号
摘要
The multi-spectral image pairs composed of LIDAR and RGB images demonstrates more effective detection performance in complex traffic environments, such as low illumination, motion blur and strong noise, etc. However, there is still a lack of relevant research on how to better fuse the two modalities to improve the robustness and detection accuracy of the perception system for autonomous vehicles under the condition of low visible light image quality. In this paper, we proposed a dual-modal image quality aware deep neural network (DMIQADNN). We comprehensively compared and analyzed the adaptions of fusion architectures in the early, middle, late, and score stages. By comprehensively considering the detection accuracy and detection speed, the fusion architecture in the middle stage was selected. Besides, we developed an image quality assessment network (IQAN) to evaluate the image quality score for RGB images. The corresponding fusion weights for RGB sub-network and LIDAR sub-network were adaptively assigned by using the proposed fusion weight assignment function. Then based on the calculated fusion weights, the RGB and LIDAR sub-networks were adaptively merged via a data fusion sub-network. The RGB images in the KITTI dataset were processed by reducing illumination and adding motion blur and Gaussian noises to produce a modified dataset containing 7481 RBG-LIDAR image pairs, and the DNIQADNN was trained and tested by semi-automatic annotation. The experimental results on modified KITTI Benchmark and dataset collected by using our own developed autonomous vehicle validate the robustness and effectiveness of proposed method. The ultimate FPS and AP values of the DNIQADNN reach 27 and 39.1, which are superior to those of the state-of-the-art instance segmentation networks.
引用
收藏
页码:6801 / 6826
页数:25
相关论文
共 5 条
  • [1] Robust dual-modal image quality assessment aware deep learning network for traffic targets detection of autonomous vehicles
    Geng, Keke
    Dong, Ge
    Huang, Wenhan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (05) : 6801 - 6826
  • [2] Deep Dual-Modal Traffic Objects Instance Segmentation Method Using Camera and LIDAR Data for Autonomous Driving
    Geng, Keke
    Dong, Ge
    Yin, Guodong
    Hu, Jingyu
    REMOTE SENSING, 2020, 12 (20) : 1 - 22
  • [3] Robust image retrieval by cascading a deep quality assessment network
    Somasundaran, Biju Venkadath
    Soundararajan, Rajiv
    Biswas, Soma
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 80
  • [4] Blind Image Quality Assessment with Visual Sensitivity Enhanced Dual-Channel Deep Convolutional Neural Network
    Zhang, Min
    Hou, Wenjing
    Zhang, Lei
    Feng, Jun
    2020 TWELFTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2020,
  • [5] A cutting-edge video anomaly detection method using image quality assessment and attention mechanism-based deep learning
    Cui, Chunying
    Liu, Linlin
    Qiao, Rui
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 108 : 476 - 485