MPE-DETR: A multiscale pyramid enhancement network for object detection in low-light images

被引:1
作者
Xue, Rui [1 ]
Duan, Jialu [1 ]
Du, Zhengwei [2 ]
机构
[1] Harbin Engn Univ, Sch Informat & Commun Engn, Nantong St 145, Harbin 150001, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Zheda Rd 38, Hangzhou 310027, Peoples R China
关键词
Computer vision; Object detection; Low-light images; Multiscale pyramid networks;
D O I
10.1016/j.imavis.2024.105202
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection has broad applications in areas such as autonomous driving, security surveillance, and deep-sea exploration. However, the performance of current detection algorithms significantly decreases due to the loss of detail, increased noise, and color distortion in images under low-light or nighttime conditions. To address this problem, we propose a plug-and-play multiscale pyramid enhancement network (MPENet), which elegantly cascades with RT-DETR to establish an end-to-end framework for low-light object detection, named MPE-DETR. First, MPENet utilizes Gaussian blur to decompose images into Gaussian pyramids and Laplacian pyramids at different resolutions. Specifically, we designed a high-frequency texture enhancement (HTE) module to capture the edge and texture information of images, and a low-frequency noise smoothing (LNS) module to better understand the overall structure of images and capture global-scale features. Additionally, by concatenating the output features of the HTE and LNS modules along the channel dimension, feature fusion across different scales is realized. We conducted experiments on the ExDark and ExDark + LOD datasets, which are designed for low-light object detection. The results indicate that the proposed method achieved an improvement of 2.1% in mAP@.5 compared to that of existing SOTA models on the ExDark dataset, and demonstrated strong generalizability and robustness on the ExDark + LOD dataset. The code and results are available at https://github. com/PZDJL/MPENet.
引用
收藏
页数:11
相关论文
共 63 条
  • [1] A dynamic histogram equalization for image contrast enhancement
    Abdullah-Al-Wadud, M.
    Kabir, Md. Hasanul
    Dewan, M. Ali Akber
    Chae, Oksam
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (02) : 593 - 600
  • [2] Adelson E. H., 1984, RCA Eng., V29, P33
  • [3] The Gaussian transform of distributions: Definition, computation and application
    Alecu, Teodor Iulian
    Voloshynovskiy, Sviatoslav
    Pun, Thierry
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (08) : 2976 - 2985
  • [4] Linear interpolation revitalized
    Blu, T
    Thévenaz, P
    Unser, M
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (05) : 710 - 719
  • [5] Burt P.J, 1984, PYRAMID STRUCTURE EF, P6, DOI [10.1007/978-3-642-51590-32, DOI 10.1007/978-3-642-51590-32, 10.1007/978-3-642-51590-3_2, DOI 10.1007/978-3-642-51590-3_2]
  • [6] Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement
    Cai, Yuanhao
    Bian, Hao
    Lin, Jing
    Wang, Haoqian
    Timofte, Radu
    Zhang, Yulun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12470 - 12479
  • [7] A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
    Cai, Zhaowei
    Fan, Quanfu
    Feris, Rogerio S.
    Vasconcelos, Nuno
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 354 - 370
  • [8] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [9] Seeing Motion in the Dark
    Chen, Chen
    Chen, Qifeng
    Do, Minh N.
    Koltun, Vladlen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3184 - 3193
  • [10] R-CNN for Small Object Detection
    Chen, Chenyi
    Liu, Ming-Yu
    Tuzel, Oncel
    Xiao, Jianxiong
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 214 - 230