MPE-DETR: A multiscale pyramid enhancement network for object detection in low-light images

Cited by: 1

Authors:

Xue, Rui [1]

Duan, Jialu [1]

Du, Zhengwei [2]

Affiliations:

[1] Harbin Engn Univ, Sch Informat & Commun Engn, Nantong St 145, Harbin 150001, Peoples R China

[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Zheda Rd 38, Hangzhou 310027, Peoples R China

Source:

IMAGE AND VISION COMPUTING | 2024 / Vol. 150

Keywords:

Computer vision; Object detection; Low-light images; Multiscale pyramid networks

DOI:

10.1016/j.imavis.2024.105202

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification code:

081104; 0812; 0835; 1405

Abstract:

Object detection has broad applications in areas such as autonomous driving, security surveillance, and deep-sea exploration. However, the performance of current detection algorithms significantly decreases due to the loss of detail, increased noise, and color distortion in images under low-light or nighttime conditions. To address this problem, we propose a plug-and-play multiscale pyramid enhancement network (MPENet), which elegantly cascades with RT-DETR to establish an end-to-end framework for low-light object detection, named MPE-DETR. First, MPENet utilizes Gaussian blur to decompose images into Gaussian pyramids and Laplacian pyramids at different resolutions. Specifically, we designed a high-frequency texture enhancement (HTE) module to capture the edge and texture information of images, and a low-frequency noise smoothing (LNS) module to better understand the overall structure of images and capture global-scale features. Additionally, by concatenating the output features of the HTE and LNS modules along the channel dimension, feature fusion across different scales is realized. We conducted experiments on the ExDark and ExDark + LOD datasets, which are designed for low-light object detection. The results indicate that the proposed method achieved an improvement of 2.1% in mAP@.5 compared to that of existing SOTA models on the ExDark dataset, and demonstrated strong generalizability and robustness on the ExDark + LOD dataset. The code and results are available at https://github.com/PZDJL/MPENet.
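The pyramid decomposition mentioned in the abstract can be illustrated with a minimal sketch. This is not the MPENet implementation: the 5-tap binomial kernel, the nearest-neighbour upsampling, the border clamping, and all function names below are assumptions chosen for brevity, and the learned HTE and LNS modules are not shown. The sketch only demonstrates the general idea that blurring and downsampling yield a Gaussian pyramid, and that the differences between levels yield a Laplacian pyramid holding the high-frequency detail.

```python
# Illustrative sketch only: Gaussian/Laplacian pyramids for a grayscale image
# stored as a list of rows. Kernel and resampling choices are assumptions,
# not taken from the paper.

KERNEL = [1 / 16, 4 / 16, 6 / 16, 4 / 16, 1 / 16]  # binomial approximation of a Gaussian


def blur_rows(img):
    """Blur each row with KERNEL, clamping indices at the image borders."""
    width = len(img[0])
    return [
        [
            sum(w * row[min(max(x + k - 2, 0), width - 1)] for k, w in enumerate(KERNEL))
            for x in range(width)
        ]
        for row in img
    ]


def transpose(img):
    return [list(col) for col in zip(*img)]


def gaussian_blur(img):
    """Separable blur: filter along rows, then along columns."""
    return transpose(blur_rows(transpose(blur_rows(img))))


def downsample(img):
    """Keep every second pixel in both directions."""
    return [row[::2] for row in img[::2]]


def upsample(img, height, width):
    """Nearest-neighbour upsampling to the requested size."""
    return [[img[y // 2][x // 2] for x in range(width)] for y in range(height)]


def build_pyramids(img, levels):
    """Return (gaussian, laplacian) pyramids with `levels` levels each."""
    gaussian = [img]
    for _ in range(levels - 1):
        gaussian.append(downsample(gaussian_blur(gaussian[-1])))
    laplacian = []
    for fine, coarse in zip(gaussian, gaussian[1:]):
        up = upsample(coarse, len(fine), len(fine[0]))
        laplacian.append([[f - u for f, u in zip(fr, ur)] for fr, ur in zip(fine, up)])
    laplacian.append(gaussian[-1])  # coarsest level keeps the low-frequency residual
    return gaussian, laplacian


def reconstruct(laplacian):
    """Invert the decomposition by adding detail back from coarse to fine."""
    img = laplacian[-1]
    for detail in reversed(laplacian[:-1]):
        up = upsample(img, len(detail), len(detail[0]))
        img = [[d + u for d, u in zip(dr, ur)] for dr, ur in zip(detail, up)]
    return img


# Usage on a small synthetic 8x8 image.
image = [[float((x * y) % 7) for x in range(8)] for y in range(8)]
gaussian, laplacian = build_pyramids(image, levels=3)
restored = reconstruct(laplacian)
```

In this sketch the fine Laplacian levels carry edges and texture, while the coarsest level carries the smoothed global structure, which loosely mirrors the high-frequency and low-frequency split described in the abstract. Because each detail level is defined as the difference from the upsampled coarser level, summing the levels back recovers the input up to floating-point rounding.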


Pages: 11

