MFGDAFormer: Multi-scale frequency-guided dual-branch attention transformer for low-light image enhancement

被引：0

作者：

Gong, Faming ^{[1
]}

Zhang, Yimeng ^{[1
]}

Du, Chengze ^{[1
]}

Ji, Xiaofeng ^{[1
]}

机构：

[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China

来源：

NEUROCOMPUTING | 2025年 / 651卷

关键词：

Low-light image enhancement; Fourier transform; Multi-scale features; Attention mechanism; NETWORK;

D O I：

10.1016/j.neucom.2025.130937

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Images captured in low-light conditions, such as at night or in backlit environments, often suffer from uneven illumination, insufficient contrast, and significant noise, which compromise the accuracy and robustness of visual tasks like object detection. Low-light image enhancement techniques are essential for improving visibility, restoring details, and reducing noise, thereby enhancing the performance of various visual tasks. Effective illumination adjustment and detail preservation are key to achieving successful low-light image enhancement. However, existing methods face significant challenges, including inadequate modeling of lighting patterns, insufficient local feature representation, neglect of frequency-domain information, and difficulty balancing noise suppression with detail preservation. This paper introduces the Multi-scale Frequency-Guided Dual-branch Attention Transformer (MFGDAFormer) for low-light image enhancement. The framework incorporates a CNN-based illumination estimator to guide enhancement and lighting adjustment, while the Fourier Sparse Multi-scale Attention Mechanism (FSMAM) facilitates effective frequency-domain analysis, adaptive feature modulation, and multi-scale detail preservation. Furthermore, the U-Net architecture is employed for fine-grained illumination enhancement, and the Dual-Large Kernel Activation Attention Module (DLKA) integrates large-kernel convolutions with adaptive attention mechanisms to optimize feature fusion. Experimental results demonstrate that MFGDAFormer outperforms state-of-the-art models, with PSNR improvements ranging from 0.21 dB to 0.88 dB across multiple low-light datasets.

引用

页数：14

共 61 条

[1] LGN-CNN: A biologically inspired CNN architecture [J].

Bertoni, Federico ;

Citti, Giovanna ;

Sarti, Alessandro .

NEURAL NETWORKS, 2022, 145 :42-55

[2] InstructPix2Pix: Learning to Follow Image Editing Instructions [J].

Brooks, Tim ;

Holynski, Aleksander ;

Efros, Alexei A. .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :18392-18402

[3]

Bychkovsky V, 2011, PROC CVPR IEEE, P97

[4] Fourier Transform Infrared Spectroscopy to Assess the Degree of Alteration of Artificially Aged and Environmentally Weathered Microplastics [J].

Campanale, Claudia ;

Savino, Ilaria ;

Massarelli, Carmine ;

Uricchio, Vito Felice .

POLYMERS, 2023, 15 (04)

[5] An improved image enhancement framework based on multiple attention mechanism [J].

Chen, Qili ;

Fan, Junfang ;

Chen, Wenbai .

DISPLAYS, 2021, 70

[6] Kilohertz quasiperiodic oscillations in short gamma-ray bursts [J].

Chirenti, Cecilia ;

Dichiara, Simone ;

Lien, Amy ;

Miller, M. Coleman ;

Preece, Robert .

NATURE, 2023, 613 (7943) :253-+

[7]

Cui ZT, 2022, Arxiv, DOI [arXiv:2205.14871, 10.48550/arXiv.2205.14871]

[8] Learning scene-vectors for remote sensing image scene classification [J].

Datla, Rajeshreddy ;

Perveen, Nazil ;

Mohan, C. Krishna .

NEUROCOMPUTING, 2024, 587

[9] A weighted variational model for simultaneous reflectance and illumination estimation [J].

Fu, Xueyang ;

Zeng, Delu ;

Huang, Yue ;

Zhang, Xiao-Ping ;

Ding, Xinghao .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2782-2790

[10] Learning a Simple Low-light Image Enhancer from Paired Low-light Instances [J].

Fu, Zhenqi ;

Yang, Yan ;

Tu, Xiaotong ;

Huang, Yue ;

Ding, Xinghao ;

Ma, Kai-Kuang .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22252-22261

← 1 2 3 4 5 6 7 →