Outdoor weather conditions such as haze, fog, sand dust, and low light significantly degrade image quality, causing color distortion, low contrast, and poor visibility. Despite the importance of restoring such degraded images, haze removal, sand dust image enhancement, and related restoration tasks remain challenging and relatively underexplored. Encoder-Decoder-based neural networks have yielded noticeable improvements in image restoration, yet their capacity to further improve image quality remains constrained. Recent advances in Vision Transformers and self-attention mechanisms have achieved remarkable success across computer vision tasks; however, directly applying Vision Transformers to image restoration raises serious challenges, notably reconciling local and global feature representations. This research addresses these limitations by restoring both sand dust- and haze-degraded images to a more natural and visually realistic appearance, with enhanced visibility, balanced colors, and refined details. We propose a novel hybrid architecture that combines depth-wise local feature extraction using lightweight Encoders with global feature extraction via Vision Transformers. These features are fused through an attention fusion mechanism, ensuring seamless interaction between local and global feature representations, and a single lightweight Decoder then reconstructs a high-quality restored image that closely matches the ground truth. The proposed method effectively reduces the feature inconsistency between Vision Transformer-based global features and lightweight encoder-based local features, achieving state-of-the-art performance on both synthetic and real-world sand dust- and haze-degraded images. Extensive evaluations, covering degraded images with color casts ranging from mild to severe, show that the proposed method outperforms previous conventional and deep learning-based restoration methods both qualitatively and quantitatively, delivering improved visibility, realistic textures, and superior image quality. In addition, we compare training and testing times and introduce a novel Energy Efficiency Index (EEI); the proposed method also surpasses prior methods on these efficiency measures.
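To make the described hybrid design concrete, the following is a minimal PyTorch sketch, not the paper's exact configuration: the channel width, patch size, attention-head count, and the specific channel-gated fusion are illustrative assumptions. It shows the three stated components: a depth-wise local encoder, a ViT-style global branch, and an attention fusion feeding a single lightweight decoder.

```python
import torch
import torch.nn as nn

class LocalEncoder(nn.Module):
    """Lightweight local branch: depth-wise separable convolutions."""
    def __init__(self, ch=32):
        super().__init__()
        self.stem = nn.Conv2d(3, ch, 3, padding=1)
        self.dw = nn.Conv2d(ch, ch, 3, padding=1, groups=ch)  # depth-wise
        self.pw = nn.Conv2d(ch, ch, 1)                        # point-wise
        self.act = nn.GELU()

    def forward(self, x):
        x = self.act(self.stem(x))
        return self.act(self.pw(self.dw(x)))

class GlobalEncoder(nn.Module):
    """Global branch: ViT-style self-attention over patch tokens."""
    def __init__(self, ch=32, patch=8, heads=4):
        super().__init__()
        self.patch = nn.Conv2d(3, ch, patch, stride=patch)    # patch embedding
        self.attn = nn.MultiheadAttention(ch, heads, batch_first=True)
        self.norm = nn.LayerNorm(ch)
        self.up = nn.Upsample(scale_factor=patch, mode="bilinear",
                              align_corners=False)

    def forward(self, x):
        t = self.patch(x)                                     # B, C, H/p, W/p
        b, c, h, w = t.shape
        seq = t.flatten(2).transpose(1, 2)                    # B, N, C tokens
        seq = self.norm(seq + self.attn(seq, seq, seq)[0])    # self-attention
        t = seq.transpose(1, 2).reshape(b, c, h, w)
        return self.up(t)                                     # back to full res

class AttentionFusion(nn.Module):
    """Channel-attention fusion of local and global features (illustrative)."""
    def __init__(self, ch=32):
        super().__init__()
        self.gate = nn.Sequential(nn.AdaptiveAvgPool2d(1),
                                  nn.Conv2d(2 * ch, 2 * ch, 1),
                                  nn.Sigmoid())
        self.mix = nn.Conv2d(2 * ch, ch, 1)

    def forward(self, f_local, f_global):
        f = torch.cat([f_local, f_global], dim=1)
        return self.mix(f * self.gate(f))                     # gated fusion

class HybridRestorer(nn.Module):
    """Local + global encoders, attention fusion, one lightweight decoder."""
    def __init__(self, ch=32):
        super().__init__()
        self.local = LocalEncoder(ch)
        self.glob = GlobalEncoder(ch)
        self.fuse = AttentionFusion(ch)
        self.decoder = nn.Sequential(nn.Conv2d(ch, ch, 3, padding=1),
                                     nn.GELU(),
                                     nn.Conv2d(ch, 3, 3, padding=1))

    def forward(self, x):
        out = self.decoder(self.fuse(self.local(x), self.glob(x)))
        return torch.clamp(x + out, 0.0, 1.0)                 # residual restoration

degraded = torch.rand(1, 3, 256, 256)                         # dummy degraded input
restored = HybridRestorer()(degraded)
print(restored.shape)                                         # torch.Size([1, 3, 256, 256])
```

The depth-wise separable convolutions keep the local branch lightweight, while self-attention over patch tokens captures image-wide context such as a global color cast; the fusion gate then reweights the concatenated features before the shared decoder, which is one plausible way to reduce the local-global feature inconsistency the abstract highlights.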