Swin transformer and ResNet based deep networks for low-light image enhancement

Cited by: 8
Authors
Xu, Lintao [1, 2]
Hu, Changhui [1, 2]
Zhang, Bo [1, 2]
Wu, Fei [1, 2]
Cai, Ziyun [1, 2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Low-light image enhancement; Generative adversarial network; Swin transformer; Random paired learning; Quality assessment; Retinex;
DOI
10.1007/s11042-023-16650-w
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Low-light image enhancement is a long-standing low-level vision problem that aims to improve the visual quality of images captured in low-illumination environments. Most current low-light image enhancement algorithms are built on convolutional neural networks (CNNs), whose limited receptive field prevents them from establishing long-range context interactions. In recent years, the Transformer has received increasing attention in computer vision because of its global attention. In this paper, we design the Swin Transformer and ResNet-based Generative Adversarial Network (STRN) for low-light image enhancement, combining the advantages of ResNet and the Swin Transformer. STRN consists of a U-shaped generator and multiscale discriminators. The generator is composed of a shallow feature extraction module, a deep feature extraction module, and an image reconstruction module. To compute both global and local attention, we alternate Swin Transformer blocks and ResNet blocks in the deep feature extraction module. A self perceptual loss and a spatial consistency loss constrain the random paired training of STRN. Experimental results on benchmark datasets and real-world low-light images demonstrate that the proposed STRN achieves state-of-the-art performance on low-light image enhancement tasks in terms of visual quality and evaluation metrics.
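The alternation of attention blocks and residual blocks described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a single-channel feature map, identity query/key/value projections, a 3x3 mean filter standing in for a learned convolution, and non-overlapping windows without the shifted-window step that real Swin Transformer blocks use. The function names `window_attention`, `residual_block`, and `deep_features` are hypothetical.

```python
import math


def window_attention(x, win):
    """Self-attention inside non-overlapping win x win windows.

    Assumption: single channel, identity Q/K/V projections, skip connection.
    The height and width of x must be divisible by win.
    """
    h, w = len(x), len(x[0])
    out = [row[:] for row in x]
    for top in range(0, h, win):
        for left in range(0, w, win):
            cells = [(i, j)
                     for i in range(top, top + win)
                     for j in range(left, left + win)]
            vals = [x[i][j] for i, j in cells]
            for (i, j), q in zip(cells, vals):
                # Scaled-down attention: score is the product of query and key.
                scores = [q * k for k in vals]
                m = max(scores)  # subtract the max for a stable softmax
                weights = [math.exp(s - m) for s in scores]
                total = sum(weights)
                attended = sum(wt * v for wt, v in zip(weights, vals)) / total
                out[i][j] = x[i][j] + attended  # skip connection
    return out


def residual_block(x):
    """Local 3x3 mean filter (zero padding) + ReLU, added back to the input.

    Assumption: the mean filter is a stand-in for a learned convolution.
    """
    h, w = len(x), len(x[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii, jj = i + di, j + dj
                    if 0 <= ii < h and 0 <= jj < w:
                        acc += x[ii][jj]
            out[i][j] = x[i][j] + max(acc / 9.0, 0.0)
    return out


def deep_features(x, depth=2, win=2):
    """Alternate an attention block and a residual block `depth` times."""
    for _ in range(depth):
        x = window_attention(x, win)
        x = residual_block(x)
    return x


# Toy 4x4 feature map standing in for shallow features.
features = [[0.1 * (i + j) for j in range(4)] for i in range(4)]
result = deep_features(features)
```

The attention step mixes every position within a window, while the residual step mixes only 3x3 neighbourhoods that cross window borders, which is one way to see why interleaving the two lets information spread beyond a single window.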
Pages: 26621-26642
Number of pages: 22