Lightweight and Fast Low-Light Image Enhancement Method Based on PoolFormer

被引:3
作者
Hu, Xin [1 ]
Wang, Jinhua [1 ]
Xu, Sunhan [1 ]
机构
[1] Beijing Union Univ, Smart City Coll, Beijing 100101, Peoples R China
关键词
key computer vision; low-light image enhancement; transformer; poolformer; lightweight;
D O I
10.1587/transinf.2023EDL8051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Images captured in low-light environments have low visibility and high noise, which will seriously affect subsequent visual tasks such as target detection and face recognition. Therefore, low-light image enhancement is of great significance in obtaining high-quality images and is a challenging problem in computer vision tasks. A low-light enhancement model, LLFormer, based on the Vision Transformer, uses axis-based multi head self-attention and a cross-layer attention fusion mechanism to reduce the complexity and achieve feature extraction. This algorithm can enhance images well. However, the calculation of the attention mechanism is complex and the number of parameters is large, which limits the application of the model in practice. In response to this problem, a lightweight module, PoolFormer, is used to replace the attention module with spatial pooling, which can increase the parallelism of the network and greatly reduce the number of model parameters. To suppress image noise and improve visual effects, a new loss function is constructed for model optimization. The experiment results show that the proposed method not only reduces the number of parameters by 49%, but also performs better in terms of image detail restoration and noise suppression compared with the baseline model. On the LOL dataset, the PSNR and SSIM were 24.098 dB and 0.8575 respectively. On the MIT-Adobe FiveK dataset, the PSNR and SSIM were 27.060 dB and 0.9490. The evaluation results on the two datasets are better than the current mainstream low-light enhancement algorithms.
引用
收藏
页码:157 / 160
页数:4
相关论文
共 9 条
  • [1] Dosovitskiy A., 2021, An image is worth 16x16 words: Transformers for image recognition at scale, P1
  • [2] SwinIR: Image Restoration Using Swin Transformer
    Liang, Jingyun
    Cao, Jiezhang
    Sun, Guolei
    Zhang, Kai
    Van Gool, Luc
    Timofte, Radu
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1833 - 1844
  • [3] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [4] Wang T, 2023, AAAI CONF ARTIF INTE, P2654
  • [5] Uformer: A General U-Shaped Transformer for Image Restoration
    Wang, Zhendong
    Cun, Xiaodong
    Bao, Jianmin
    Zhou, Wengang
    Liu, Jianzhuang
    Li, Houqiang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17662 - 17672
  • [6] MetaFormer is Actually What You Need for Vision
    Yu, Weihao
    Luo, Mi
    Zhou, Pan
    Si, Chenyang
    Zhou, Yichen
    Wang, Xinchao
    Feng, Jiashi
    Yan, Shuicheng
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10809 - 10819
  • [7] Restormer: Efficient Transformer for High-Resolution Image Restoration
    Zamir, Syed Waqas
    Arora, Aditya
    Khan, Salman
    Hayat, Munawar
    Khan, Fahad Shahbaz
    Yang, Ming-Hsuan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5718 - 5729
  • [8] The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
    Zhang, Richard
    Isola, Phillip
    Efros, Alexei A.
    Shechtman, Eli
    Wang, Oliver
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 586 - 595
  • [9] STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement
    Zhang, Zhaoyang
    Jiang, Yitong
    Jiang, Jun
    Wang, Xiaogang
    Luo, Ping
    Gu, Jinwei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4086 - 4095