Multi-Scale Feature Fusion with Attention Mechanism Based on CGAN Network for Infrared Image Colorization

被引:4
|
作者
Ai, Yibo [1 ,2 ]
Liu, Xiaoxi [1 ]
Zhai, Haoyang [1 ]
Li, Jie [3 ]
Liu, Shuangli [4 ]
An, Huilong [3 ]
Zhang, Weidong [1 ]
机构
[1] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing 100083, Peoples R China
[2] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519082, Peoples R China
[3] HBIS Mat Inst, 385 South Sports St, Shijiazhuang 050023, Peoples R China
[4] Hesteel Grp Tangsteel Co, 9 Binhe Rd, Tangshan 063000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 08期
关键词
attention mechanism module; Generative Adversarial Network (GAN); image colorization; infrared images; multi-scale feature fusion;
D O I
10.3390/app13084686
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a colorization algorithm for infrared images based on a Conditional Generative Adversarial Network (CGAN) with multi-scale feature fusion and attention mechanisms, aiming to address issues such as color leakage and unclear semantics in existing infrared image coloring methods. Firstly, we improved the generator of the CGAN network by incorporating a multi-scale feature extraction module into the U-Net architecture to fuse features from different scales, thereby enhancing the network's ability to extract features and improving its semantic understanding, which improves the problems of color leakage and blurriness during colorization. Secondly, we enhanced the discriminator of the CGAN network by introducing an attention mechanism module, which includes channel attention and spatial attention modules, to better distinguish between real and generated images, thereby improving the semantic clarity of the resulting infrared images. Finally, we jointly improved the generator and discriminator of the CGAN network by incorporating both the multi-scale feature fusion module and attention mechanism module. We tested our method on a dataset containing both infrared and near-infrared images, which retains more detailed features while also preserving the advantages of existing infrared images. The experimental results show that our proposed method achieved a peak signal-to-noise ratio (PSNR) of 16.5342 dB and a structural similarity index (SSIM) of 0.6385 on an RGB-NIR (Red, Green, Blue-Near Infrared) testing dataset, representing a 5% and 13% improvement over the original CGAN network, respectively. These results demonstrate the effectiveness of our proposed algorithm in addressing the issues of color leakage and unclear semantics in the original network. The proposed method in this paper is not only applicable to infrared image colorization but can also be widely applied to the colorization of remote sensing and CT images.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Research on Image Segmentation Method Based on Multi-Scale Feature Fusion and Dual Attention
    Wang, Zhihong
    Wang, Chaoying
    Li, Jianxin
    Wu, Tianxiang
    Li, Jiajun
    Huang, Hongxing
    Jiang, Lai
    Journal of Computers (Taiwan), 2024, 35 (06) : 45 - 54
  • [42] Multi-Scale Feature Attention Fusion for Image Splicing Forgery Detection
    Liang, Enji
    Zhang, Kuiyuan
    Hua, Zhongyun
    Jia, Xiaohua
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (01)
  • [43] Image Super-Resolution Based on Residual Attention and Multi-Scale Feature Fusion
    Kou, Qiqi
    Zhao, Jiamin
    Cheng, Deqiang
    Su, Zhen
    Zhu, Xingguang
    IEEE ACCESS, 2023, 11 : 59530 - 59541
  • [44] Dynamic graph convolutional network for assembly behavior recognition based on attention mechanism and multi-scale feature fusion
    Chen, Chengjun
    Zhao, Xicong
    Wang, Jinlei
    Li, Dongnian
    Guan, Yuanlin
    Hong, Jun
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [45] Dynamic graph convolutional network for assembly behavior recognition based on attention mechanism and multi-scale feature fusion
    Chengjun Chen
    Xicong Zhao
    Jinlei Wang
    Dongnian Li
    Yuanlin Guan
    Jun Hong
    Scientific Reports, 12
  • [46] MSAt-GAN: a generative adversarial network based on multi-scale and deep attention mechanism for infrared and visible light image fusion
    Li, Junwu
    Li, Binhua
    Jiang, Yaoxi
    Cai, Weiwei
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 4753 - 4781
  • [47] MSAt-GAN: a generative adversarial network based on multi-scale and deep attention mechanism for infrared and visible light image fusion
    Junwu Li
    Binhua Li
    Yaoxi Jiang
    Weiwei Cai
    Complex & Intelligent Systems, 2022, 8 : 4753 - 4781
  • [48] Tomato leaf disease detection based on attention mechanism and multi-scale feature fusion
    Wang, Yong
    Zhang, Panxing
    Tian, Shuang
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [49] Adaptive feature fusion with attention mechanism for multi-scale target detection
    Moran Ju
    Jiangning Luo
    Zhongbo Wang
    Haibo Luo
    Neural Computing and Applications, 2021, 33 : 2769 - 2781
  • [50] Adaptive feature fusion with attention mechanism for multi-scale target detection
    Ju, Moran
    Luo, Jiangning
    Wang, Zhongbo
    Luo, Haibo
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07): : 2769 - 2781