Multi-Scale Feature Fusion with Attention Mechanism Based on CGAN Network for Infrared Image Colorization

被引:4
|
作者
Ai, Yibo [1 ,2 ]
Liu, Xiaoxi [1 ]
Zhai, Haoyang [1 ]
Li, Jie [3 ]
Liu, Shuangli [4 ]
An, Huilong [3 ]
Zhang, Weidong [1 ]
机构
[1] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing 100083, Peoples R China
[2] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519082, Peoples R China
[3] HBIS Mat Inst, 385 South Sports St, Shijiazhuang 050023, Peoples R China
[4] Hesteel Grp Tangsteel Co, 9 Binhe Rd, Tangshan 063000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 08期
关键词
attention mechanism module; Generative Adversarial Network (GAN); image colorization; infrared images; multi-scale feature fusion;
D O I
10.3390/app13084686
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a colorization algorithm for infrared images based on a Conditional Generative Adversarial Network (CGAN) with multi-scale feature fusion and attention mechanisms, aiming to address issues such as color leakage and unclear semantics in existing infrared image coloring methods. Firstly, we improved the generator of the CGAN network by incorporating a multi-scale feature extraction module into the U-Net architecture to fuse features from different scales, thereby enhancing the network's ability to extract features and improving its semantic understanding, which improves the problems of color leakage and blurriness during colorization. Secondly, we enhanced the discriminator of the CGAN network by introducing an attention mechanism module, which includes channel attention and spatial attention modules, to better distinguish between real and generated images, thereby improving the semantic clarity of the resulting infrared images. Finally, we jointly improved the generator and discriminator of the CGAN network by incorporating both the multi-scale feature fusion module and attention mechanism module. We tested our method on a dataset containing both infrared and near-infrared images, which retains more detailed features while also preserving the advantages of existing infrared images. The experimental results show that our proposed method achieved a peak signal-to-noise ratio (PSNR) of 16.5342 dB and a structural similarity index (SSIM) of 0.6385 on an RGB-NIR (Red, Green, Blue-Near Infrared) testing dataset, representing a 5% and 13% improvement over the original CGAN network, respectively. These results demonstrate the effectiveness of our proposed algorithm in addressing the issues of color leakage and unclear semantics in the original network. The proposed method in this paper is not only applicable to infrared image colorization but can also be widely applied to the colorization of remote sensing and CT images.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] STACKED MULTI-SCALE ATTENTION NETWORK FOR IMAGE COLORIZATION
    Jiang, Bin
    Xu, Fangqiang
    Xia, Jun
    Yang, Chao
    Huang, Wei
    Huang, Yun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2225 - 2229
  • [2] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    Infrared Physics and Technology, 2022, 125
  • [3] Multi-scale unsupervised network for infrared and visible image fusion based on joint attention mechanism
    Xu, Dongdong
    Zhang, Ning
    Zhang, Yuxi
    Li, Zheng
    Zhao, Zhikang
    Wang, Yongcheng
    INFRARED PHYSICS & TECHNOLOGY, 2022, 125
  • [4] MFANet: Multi-scale feature fusion network with attention mechanism
    Wang, Gaihua
    Gan, Xin
    Cao, Qingcheng
    Zhai, Qianyu
    VISUAL COMPUTER, 2023, 39 (07): : 2969 - 2980
  • [5] MFANet: Multi-scale feature fusion network with attention mechanism
    Gaihua Wang
    Xin Gan
    Qingcheng Cao
    Qianyu Zhai
    The Visual Computer, 2023, 39 : 2969 - 2980
  • [6] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [7] An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention
    Xu, Jing
    Liu, Zhenjin
    Fang, Ming
    IET IMAGE PROCESSING, 2024, 18 (08) : 2114 - 2125
  • [8] Multi-Scale Feature Fusion Network with Attention for Single Image Dehazing
    Hu, Bin
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (04) : 608 - 615
  • [9] Multi-Scale Feature Fusion Network with Attention for Single Image Dehazing
    Pattern Recognition and Image Analysis, 2021, 31 : 608 - 615
  • [10] HAMNet: hyperspectral image classification based on hybrid neural network with attention mechanism and multi-scale feature fusion
    Shen, Jinyue
    Zheng, Zhouzhou
    Sun, Yingwei
    Zhao, Mengmeng
    Chang, Yankang
    Shao, Yuyi
    Zhang, Yan
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (11) : 4233 - 4258