Frequency-Aware Axial-ShiftedNet in Generative Adversarial Networks for Visible-to-Infrared Image Translation

被引:0
|
作者
Lin, Hsi-Ju [1 ]
Cheng, Wei-Yuan [2 ]
Chen, Duan-Yu [1 ]
机构
[1] Yuan Ze Univ, Dept Elect Engn, Zhongli City 320, Taiwan
[2] Chunghwa Telecom Labs, Taoyuan 30010, Taiwan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Generators; Feature extraction; Wavelet transforms; Training; Decoding; Computational modeling; Wavelet analysis; Generative adversarial networks; Frequency-domain analysis; Convolutional neural networks; Infrared imaging; Image processing; Infrared image; generative adversarial model; wavelet transform; image translation;
D O I
10.1109/ACCESS.2024.3478356
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Infrared imagery is indispensable for capturing temperature data by detecting infrared radiation, particularly in challenging environments characterized by low-light conditions where visual perception is compromised. As a result, there has been considerable interest in the conversion of visible images into their infrared counterparts. In this research, we present the Freq-ShiftedNet model, which employs an adversarial generative network approach for training. By harnessing the power of the Haar wavelet transform, we adeptly preserve frequency information, directing low-frequency features to the Decoder and high-frequency features to the Encoder. Analysis of the KAIST dataset demonstrates that our model outperforms InfraGAN, achieving a Structural Similarity (SSIM) score of 0.825, marking a 5.4% improvement, and a Learned Perceptual Image Patch Similarity (LPIPS) score of 0.228, indicating a 41.3% decrease. Similarly, using the VEDAI dataset, Freq-ShiftedNet surpasses InfraGAN with an SSIM score of 0.938, representing a 6.6% improvement. These results highlight the effectiveness of our proposed generator, the successful integration of wavelet features into the Freq-ShiftedNet model, and its suitability for real-world applications.
引用
收藏
页码:151432 / 151443
页数:12
相关论文
共 50 条
  • [1] Visible-to-infrared Image Translation Based on an Improved Conditional Generative Adversarial Nets
    Ma Decao
    Xian Yong
    Su Juan
    Li Shaopeng
    Li Bing
    ACTA PHOTONICA SINICA, 2023, 52 (04)
  • [2] TransImg: A Translation Algorithm of Visible-to-Infrared Image Based on Generative Adversarial Network
    Han, Shuo
    Mo, Bo
    Xu, Junwei
    Sun, Shizun
    Zhao, Jie
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [3] StawGAN: Structural-Aware Generative Adversarial Networks for Infrared Image Translation
    Sigillo, Luigi
    Grassucci, Eleonora
    Comminiello, Danilo
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [4] FCLFusion: A frequency-aware and collaborative learning for infrared and visible image fusion
    Wang, Chengchao
    Pu, Yuanyuan
    Zhao, Zhengpeng
    Nie, Rencan
    Cao, Jinde
    Xu, Dan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [5] Visible-to-Infrared Image Translation for Matching Tasks
    Ma, Decao
    Li, Shaopeng
    Su, Juan
    Xian, Yong
    Zhang, Tao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 18199 - 18213
  • [6] Discriminator guided visible-to-infrared image translation
    Ma, Decao
    Su, Juan
    Xian, Yong
    Li, Shaopeng
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
  • [7] Visible-to-infrared image translation based on an improved CGAN
    Ma, Decao
    Xian, Yong
    Li, Bing
    Li, Shaopeng
    Zhang, Daqiao
    VISUAL COMPUTER, 2024, 40 (02): : 1289 - 1298
  • [8] SFA-GAN: structure–frequency-aware generative adversarial network for underwater image enhancement
    Yinghui Zhang
    Tingshuai Liu
    Bo Zhao
    Fengxiang Ge
    Signal, Image and Video Processing, 2023, 17 : 3647 - 3655
  • [9] Thermal to Visible Facial Image Translation Using Generative Adversarial Networks
    Wang, Zhongling
    Chen, Zhenzhong
    Wu, Feng
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (08) : 1161 - 1165
  • [10] Visible-to-infrared image translation based on an improved CGAN
    Decao Ma
    Yong Xian
    Bing Li
    Shaopeng Li
    Daqiao Zhang
    The Visual Computer, 2024, 40 (2) : 1289 - 1298