DDGAN: Dense Residual Module and Dual-stream Attention-Guided Generative Adversarial Network for colorizing near-infrared images

被引:7
作者
Chen, Yu [1 ]
Zhan, Weida [1 ]
Jiang, Yichun [1 ]
Zhu, Depeng [1 ]
Guo, Renzhong [1 ]
Xu, Xiaoyu [1 ]
机构
[1] Changchun Univ Sci, Technol Natl Demonstrat Ctr Expt Elect, Changchun 130022, Jilin, Peoples R China
关键词
Deep learning; Generative adversarial network; Near-infrared image colorization; Attention module;
D O I
10.1016/j.infrared.2023.104822
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
Transforming near-infrared(NIR) images into realistic RGB images is a challenging task. Recently, with the development of deep learning, the colorization of NIR images has been significantly improved. However, there are still problems with distorted textures and blurred details. The main reason is that NIR images require color prediction and gray-scale detail recovery to be colorized. Moreover, NIR images contain limited detail and lack color information, which poses a serious challenge for feature extraction. To address the problems, we propose the Dense Residual Module and Dual-stream Attention-Guided Generative Adversarial Network (DDGAN) in this paper. The Dense Residual Module (DRM) improves the network feature extraction capability in the form of dense residuals and increases the network depth to improve the adaptability of the network. The Dual-stream Attention Module (DAM) further improves the quality of colorized images by enhancing important features and suppressing unnecessary features to focus on essential visual features. We propose a composite loss function consisting of content loss, adversarial loss, perceptual loss, synthesized loss, and total variation loss, which improves the quality of colorful images in terms of both edge structure and visual perception. We consider the NIR images for the visible image conversion task to evaluate the efficiency and performance of the proposed model. The proposed DDGAN outperforms most existing methods in terms of efficiency and quality of the generated images on the RGB-NIR dataset and the OMSIV dataset. Compared to state-of-the-art methods, the proposed DDGAN shows promising results with significant improvements in PSNR, SSIM, MSE, and NRMSE. Extensive experimental data show that the proposed DDGAN can produce state-of-the-art colorized NIR images in objective metrics and subjective quality.
引用
收藏
页数:13
相关论文
共 56 条
[21]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[22]   ThermalGAN: Multimodal Color-to-Thermal Image Translation for Person Re-identification in Multispectral Dataset [J].
Kniaz, Vladimir V. ;
Knyaz, Vladimir A. ;
Hladuvka, Jiri ;
Kropatsch, Walter G. ;
Mizginov, Vladimir .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 :606-624
[23]   Thermal infrared colorization via conditional generative adversarial network [J].
Kuang, Xiaodong ;
Zhu, Jianfei ;
Sui, Xiubao ;
Liu, Yuan ;
Liu, Chengwei ;
Chen, Qian ;
Gu, Guohua .
INFRARED PHYSICS & TECHNOLOGY, 2020, 107
[24]   Unsupervised Generative Adversarial Networks with Cross-model Weight Transfer Mechanism for Image-to-image Translation [J].
Lai, Xuguang ;
Bai, Xiuxiu ;
Hao, Yongqiang .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :1814-1822
[25]   Learning Representations for Automatic Colorization [J].
Larsson, Gustav ;
Maire, Michael ;
Shakhnarovich, Gregory .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :577-593
[26]   Colorization using optimization [J].
Levin, A ;
Lischinski, D ;
Weiss, Y .
ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03) :689-694
[27]   I2V-GAN: Unpaired Infrared-to-Visible Video Translation [J].
Li, Shuang ;
Han, Bingfeng ;
Yu, Zhenjie ;
Liu, Chi Harold ;
Chen, Kai ;
Wang, Shuigen .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :3061-3069
[28]   Dynamic synopsis and storage algorithm based on infrared surveillance video [J].
Li, Xuemei ;
Qiu, Shi ;
Song, Yang .
INFRARED PHYSICS & TECHNOLOGY, 2022, 124
[29]   An improved DualGAN for near-infrared image colorization [J].
Liang, Wei ;
Ding, Derui ;
Wei, Guoliang .
INFRARED PHYSICS & TECHNOLOGY, 2021, 116
[30]  
Limmer M, 2016, 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), P61, DOI [10.1109/ICMLA.2016.0019, 10.1109/ICMLA.2016.114]