The Potential of Diffusion-Based Near-Infrared Image Colorization

Cited by: 1

Authors:

Borstelmann, Ayk [1]

Haucke, Timm [1,2]

Steinhage, Volker [1]

Affiliations:

[1] Univ Bonn, Inst Comp Sci 4, Friedrich Hirzebruch Allee 8, D-53115 Bonn, Germany

[2] MIT, Comp Sci & Artificial Intelligence Lab, 32 Vassar St, Cambridge, MA 02139 USA

Source:

SENSORS | 2024 / Vol. 24 / Issue 05

Keywords:

near-infrared; diffusion models; camera trapping; unpaired dataset; neural networks; machine learning;

DOI:

10.3390/s24051565

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302 ; 081704 ;

Abstract:

Camera traps, an invaluable tool for biodiversity monitoring, capture wildlife activities day and night. In low-light conditions, near-infrared (NIR) imaging is commonly employed to capture images without disturbing animals. However, the reflection properties of NIR light differ from those of visible light in terms of chrominance and luminance, creating a notable gap in human perception. Thus, the objective is to enrich near-infrared images with colors, thereby bridging this domain gap. Conventional colorization techniques are ineffective due to the difference between NIR and visible light. Moreover, regular supervised learning methods cannot be applied because paired training data are rare. Solutions to such unpaired image-to-image translation problems currently commonly involve generative adversarial networks (GANs), but recently, diffusion models gained attention for their superior performance in various tasks. In response to this, we present a novel framework utilizing diffusion models for the colorization of NIR images. This framework allows efficient implementation of various methods for colorizing NIR images. We show NIR colorization is primarily controlled by the translation of the near-infrared intensities to those of visible light. The experimental evaluation of three implementations with increasing complexity shows that even a simple implementation inspired by visible-near-infrared (VIS-NIR) fusion rivals GANs. Moreover, we show that the third implementation is capable of outperforming GANs. With our study, we introduce an intersection field joining the research areas of diffusion models, NIR colorization, and VIS-NIR fusion.

Pages: 21
