Osmosis: RGBD Diffusion Prior for Underwater Image Restoration

被引：0

作者：

Bar Nathan, Opher ^{[1
]}

Levy, Deborah ^{[1
]}

Treibitz, Tali ^{[1
]}

Rosenbaum, Dan ^{[2
]}

机构：

[1] Charney Sch Marine Sci, Hatter Dept Marine Technol, Haifa, Israel

[2] Univ Haifa, Dept Comp Sci, Haifa, Israel

来源：

COMPUTER VISION - ECCV 2024, PT LXII | 2025年 / 15120卷

基金：

以色列科学基金会;

关键词：

Diffusion Models; Physics-Based Computer Vision; Underwater Image Restoration; ENHANCEMENT;

D O I：

10.1007/978-3-031-73033-7_17

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Underwater image restoration is a challenging task because of water effects that increase dramatically with distance. This is worsened by lack of ground truth data of clean scenes without water. Diffusion priors have emerged as strong image restoration priors. However, they are often trained with a dataset of the desired restored output, which is not available in our case. We also observe that using only color data is insufficient, and therefore augment the prior with a depth channel. We train an unconditional diffusion model prior on the joint space of color and depth, using standard RGBD datasets of natural outdoor scenes in air. Using this prior together with a novel guidance method based on the underwater image formation model, we generate posterior samples of clean images, removing the water effects. Even though our prior did not see any underwater images during training, our method outperforms state-of-the-art baselines for image restoration on very challenging scenes. Our code, models and data are available on the project's website.

引用

页码：302 / 319

页数：18

共 61 条

[41] Generalization of the Dark Channel Prior for Single Image Restoration [J].

Peng, Yan-Tsung ;

Cao, Keming ;

Cosman, Pamela C. .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) :2856-2868

[42] Underwater Image Restoration Based on Image Blurriness and Light [J].

Peng, Yan-Tsung ;

Cosman, Pamela C. .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) :1579-1594

[43] U-Net: Convolutional Networks for Biomedical Image Segmentation [J].

Ronneberger, Olaf ;

Fischer, Philipp ;

Brox, Thomas .

MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241

[44] DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation [J].

Ruiz, Nataniel ;

Li, Yuanzhen ;

Jampani, Varun ;

Pritch, Yael ;

Rubinstein, Michael ;

Aberman, Kfir .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22500-22510

[45]

Saxena S., 2023, arXiv

[46]

Shen YY, 2024, Arxiv, DOI [arXiv:2301.09430, 10.48550/arXiv.2301.09430]

[47] Indoor Segmentation and Support Inference from RGBD Images [J].

Silberman, Nathan ;

Hoiem, Derek ;

Kohli, Pushmeet ;

Fergus, Rob .

COMPUTER VISION - ECCV 2012, PT V, 2012, 7576 :746-760

[48]

Sohl-Dickstein J, 2015, PR MACH LEARN RES, V37, P2256

[49]

Sohn K, 2023, Arxiv, DOI arXiv:2306.00983

[50]

Song Jiaming, 2023, INT C LEARN REPR

← 1 2 3 4 5 6 7 →