End-to-End Depth-Guided Relighting Using Lightweight Deep Learning-Based Method

被引：0

作者：

Nathan, Sabari ^{[1
]}

Kansal, Priya ^{[1
]}

机构：

[1] Couger Inc, Tokyo 1500001, Japan

来源：

JOURNAL OF IMAGING | 2023年 / 9卷 / 09期

关键词：

image enhancement; image relighting; depth-guided;

D O I：

10.3390/jimaging9090175

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Image relighting, which involves modifying the lighting conditions while preserving the visual content, is fundamental to computer vision. This study introduced a bi-modal lightweight deep learning model for depth-guided relighting. The model utilizes the Res2Net Squeezed block's ability to capture long-range dependencies and to enhance feature representation for both the input image and its corresponding depth map. The proposed model adopts an encoder-decoder structure with Res2Net Squeezed blocks integrated at each stage of encoding and decoding. The model was trained and evaluated on the VIDIT dataset, which consists of 300 triplets of images. Each triplet contains the input image, its corresponding depth map, and the relit image under diverse lighting conditions, such as different illuminant angles and color temperatures. The enhanced feature representation and improved information flow within the Res2Net Squeezed blocks enable the model to handle complex lighting variations and generate realistic relit images. The experimental results demonstrated the proposed approach's effectiveness in relighting accuracy, measured by metrics such as the PSNR, SSIM, and visual quality.

引用

页数：15

共 52 条

[1] Lambertian reflectance and linear subspaces
Basri, R
Jacobs, DW
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (02) : 218 - 233
[2] ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
Ding, Bin
Long, Chengjiang
Zhang, Ling
Xiao, Chunxia
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10212 - 10221
[3] El Helou M., 2020, P EUR C COMP VIS WOR
[4] El Helou M., 2021, P IEEE CVF C COMP VI
[5] Gafton P, 2020, Arxiv, DOI arXiv:2006.07816
[6] Res2Net: A New Multi-Scale Backbone Architecture
Gao, Shang-Hua
Cheng, Ming-Ming
Zhao, Kai
Zhang, Xin-Yu
Yang, Ming-Hsuan
Torr, Philip
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) : 652 - 662
[7] Green R., 2003, P ARCH GAM DEV C SAN
[8] A New Intrinsic-Lighting Color Space for Daytime Outdoor Images
Han, Zhi
Tian, Jiandong
Qu, Liangqiong
Tang, Yandong
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (02) : 1031 - 1039
[9] Identity Mappings in Deep Residual Networks
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645
[10] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778

← 1 2 3 4 5 6 →