End-to-End Depth-Guided Relighting Using Lightweight Deep Learning-Based Method

Cited by: 0
Authors
Nathan, Sabari [1]
Kansal, Priya [1]
Affiliations
[1] Couger Inc, Tokyo 1500001, Japan
Keywords
image enhancement; image relighting; depth-guided
DOI
10.3390/jimaging9090175
Chinese Library Classification number
TB8 [Photographic technology]
Discipline classification code
0804
Abstract
Image relighting, which involves modifying the lighting conditions while preserving the visual content, is fundamental to computer vision. This study introduces a bi-modal lightweight deep learning model for depth-guided relighting. The model uses the Res2Net Squeezed block's ability to capture long-range dependencies and to enhance feature representation for both the input image and its corresponding depth map. The proposed model adopts an encoder-decoder structure with Res2Net Squeezed blocks integrated at each stage of encoding and decoding. The model was trained and evaluated on the VIDIT dataset, which consists of 300 triplets of images. Each triplet contains the input image, its corresponding depth map, and the relit image under diverse lighting conditions, such as different illuminant angles and color temperatures. The enhanced feature representation and improved information flow within the Res2Net Squeezed blocks enable the model to handle complex lighting variations and generate realistic relit images. The experimental results demonstrate the effectiveness of the proposed approach in relighting accuracy, as measured by PSNR and SSIM, and in visual quality.
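The abstract names PSNR as one of the evaluation metrics. As a rough illustration only, the standard PSNR definition can be sketched as below. The function name `psnr`, the flat-list pixel representation, and the 8-bit peak value of 255 are assumptions made for this example; the record does not describe the authors' actual evaluation code or protocol.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio (dB) between two equally sized pixel sequences.

    Hypothetical helper for illustration; assumes flat sequences of pixel
    intensities and an 8-bit peak value unless max_value is overridden.
    """
    if len(reference) != len(estimate) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel positions.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    ground_truth = [100, 100, 100, 100]  # toy "relit" target pixels
    prediction = [110, 90, 110, 90]      # toy model output
    print(f"PSNR: {psnr(ground_truth, prediction):.2f} dB")
```

A higher PSNR means the predicted relit image is closer to the ground-truth relit image in the mean-squared-error sense. SSIM, the other metric named in the abstract, instead compares local structure and is not covered by this sketch.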
Pages: 15