SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting

被引:17
|
作者
Jampani, Varun [1 ]
Chang, Huiwen [1 ]
Sargent, Kyle [1 ]
Kar, Abhishek [1 ]
Tucker, Richard [1 ]
Krainin, Michael [1 ]
Kaeser, Dominik [1 ]
Freeman, William T. [1 ]
Salesin, David [1 ]
Curless, Brian [1 ]
Liu, Ce [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
D O I
10.1109/ICCV48922.2021.01229
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single image 3D photography enables viewers to view a still image from novel viewpoints. Recent approaches combine monocular depth networks with inpainting networks to achieve compelling results. A drawback of these techniques is the use of hard depth layering, making them unable to model intricate appearance details such as thin hair-like structures. We present SLIDE, a modular and unified system for single image 3D photography that uses a simple yet effective soft layering strategy to better preserve appearance details in novel views. In addition, we propose a novel depth-aware training strategy for our inpainting module, better suited for the 3D photography task. The resulting SLIDE approach is modular, enabling the use of other components such as segmentation and matting for improved layering. At the same time, SLIDE uses an efficient layered depth formulation that only requires a single forward pass through the component networks to produce high quality 3D photos. Extensive experimental analysis on three view-synthesis datasets, in combination with user studies on in-the-wild image collections, demonstrate superior performance of our technique in comparison to existing strong baselines while being conceptually much simpler. Project page: https://varunjampani.github.io/slide
引用
收藏
页码:12498 / 12507
页数:10
相关论文
共 50 条
  • [1] SLIDE: Single image 3D photography with soft layering and depth-aware inpainting
    Jampani, Varun
    Chang, Huiwen
    Sargent, Kyle
    Kar, Abhishek
    Tucker, Richard
    Krainin, Michael
    Kaeser, Dominik
    Freeman, William T.
    Salesin, David
    Curless, Brian
    Liu, Ce
    arXiv, 2021,
  • [2] SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting
    Jampani, Varun
    Chang, Huiwen
    Sargent, Kyle
    Kar, Abhishek
    Tucker, Richard
    Krainin, Michael
    Kaeser, Dominik
    Freeman, William T.
    Salesin, David
    Curless, Brian
    Liu, Ce
    Proceedings of the IEEE International Conference on Computer Vision, 2021, : 12498 - 12507
  • [3] Salient object segmentation based on depth-aware image layering
    Du, Huan
    Liu, Zhi
    Shi, Ran
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (09) : 12125 - 12138
  • [4] Salient object segmentation based on depth-aware image layering
    Huan Du
    Zhi Liu
    Ran Shi
    Multimedia Tools and Applications, 2019, 78 : 12125 - 12138
  • [5] Learning depth-aware decomposition for single image dehazing
    Kang, Yumeng
    Zhang, Lu
    Hu, Ping
    Liu, Yu
    Lu, Huchuan
    He, You
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [6] MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer
    Huang, Kuan-Chih
    Wu, Tsung-Han
    Su, Hung-Ting
    Hsu, Winston H.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4002 - 4011
  • [7] REINFORCED DEPTH-AWARE DEEP LEARNING FOR SINGLE IMAGE DEHAZING
    Guo, Tiantong
    Monga, Vishal
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8891 - 8895
  • [8] DEPTH-AWARE 3D VIDEO FILTERING TARGETTING MULTIVIEW VIDEO PLUS DEPTH COMPRESSION
    Aflaki, Payman
    Hannuksela, Miska M.
    Homayouni, Maryam
    Gabbouj, Moncef
    2014 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2014,
  • [9] FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction
    Stier, Noah
    Ranjan, Anurag
    Colburn, Alex
    Yan, Yajie
    Yang, Liang
    Ma, Fangchang
    Angles, Baptiste
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18377 - 18386
  • [10] DAFormer: Depth-aware 3D Object Detection Guided by Camera Model via Transformers
    Gao, Junbin
    Ruan, Hao
    Xu, Bingrong
    Zeng, Zhigang
    2022 IEEE INTERNATIONAL CONFERENCE ON CYBORG AND BIONIC SYSTEMS, CBS, 2022, : 170 - 175