SketchyDepth: from Scene Sketches to RGB-D Images

被引:0
|
作者
Berardi, Gianluca [1 ]
Salti, Samuele [1 ]
Di Stefano, Luigi [1 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn DISI, Bologna, Italy
关键词
D O I
10.1109/ICCVW54120.2021.00274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sketch-based content generation is a creative and fun activity suited to casual and professional users that has many different applications. Today it is possible to generate the geometry and appearance of a single object by sketching it. Yet, only the appearance can be synthesized from a sketch of a whole scene. In this paper we propose the first method to generate both the depth map and image of a whole scene from a sketch. We demonstrate how generating geometrical information as a depth map is beneficial from a twofold perspective. On one hand, it improves the quality of the image synthesized from the sketch. On the other, it unlocks depth-enabled creative effects like Bokeh, fog, light variation, 3D photos and many others, which help enhancing the final output in a controlled way. We validate our method showing how generating depth maps directly from sketches produces better qualitative results with respect to alternative methods, i.e. running MiDaS after image generation. Finally we introduce depth sketching, a depth manipulation technique to further condition image generation without the need of additional annotation or training.
引用
收藏
页码:2414 / 2423
页数:10
相关论文
共 50 条
  • [1] Intrinsic Scene Decomposition from RGB-D images
    Hachama, Mohammed
    Ghanem, Bernard
    Wonka, Peter
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 810 - 818
  • [2] Indoor Scene Recognition from RGB-D Images by Learning Scene Bases
    Wan, Shaohua
    Hu, Changbo
    Aggarwal, J. K.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3416 - 3421
  • [3] Point Cloud Semantic Scene Completion from RGB-D Images
    Zhang, Shoulong
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3385 - 3393
  • [4] Radiometric Scene Decomposition: Scene Reflectance, Illumination, and Geometry from RGB-D Images
    Lombardi, Stephen
    Nishino, Ko
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 305 - 313
  • [5] Learning 3D Scene Synthesis from Annotated RGB-D Images
    Kermani, Z. Sadeghipour
    Liao, Z.
    Tan, P.
    Zhang, H.
    COMPUTER GRAPHICS FORUM, 2016, 35 (05) : 197 - 206
  • [6] Indoor scene understanding via monocular RGB-D images
    Chen, Yanxiang
    Pan, Daru
    Pan, Yifei
    Liu, Shengzhong
    Gu, Aihua
    Wang, Meng
    INFORMATION SCIENCES, 2015, 320 : 361 - 371
  • [7] Survey on Semantic Scene Completion Based on RGB-D Images
    Zhang K.
    An B.-Z.
    Li J.
    Yuan X.
    Zhao C.-X.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (01): : 444 - 462
  • [8] SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite
    Song, Shuran
    Lichtenberg, Samuel P.
    Xiao, Jianxiong
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 567 - 576
  • [9] Domain adaptation from RGB-D to RGB images
    Li, Xiao
    Fang, Min
    Zhang, Ju-Jie
    Wu, Jinqiao
    SIGNAL PROCESSING, 2017, 131 : 27 - 35
  • [10] ResFusion: deeply fused scene parsing network for RGB-D images
    Dai, Juting
    Tang, Xinyi
    IET COMPUTER VISION, 2018, 12 (08) : 1171 - 1178