Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details

被引:7
|
作者
Hwang, Inwoo [1 ]
Kim, Hyeonwoo [1 ]
Kim, Young Min [1 ,2 ,3 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Seoul Natl Univ, Interdisciplinary Program Artificial Intelligence, Seoul, South Korea
[3] Seoul Natl Univ, INMC, Seoul, South Korea
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/CVPR52729.2023.00188
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Text2Scene, a method to automatically create realistic textures for virtual scenes composed of multiple objects. Guided by a reference image and text descriptions, our pipeline adds detailed texture on labeled 3D geometries in the room such that the generated colors respect the hierarchical structure or semantic parts that are often composed of similar materials. Instead of applying flat stylization on the entire scene at a single step, we obtain weak semantic cues from geometric segmentation, which are further clarified by assigning initial colors to segmented parts. Then we add texture details for individual objects such that their projections on image space exhibit feature embedding aligned with the embedding of the input. The decomposition makes the entire pipeline tractable to a moderate amount of computation resources and memory. As our framework utilizes the existing resources of image and text embedding, it does not require dedicated datasets with high-quality textures designed by skillful artists. To the best of our knowledge, it is the first practical and scalable approach that can create detailed and realistic textures of the desired style that maintain structural context for scenes with multiple objects.
引用
收藏
页码:1890 / 1899
页数:10
相关论文
共 50 条
  • [1] ControlNeRF: Text-Driven 3D Scene Stylization via Diffusion Model
    Chen, Jiahui
    Yang, Chuanfeng
    Li, Kaiheng
    Wu, Qingqiang
    Hong, Qingqi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 395 - 406
  • [2] SceneScape: Text-Driven Consistent Scene Generation
    Fridman, Rafail
    Abecasis, Amit
    Kasten, Yoni
    Dekel, Tali
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Text2Mesh: Text-Driven Neural Stylization for Meshes
    Michel, Oscar
    Bar-On, Roi
    Liu, Richard
    Benaim, Sagie
    Hanocka, Rana
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13482 - 13492
  • [4] RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture
    Song, Liangchen
    Cao, Liangliang
    Xu, Hongyu
    Kang, Kai
    Tang, Feng
    Yuan, Junsong
    Yang, Zhao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6898 - 6906
  • [5] Text2NeRF: Text-Driven 3D Scene Generation With Neural Radiance Fields
    Zhang, Jingbo
    Li, Xiaoyu
    Wan, Ziyu
    Wang, Can
    Liao, Jing
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (12) : 7749 - 7762
  • [6] Text2Scene: Generating Compositional Scenes from Textual Descriptions
    Tan, Fuwen
    Feng, Song
    Ordonez, Vicente
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6703 - 6712
  • [7] DreamEditor: Text-Driven 3D Scene Editing with Neural Fields
    Zhuang, Jingyu
    Wang, Chen
    Lin, Liang
    Liu, Lingjie
    Li, Guanbin
    PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
  • [8] ConIS: controllable text-driven image stylization with semantic intensity
    Yang, Gaoming
    Li, Changgeng
    Zhang, Ji
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [9] DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization
    Huang, Nisha
    Zhang, Yuxin
    Tang, Fan
    Ma, Chongyang
    Huang, Haibin
    Dong, Weiming
    Xu, Changsheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3370 - 3383
  • [10] Part-Aware Interactive Learning for Scene Graph Generation
    Tian, Hongshuo
    Xu, Ning
    Liu, An-An
    Zhang, Yongdong
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3155 - 3163