Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details

被引：7

作者：

Hwang, Inwoo ^{[1
]}

Kim, Hyeonwoo ^{[1
]}

Kim, Young Min ^{[1
,2
,3
]}

机构：

[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea

[2] Seoul Natl Univ, Interdisciplinary Program Artificial Intelligence, Seoul, South Korea

[3] Seoul Natl Univ, INMC, Seoul, South Korea

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/CVPR52729.2023.00188

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose Text2Scene, a method to automatically create realistic textures for virtual scenes composed of multiple objects. Guided by a reference image and text descriptions, our pipeline adds detailed texture on labeled 3D geometries in the room such that the generated colors respect the hierarchical structure or semantic parts that are often composed of similar materials. Instead of applying flat stylization on the entire scene at a single step, we obtain weak semantic cues from geometric segmentation, which are further clarified by assigning initial colors to segmented parts. Then we add texture details for individual objects such that their projections on image space exhibit feature embedding aligned with the embedding of the input. The decomposition makes the entire pipeline tractable to a moderate amount of computation resources and memory. As our framework utilizes the existing resources of image and text embedding, it does not require dedicated datasets with high-quality textures designed by skillful artists. To the best of our knowledge, it is the first practical and scalable approach that can create detailed and realistic textures of the desired style that maintain structural context for scenes with multiple objects.

引用

页码：1890 / 1899

页数：10

共 50 条

[1] ControlNeRF: Text-Driven 3D Scene Stylization via Diffusion Model
Chen, Jiahui
Yang, Chuanfeng
Li, Kaiheng
Wu, Qingqiang
Hong, Qingqi
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 395 - 406
[2] SceneScape: Text-Driven Consistent Scene Generation
Fridman, Rafail
Abecasis, Amit
Kasten, Yoni
Dekel, Tali
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[3] Text2Mesh: Text-Driven Neural Stylization for Meshes
Michel, Oscar
Bar-On, Roi
Liu, Richard
Benaim, Sagie
Hanocka, Rana
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13482 - 13492
[4] RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture
Song, Liangchen
Cao, Liangliang
Xu, Hongyu
Kang, Kai
Tang, Feng
Yuan, Junsong
Yang, Zhao
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6898 - 6906
[5] Text2NeRF: Text-Driven 3D Scene Generation With Neural Radiance Fields
Zhang, Jingbo
Li, Xiaoyu
Wan, Ziyu
Wang, Can
Liao, Jing
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (12) : 7749 - 7762
[6] Text2Scene: Generating Compositional Scenes from Textual Descriptions
Tan, Fuwen
Feng, Song
Ordonez, Vicente
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6703 - 6712
[7] DreamEditor: Text-Driven 3D Scene Editing with Neural Fields
Zhuang, Jingyu
Wang, Chen
Lin, Liang
Liu, Lingjie
Li, Guanbin
PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
[8] ConIS: controllable text-driven image stylization with semantic intensity
Yang, Gaoming
Li, Changgeng
Zhang, Ji
MULTIMEDIA SYSTEMS, 2024, 30 (04)
[9] DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization
Huang, Nisha
Zhang, Yuxin
Tang, Fan
Ma, Chongyang
Huang, Haibin
Dong, Weiming
Xu, Changsheng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3370 - 3383
[10] Part-Aware Interactive Learning for Scene Graph Generation
Tian, Hongshuo
Xu, Ning
Liu, An-An
Zhang, Yongdong
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3155 - 3163

← 1 2 3 4 5 →