ControlNeRF: Text-Driven 3D Scene Stylization via Diffusion Model

被引：0

作者：

Chen, Jiahui ^{[1
]}

Yang, Chuanfeng ^{[1
]}

Li, Kaiheng ^{[1
]}

Wu, Qingqiang ^{[1
]}

Hong, Qingqi ^{[1
]}

机构：

[1] Xiamen Univ, Dept Digital Media Technol, Xiamen, Peoples R China

来源：

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II | 2024年 / 15017卷

关键词：

Stylization; Neural Radiance Fields; Diffusion Model; View Synthesis;

D O I：

10.1007/978-3-031-72335-3_27

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D scene stylization aims to generate artistically rendered images from various viewpoints within a 3D space while ensuring style consistency regardless of the viewing angle. Traditional 2D methods usually used in this field struggle with maintaining this consistency when applied to 3D environments. To address this issue, we propose a novel approach named ControlNeRF, which employs a customized conditional diffusion model, ControlNet, and introduces latent variables, obtaining a stylized appearance throughout the scene solely driven by text. Specifically, this text-driven approach effectively overcomes the inconveniences associated with using images as style cues, and it not only achieves a high degree of stylistic consistency across various viewpoints but also produces high-quality images. We have conducted rigorous testing on ControlNeRF with diverse styles, which has confirmed these outcomes. Our approach not only advances the field of 3D scene stylization but also opens new possibilities for artistic expression and digital imaging.

引用

页码：395 / 406

页数：12

共 50 条

[41] Diff3DETR: Agent-Based Diffusion Model for Semi-supervised 3D Object Detection
Deng, Jiacheng
Lu, Jiahao
Zhang, Tianzhu
COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 57 - 73
[42] Nonlocal 3D diffusion model of writing thick diffraction gratings in polymer-dispersed liquid crystals
A. L. Aslanyan
A. V. Galstyan
R. S. Hakobyan
Journal of Contemporary Physics (Armenian Academy of Sciences), 2008, 43 : 168 - 172
[43] Nonlocal 3D Diffusion Model of Writing Thick Diffraction Gratings in Polymer-Dispersed Liquid Crystals
Aslanyan, A. L.
Galstyan, A. V.
Hakobyan, R. S.
JOURNAL OF CONTEMPORARY PHYSICS-ARMENIAN ACADEMY OF SCIENCES, 2008, 43 (04) : 168 - 172
[44] DREAMCRAFT: Text-Guided Generation of Functional 3D Environments in Minecraft
Earle, Sam
Kokkinos, Filippos
Nie, Yuhe
Togelius, Julian
Raileanu, Roberta
PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2024, 2024,
[45] DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images
Pan, Mingjie
Gan, Yulu
Zhou, Fangxu
Liu, Jiaming
Zhang, Ying
Wang, Aimin
Zhang, Shanghang
Li, Dawei
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X, 2023, 14229 : 323 - 332
[46] Neural Wavelet-domain Diffusion for 3D Shape Generation
Hui, Ka-Hei
Li, Ruihui
Hu, Jingyu
Fu, Chi-Wing
PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
[47] Synthetic CT generation from MRI using 3D transformer-based denoising diffusion model
Pan, Shaoyan
Abouei, Elham
Wynne, Jacob
Chang, Chih-Wei
Wang, Tonghe
Qiu, Richard L. J.
Li, Yuheng
Peng, Junbo
Roper, Justin
Patel, Pretesh
Yu, David S.
Mao, Hui
Yang, Xiaofeng
MEDICAL PHYSICS, 2024, 51 (04) : 2538 - 2548
[48] Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting
Durrer, Alicia
Wolleb, Julia
Bieder, Florentin
Friedrich, Paul
Melie-Garcia, Lester
Pineda, Mario Alberto Ocampo
Bercea, Cosmin I.
Hamamci, Ibrahim Ethem
Wiestler, Benedikt
Piraud, Marie
Yaldizli, Oezguer
Granziera, Cristina
Menze, Bjoern
Cattin, Philippe C.
Kofler, Florian
DEEP GENERATIVE MODELS, DGM4MICCAI 2024, 2025, 15224 : 87 - 97
[49] Locally Attentional SDF Diffusion for Controllable 3D Shape Generation
Zheng, Xin-Yang
Pan, Hao
Wang, Peng-Shuai
Tong, Xin
Liu, Yang
Shum, Heung-Yeung
ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):
[50] MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation
Vignac, Clement
Osman, Nagham
Toni, Laura
Frossard, Pascal
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 560 - 576

← 1 2 3 4 5 →