ControlNeRF: Text-Driven 3D Scene Stylization via Diffusion Model

被引:0
|
作者
Chen, Jiahui [1 ]
Yang, Chuanfeng [1 ]
Li, Kaiheng [1 ]
Wu, Qingqiang [1 ]
Hong, Qingqi [1 ]
机构
[1] Xiamen Univ, Dept Digital Media Technol, Xiamen, Peoples R China
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II | 2024年 / 15017卷
关键词
Stylization; Neural Radiance Fields; Diffusion Model; View Synthesis;
D O I
10.1007/978-3-031-72335-3_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D scene stylization aims to generate artistically rendered images from various viewpoints within a 3D space while ensuring style consistency regardless of the viewing angle. Traditional 2D methods usually used in this field struggle with maintaining this consistency when applied to 3D environments. To address this issue, we propose a novel approach named ControlNeRF, which employs a customized conditional diffusion model, ControlNet, and introduces latent variables, obtaining a stylized appearance throughout the scene solely driven by text. Specifically, this text-driven approach effectively overcomes the inconveniences associated with using images as style cues, and it not only achieves a high degree of stylistic consistency across various viewpoints but also produces high-quality images. We have conducted rigorous testing on ControlNeRF with diverse styles, which has confirmed these outcomes. Our approach not only advances the field of 3D scene stylization but also opens new possibilities for artistic expression and digital imaging.
引用
收藏
页码:395 / 406
页数:12
相关论文
共 50 条
  • [41] Diff3DETR: Agent-Based Diffusion Model for Semi-supervised 3D Object Detection
    Deng, Jiacheng
    Lu, Jiahao
    Zhang, Tianzhu
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 57 - 73
  • [42] Nonlocal 3D diffusion model of writing thick diffraction gratings in polymer-dispersed liquid crystals
    A. L. Aslanyan
    A. V. Galstyan
    R. S. Hakobyan
    Journal of Contemporary Physics (Armenian Academy of Sciences), 2008, 43 : 168 - 172
  • [43] Nonlocal 3D Diffusion Model of Writing Thick Diffraction Gratings in Polymer-Dispersed Liquid Crystals
    Aslanyan, A. L.
    Galstyan, A. V.
    Hakobyan, R. S.
    JOURNAL OF CONTEMPORARY PHYSICS-ARMENIAN ACADEMY OF SCIENCES, 2008, 43 (04) : 168 - 172
  • [44] DREAMCRAFT: Text-Guided Generation of Functional 3D Environments in Minecraft
    Earle, Sam
    Kokkinos, Filippos
    Nie, Yuhe
    Togelius, Julian
    Raileanu, Roberta
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2024, 2024,
  • [45] DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images
    Pan, Mingjie
    Gan, Yulu
    Zhou, Fangxu
    Liu, Jiaming
    Zhang, Ying
    Wang, Aimin
    Zhang, Shanghang
    Li, Dawei
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X, 2023, 14229 : 323 - 332
  • [46] Neural Wavelet-domain Diffusion for 3D Shape Generation
    Hui, Ka-Hei
    Li, Ruihui
    Hu, Jingyu
    Fu, Chi-Wing
    PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
  • [47] Synthetic CT generation from MRI using 3D transformer-based denoising diffusion model
    Pan, Shaoyan
    Abouei, Elham
    Wynne, Jacob
    Chang, Chih-Wei
    Wang, Tonghe
    Qiu, Richard L. J.
    Li, Yuheng
    Peng, Junbo
    Roper, Justin
    Patel, Pretesh
    Yu, David S.
    Mao, Hui
    Yang, Xiaofeng
    MEDICAL PHYSICS, 2024, 51 (04) : 2538 - 2548
  • [48] Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting
    Durrer, Alicia
    Wolleb, Julia
    Bieder, Florentin
    Friedrich, Paul
    Melie-Garcia, Lester
    Pineda, Mario Alberto Ocampo
    Bercea, Cosmin I.
    Hamamci, Ibrahim Ethem
    Wiestler, Benedikt
    Piraud, Marie
    Yaldizli, Oezguer
    Granziera, Cristina
    Menze, Bjoern
    Cattin, Philippe C.
    Kofler, Florian
    DEEP GENERATIVE MODELS, DGM4MICCAI 2024, 2025, 15224 : 87 - 97
  • [49] Locally Attentional SDF Diffusion for Controllable 3D Shape Generation
    Zheng, Xin-Yang
    Pan, Hao
    Wang, Peng-Shuai
    Tong, Xin
    Liu, Yang
    Shum, Heung-Yeung
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):
  • [50] MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation
    Vignac, Clement
    Osman, Nagham
    Toni, Laura
    Frossard, Pascal
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 560 - 576