Text2Mesh: Text-Driven Neural Stylization for Meshes

Cited by: 150
Authors
Michel, Oscar [1]
Bar-On, Roi [1, 2]
Liu, Richard [1]
Benaim, Sagie [2]
Hanocka, Rana [1]
Affiliations
[1] Univ Chicago, Chicago, IL 60637 USA
[2] Tel Aviv Univ, Tel Aviv, Israel
Source
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022
DOI
10.1109/CVPR52688.2022.01313
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we develop intuitive controls for editing the style of 3D objects. Our framework, Text2Mesh, stylizes a 3D mesh by predicting color and local geometric details which conform to a target text prompt. We consider a disentangled representation of a 3D object using a fixed mesh input (content) coupled with a learned neural network, which we term a neural style field network (NSF). In order to modify style, we obtain a similarity score between a text prompt (describing style) and a stylized mesh by harnessing the representational power of CLIP. Text2Mesh requires neither a pre-trained generative model nor a specialized 3D mesh dataset. It can handle low-quality meshes (non-manifold, boundaries, etc.) with arbitrary genus, and does not require UV parameterization. We demonstrate the ability of our technique to synthesize a myriad of styles over a wide variety of 3D meshes. Our code and results are available on our project webpage: https://threedle.github.io/text2mesh/.
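The idea in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the tiny untrained network, the names `make_style_field` and `stylize`, the displacement bound of 0.1, and the choice to move each vertex along its normal are all assumptions made for illustration. The `cosine_similarity` function stands in for the score that would be computed between CLIP embeddings of the text prompt and of rendered views of the stylized mesh; no CLIP model or renderer is included here.

```python
import math
import random


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors.

    In the setting described above, `a` and `b` would be embeddings of a
    text prompt and of a rendered mesh; here they are plain lists of floats.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def make_style_field(hidden=8, seed=0):
    """Build a hypothetical, untrained one-hidden-layer 'style field'.

    It maps a 3D point to an RGB color in [0, 1] and a scalar displacement
    in [-0.1, 0.1]. The weights are random; a real system would optimize
    them to increase the similarity score.
    """
    rng = random.Random(seed)
    w1 = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(hidden)]
    w2 = [[rng.uniform(-1, 1) for _ in range(hidden)] for _ in range(4)]

    def field(point):
        h = [math.tanh(sum(w * x for w, x in zip(row, point))) for row in w1]
        out = [sum(w * x for w, x in zip(row, h)) for row in w2]
        color = [0.5 * (math.tanh(o) + 1.0) for o in out[:3]]
        displacement = 0.1 * math.tanh(out[3])
        return color, displacement

    return field


def stylize(vertices, normals, field):
    """Apply the style field per vertex; the mesh connectivity is untouched.

    Assumption: each vertex moves along its (unit) normal by the predicted
    displacement, so the input mesh keeps its role as fixed content.
    """
    styled = []
    for point, normal in zip(vertices, normals):
        color, d = field(point)
        moved = [p + d * n for p, n in zip(point, normal)]
        styled.append((moved, color))
    return styled
```

Because the style is a function of position evaluated at the vertices, this formulation needs no UV parameterization, which matches the claim in the abstract.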
Pages: 13482-13492
Page count: 11