DiffusionRig: Learning Personalized Priors for Facial Appearance Editing

被引:25
作者
Ding, Zheng [1 ]
Zhang, Xuaner [2 ]
Xia, Zhihao [2 ]
Jebe, Lars [2 ]
Tu, Zhuowen [1 ]
Zhang, Xiuming [2 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Adobe, San Jose, CA USA
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
D O I
10.1109/CVPR52729.2023.01225
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of learning person-specific facial priors from a small number (e.g., 20) of portrait photos of the same person. This enables us to edit this specific person's facial appearance, such as expression and lighting, while preserving their identity and high-frequency facial details. Key to our approach, which we dub DiffusionRig, is a diffusion model conditioned on, or "rigged by," crude 3D face models estimated from single in-the-wild images by an off-the-shelf estimator. On a high level, DiffusionRig learns to map simplistic renderings of 3D face models to realistic photos of a given person. Specifically, DiffusionRig is trained in two stages: It first learns generic facial priors from a large-scale face dataset and then person-specific priors from a small portrait photo collection of the person of interest. By learning the CGI-to-photo mapping with such personalized priors, DiffusionRig can "rig" the lighting, facial expression, head pose, etc. of a portrait photo, conditioned only on coarse 3D models while preserving this person's identity and other high-frequency characteristics. Qualitative and quantitative experiments show that DiffusionRig outperforms existing approaches in both identity preservation and photorealism. Please see the project website: https://diffusionrig.github.io for the supplemental material, video, code, and data.
引用
收藏
页码:12736 / 12746
页数:11
相关论文
共 54 条
[21]   A Style-Based Generator Architecture for Generative Adversarial Networks [J].
Karras, Tero ;
Laine, Samuli ;
Aila, Timo .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4396-4405
[22]  
King DE, 2009, J MACH LEARN RES, V10, P1755
[23]  
Kingma DP, 2014, ADV NEUR IN, V27
[24]   Blind Face Restoration via Deep Multi-scale Component Dictionaries [J].
Li, Xiaoming ;
Chen, Chaofeng ;
Zhou, Shangchen ;
Lin, Xianhui ;
Zuo, Wangmeng ;
Zhang, Lei .
COMPUTER VISION - ECCV 2020, PT IX, 2020, 12354 :399-415
[25]   3D-FM GAN: Towards 3D-Controllable Face Manipulation [J].
Liu, Yuchen ;
Shu, Zhixin ;
Li, Yijun ;
Lin, Zhe ;
Zhang, Richard ;
Kung, S. Y. .
COMPUTER VISION - ECCV 2022, PT XV, 2022, 13675 :107-125
[26]   Deep Learning Face Attributes in the Wild [J].
Liu, Ziwei ;
Luo, Ping ;
Wang, Xiaogang ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3730-3738
[27]   Deep Appearance Models for Face Rendering [J].
Lombardi, Stephen ;
Saragih, Jason ;
Simon, Tomas ;
Sheikh, Yaser .
ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04)
[28]  
Mallikarjun B. R, 2021, ARXIV210307658CS
[29]   Deep Reflectance Fields High-Quality Facial Reflectance Field Inference from Color Gradient Illumination [J].
Meka, Abhimitra ;
Hane, Christian ;
Pandey, Rohit ;
Zollhofer, Michael ;
Fanello, Sean ;
Fyffe, Graham ;
Kowdle, Adarsh ;
Yu, Xueming ;
Busch, Jay ;
Dour-Garian, Jason ;
Denny, Peter ;
Bouaziz, Sofien ;
Lincoln, Peter ;
Whalen, Matt ;
Harvey, Geoff ;
Taylor, Jonathan ;
Izadi, Shahram ;
Tagliasacchi, Andrea ;
Debevec, Paul ;
Theobalt, Christian ;
Valentin, Julien ;
Rhemann, Christoph .
ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (04)
[30]   Learning Physics-guided Face Relighting under Directional Light [J].
Nestmeyer, Thomas ;
Lalonde, Jean-Francois ;
Matthews, Iain ;
Lehrmann, Andreas .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5123-5132