3D-Aware Generative Model for Improved Side-View Image Synthesis

被引：0

作者：

Jo, Kyungmin ^{[1
]}

Jin, Wonjoon ^{[2
]}

Choo, Jaegul ^{[1
]}

Lee, Hyunjoon ^{[3
]}

Cho, Sunghyun ^{[2
]}

机构：

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[2] POSTECH, Pohang, Gyeongbuk, South Korea

[3] Kakao Brain, Seongnam Si, Gyeonggi Do, South Korea

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While recent 3D-aware generative models have shown photo-realistic image synthesis with multi-view consistency, the synthesized image quality degrades depending on the camera pose (e.g., a face with a blurry and noisy boundary at a side viewpoint). Such degradation is mainly caused by the difficulty of learning both pose consistency and photorealism simultaneously from a dataset with heavily imbalanced poses. In this paper, we propose SideGAN, a novel 3D GAN training method to generate photo-realistic images irrespective of the camera pose, especially for faces of side-view angles. To ease the challenging problem of learning photo-realistic and pose-consistent image synthesis, we split the problem into two subproblems, each of which can be solved more easily. Specifically, we formulate the problem as a combination of two simple discrimination problems, one of which learns to discriminate whether a synthesized image looks real or not, and the other learns to discriminate whether a synthesized image agrees with the camera pose. Based on this, we propose a dual-branched discriminator with two discrimination branches. We also propose a pose-matching loss to learn the pose consistency of 3D GANs. In addition, we present a pose sampling strategy to increase learning opportunities for steep angles in a pose-imbalanced dataset. With extensive validation, we demonstrate that our approach enables 3D GANs to generate high-quality geometries and photo-realistic images irrespective of the camera pose.

引用

页码：22805 / 22815

页数：11

共 50 条

[21] GIRAFFE HD: A High-Resolution 3D-aware Generative Model [J].

Xue, Yang ;

Li, Yuheng ;

Singh, Krishna Kumar ;

Lee, Yong Jae .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18419-18428

[22] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis [J].

Do, Hoseok ;

Yoo, EunKyung ;

Kim, Taehyeong ;

Lee, Chul ;

Choi, Tin Young .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :8529-8538

[23] VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel Grids [J].

Schwarz, Katja ;

Sauer, Axel ;

Niemeyer, Michael ;

Liao, Yiyi ;

Geiger, Andreas .

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,

[24] 3D-aware Image Synthesis via Learning Structural and Textural Representations [J].

Xu, Yinghao ;

Peng, Sida ;

Yang, Ceyuan ;

Shen, Yujun ;

Zhou, Bolei .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18409-18418

[25] The 3D-aware image synthesis of prohibited items in the X-ray security inspection by stylized generative radiance fields [J].

Liu, Jian ;

Yu, Zhen ;

Guo, Wenyu .

ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (03) :1801-1821

[26] Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks [J].

Barthel, Florian ;

Beckmann, Arian ;

Morgenstern, Wieland ;

Hilsmann, Anna ;

Eisert, Peter .

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, :7963-7972

[27] DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis [J].

Xu, Yinghao ;

Chai, Menglei ;

Shi, Zifan ;

Peng, Sida ;

Skorokhodov, Ivan ;

Siarohin, Aliaksandr ;

Yang, Ceyuan ;

Shen, Yujun ;

Lee, Hsin-Ying ;

Zhou, Bolei ;

Tulyakov, Sergey .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :4402-4412

[28] NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion [J].

Gu, Jiatao ;

Trevithick, Alex ;

Lin, Kai-En ;

Susskind, Josh ;

Theohalt, Christian ;

Liu, Lingjie ;

Ramamoorthi, Ravi .

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202

[29] 3D Congealing: 3D-Aware Image Alignment in the Wild [J].

Zhang, Yunzhi ;

Li, Zizhang ;

Raj, Amit ;

Engelhardt, Andreas ;

Li, Yuanzhen ;

Hou, Tingbo ;

Wu, Jiajun ;

Jampani, Varun .

COMPUTER VISION-ECCV 2024, PT I, 2025, 15059 :387-404

[30] Generative Multiplane Images: Making a 2D GAN 3D-Aware [J].

Zhao, Xiaoming ;

Ma, Fangchang ;

Guera, David ;

Ren, Zhile ;

Schwing, Alexander G. ;

Colburn, Alex .

COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 :18-35

← 1 2 3 4 5 →