Be Your Own Prada: Fashion Synthesis with Structural Coherence

被引：179

作者：

Zhu, Shizhan ^{[1
]}

Fidler, Sanja ^{[2
,3
]}

Urtasun, Raquel ^{[2
,3
,4
]}

Lin, Dahua ^{[1
]}

Loy, Chen Change ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Hong Kong, Peoples R China

[2] Univ Toronto, Toronto, ON, Canada

[3] Vector Inst, Toronto, ON, Canada

[4] Uber Adv Technol Grp, Toronto, ON, Canada

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年

关键词：

D O I：

10.1109/ICCV.2017.186

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a novel and effective approach for generating new clothing on a wearer through generative adversarial learning. Given an input image of a person and a sentence describing a different outfit, our model "redresses" the person as desired, while at the same time keeping the wearer and her/his pose unchanged. Generating new outfits with precise regions conforming to a language description while retaining wearer's body structure is a new challenging task. Existing generative adversarial networks are not ideal in ensuring global coherence of structure given both the input photograph and language description as conditions. We address this challenge by decomposing the complex generative process into two conditional stages. In the first stage, we generate a plausible semantic segmentation map that obeys the wearer's pose as a latent spatial arrangement. An effective spatial constraint is formulated to guide the generation of this semantic segmentation map. In the second stage, a generative model with a newly proposed compositional mapping layer is used to render the final image with precise regions and textures conditioned on this map. We extended the DeepFashion dataset [8] by collecting sentence descriptions for 79K images. We demonstrate the effectiveness of our approach through both quantitative and qualitative evaluations. A user study is also conducted.

引用

页码：1689 / 1697

页数：9

共 19 条

[1]

[Anonymous], 2016, EUR C COMP VIS ECCV

[2]

[Anonymous], 2010, ACM T GRAPHIC

[3]

[Anonymous], 2016, P IEEE C COMP VIS PA

[4]

[Anonymous], 2016, ECCV

[5]

[Anonymous], 2015, ICCV

[6]

[Anonymous], 2017, PROC IEEE C COMPUTER

[7]

[Anonymous], 2016, P ADV NEURAL INFORM

[8]

[Anonymous], 2015, ICCV

[9]

[Anonymous], ICMR

[10] Generative Adversarial Networks [J].

Goodfellow, Ian ;

Pouget-Abadie, Jean ;

Mirza, Mehdi ;

Xu, Bing ;

Warde-Farley, David ;

Ozair, Sherjil ;

Courville, Aaron ;

Bengio, Yoshua .

COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144

← 1 2 →