Unsupervised many-to-many image-to-image translation across multiple domains

被引:0
作者
Lin, Ye [1 ]
Fu, Keren [2 ]
Ling, Shenggui [1 ]
Cheng, Peng [3 ]
机构
[1] Sichuan Univ, Natl Key Lab Fundamental Sci Synthet Vis, Chengdu, Sichuan, Peoples R China
[2] Sichuan Univ, Coll Comp Sci, Chengdu, Sichuan, Peoples R China
[3] Sichuan Univ, Sch Aeronaut & Astronaut, Chengdu 610065, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1049/ipr2.12227
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised multi-domain image-to-image translation aims to synthesize images among multiple domains without labelled data, which is more general and complicated than one-to-one image mapping. However, existing methods mainly focus on reducing the large costs of modelling and do not pay enough attention to the quality of generated images. In some target domains, their translation results may not be expected or even cause the model collapse. To improve the image quality, an effective many-to-many mapping framework for unsupervised multi-domain image-to-image translation is proposed. There are two key aspects to the proposed method. The first is a many-to-many architecture with only one domain-shared encoder and several domain-specialized decoders to effectively and simultaneously translate images across multiple domains. The second is two proposed constraints extended from one-to-one mappings to further help improve the generation. All the evaluations demonstrate that the proposed framework is superior to existing methods and provides an effective solution for multi-domain image-to-image translation.
引用
收藏
页码:2412 / 2423
页数:12
相关论文
共 53 条
[1]  
Abboud B., 2003, IEEE COMP SOC C COMP
[2]  
Almahairi A, 2018, PR MACH LEARN RES, V80
[3]   ComboGAN: Unrestrained Scalability for Image Domain Translation [J].
Anoosheh, Asha ;
Agustsson, Eirikur ;
Timofte, Radu ;
Van Gool, Luc .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :896-903
[4]   Pattern recognition for high performance imaging [J].
Bai, Xiao ;
Zhou, Jun ;
Robles-Kelly, Antonio .
PATTERN RECOGNITION, 2018, 82 :38-39
[5]   Memetically Optimized MCWLD for Matching Sketches With Digital Face Images [J].
Bhatt, Himanshu S. ;
Bharadwaj, Samarth ;
Singh, Richa ;
Vatsa, Mayank .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2012, 7 (05) :1522-1535
[6]  
Binkowski Mikolaj, 2018, ICLR
[7]   VGGFace2: A dataset for recognising faces across pose and age [J].
Cao, Qiong ;
Shen, Li ;
Xie, Weidi ;
Parkhi, Omkar M. ;
Zisserman, Andrew .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74
[8]   StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation [J].
Choi, Yunjey ;
Choi, Minje ;
Kim, Munyoung ;
Ha, Jung-Woo ;
Kim, Sunghun ;
Choo, Jaegul .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8789-8797
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672