MatchGAN: A Self-supervised Semi-supervised Conditional Generative Adversarial Network

Cited by: 1
Authors
Sun, Jiaze [1]
Bhattarai, Binod [1]
Kim, Tae-Kyun [1, 2]
Affiliations
[1] Imperial Coll London, Exhibit Rd, London SW7 2AZ, England
[2] Korea Adv Inst Sci & Technol, 291 Daehak Ro, Daejeon 34141, South Korea
Source
COMPUTER VISION - ACCV 2020, PT IV | 2021 / Vol. 12625
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Conditional generative adversarial network; Self-supervised learning; Semi-supervised learning; Face analysis;
DOI
10.1007/978-3-030-69538-5_37
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a novel self-supervised learning approach for conditional generative adversarial networks (GANs) under a semi-supervised setting. Unlike prior self-supervised approaches, which often involve geometric augmentations on the image space such as predicting rotation angles, our pretext task leverages the label space. We perform augmentation by randomly sampling sensible labels from the label space of the few labelled examples available and assigning them as target labels to the abundant unlabelled examples from the same distribution as that of the labelled ones. The images are then translated and grouped into positive and negative pairs by their target labels, acting as training examples for our pretext task, which involves optimising an auxiliary match loss on the discriminator's side. We tested our method on two challenging benchmarks, CelebA and RaFD, and evaluated the results using standard metrics including Fréchet Inception Distance, Inception Score, and Attribute Classification Rate. Extensive empirical evaluation demonstrates the effectiveness of our proposed method over competitive baselines and existing methods. In particular, our method surpasses the baseline with only 20% of the labelled examples used to train the baseline.
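The label-space augmentation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`sample_target_labels`, `make_pairs`, `match_loss`) are hypothetical, the generator's image translation step is omitted, and the binary cross-entropy form of the match loss is an assumption, since the abstract does not specify it.

```python
import math
import random
from itertools import combinations


def sample_target_labels(labelled_labels, num_unlabelled, rng):
    """Draw one target label per unlabelled image.

    Labels are sampled only from those observed in the labelled set,
    so every sampled attribute combination is a plausible one.
    """
    return [rng.choice(labelled_labels) for _ in range(num_unlabelled)]


def make_pairs(target_labels):
    """Group image indices into positive and negative pairs.

    A pair is positive when both images share the same target label
    and negative otherwise.
    """
    positives, negatives = [], []
    for i, j in combinations(range(len(target_labels)), 2):
        if target_labels[i] == target_labels[j]:
            positives.append((i, j))
        else:
            negatives.append((i, j))
    return positives, negatives


def match_loss(scores, is_match):
    """Assumed form: mean binary cross-entropy over pair scores in (0, 1)."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for s, y in zip(scores, is_match):
        total -= y * math.log(s + eps) + (1 - y) * math.log(1 - s + eps)
    return total / len(scores)


# Hypothetical binary attribute vectors from the small labelled set.
labelled = [(1, 0, 0), (0, 1, 0), (0, 1, 1)]
rng = random.Random(0)
targets = sample_target_labels(labelled, num_unlabelled=6, rng=rng)
positives, negatives = make_pairs(targets)
```

In a full training loop, the discriminator would score each pair of translated images and those scores would be passed to `match_loss` together with the positive/negative indicators.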
Pages: 608-623
Number of pages: 16