Combining deep generative and discriminative models for Bayesian semi-supervised learning

被引:37
作者
Gordon, Jonathan [1 ]
Hernandez-Lobato, Jose Miguel [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge, England
关键词
Probabilistic models; Semi-supervised learning; Variational autoencoders; Predictive uncertainty;
D O I
10.1016/j.patcog.2019.107156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative models can be used for a wide range of tasks, and have the appealing ability to learn from both labelled and unlabelled data. In contrast, discriminative models cannot learn from unlabelled data, but tend to outperform their generative counterparts in supervised tasks. We develop a framework to jointly train deep generative and discriminative models, enjoying the benefits of both. The framework allows models to learn from labelled and unlabelled data, as well as naturally account for uncertainty in predictive distributions, providing the first Bayesian approach to semi-supervised learning with deep generative models. We demonstrate that our blended discriminative and generative models outperform purely generative models in both predictive performance and uncertainty calibration in a number of semi-supervised learning tasks. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页数:10
相关论文
共 46 条
[1]   Infinite Variational Autoencoder for Semi-Supervised Learning [J].
Abbasnejad, M. Ehsan ;
Dick, Anthony ;
van den Hengel, Anton .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :781-790
[2]  
Bishop CM., 2006, Pattern Recognition and Machine Learning
[3]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[4]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[5]  
Chapelle O, 2010, SEMISUPERVISED LEARN
[6]  
Cremer C., 2018, P 35 INT C MACHINE L, P1078
[7]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
[8]  
2-4
[9]  
Dai Zihang, 2017, ADV NEURAL INFORM PR, P6510
[10]  
DEPEWEG S, 2017, ARXIV171203605