Disentangled Variational Auto-Encoder for semi-supervised learning

被引:60
作者
Li, Yang [1 ]
Pan, Quan [1 ]
Wang, Suhang [3 ]
Peng, Haiyun [2 ]
Yang, Tao [1 ]
Cambria, Erik [2 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian, Shaanxi, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
关键词
Semi-supervised learning; Variational Auto-encoder; Disentangled representation; Neural networks; NATURAL-LANGUAGE;
D O I
10.1016/j.ins.2018.12.057
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised learning is attracting increasing attention due to the fact that datasets of many domains lack enough labeled data. Variational Auto-Encoder (VAE), in particular, has demonstrated the benefits of semi-supervised learning. The majority of existing semi-supervised VAEs utilize a classifier to exploit label information, where the parameters of the classifier are introduced to the VAE. Given the limited labeled data, learning the parameters for the classifiers may not be an optimal solution for exploiting label information. Therefore, in this paper, we develop a novel approach for semi-supervised VAE without classifier. Specifically, we propose a new model called Semi-supervised Disentangled VAE (SDVAE), which encodes the input data into disentangled representation and non-interpretable representation, then the category information is directly utilized to regularize the disentangled representation via the equality constraint. To further enhance the feature learning ability of the proposed VAE, we incorporate reinforcement learning to relieve the lack of data. The dynamic framework is capable of dealing with both image and text data with its corresponding encoder and decoder networks. Extensive experiments on image and text datasets demonstrate the effectiveness of the proposed framework. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:73 / 85
页数:13
相关论文
共 48 条
[1]  
[Anonymous], 2014, Advances in Neural Information Processing Systems
[2]  
[Anonymous], 2015, ARXIV 1502 03167
[3]  
[Anonymous], 2014, P 2014 C EMP METH NA, DOI [DOI 10.3115/V1/D14-1181, 10.3115/v1/D14-1181]
[4]  
[Anonymous], 2012, SHORT PAPERS
[5]  
[Anonymous], ARXIV150505770
[6]  
[Anonymous], 2017, Communications of the ACM, DOI [DOI 10.1145/3065386, DOI 10.2165/00129785-200404040-00005, 10.1145/3065386]
[7]  
[Anonymous], 2014, APPL MULTIVARIATE ST
[8]  
[Anonymous], 2014, Synth Lect Artif Intell Mach Learn
[9]  
[Anonymous], 2017, ICLR POSTER
[10]  
[Anonymous], 2017, P 31 INT C NEURAL IN