Semisupervised Text Classification by Variational Autoencoder

被引:50
作者
Xu, Weidi [1 ]
Tan, Ying [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Dept Machine Intelligence, Key Lab Machine Percept,Minist Educ, Beijing 100871, Peoples R China
基金
北京市自然科学基金;
关键词
Data models; Decoding; Task analysis; Training; Semisupervised learning; Predictive models; Feature extraction; Generative models; semisupervised learning; text classification; variational autoencoder (VAE); INFERENCE;
D O I
10.1109/TNNLS.2019.2900734
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semisupervised text classification has attracted much attention from the research community. In this paper, a novel model, the semisupervised sequential variational autoencoder (SSVAE), is proposed to tackle this problem. By treating the categorical label of unlabeled data as a discrete latent variable, the proposed model maximizes the variational evidence lower bound of the data likelihood, which implicitly derives the underlying label distribution for the unlabeled data. Analytical work indicates that the autoregressive nature of the sequential model is the crucial issue that renders the vanilla model ineffective. To remedy this, two types of decoders are investigated in the SSVAE model and verified. In addition, a reweighting approach is proposed to circumvent the credit assignment problem that occurs during the reconstruction procedure, which can further improve performance for sparse text data. Experimental results show that our method significantly improves the classification accuracy compared with other modern methods.
引用
收藏
页码:295 / 308
页数:14
相关论文
共 61 条
[1]  
[Anonymous], 2016, PROC 9 ISCA SPEEC
[2]  
[Anonymous], 1997, Neural Computation
[3]  
[Anonymous], 2000, P 17 INT C MACH LEAR
[4]  
Bahdanau D., 2014, ABS14090473 CORR
[5]  
Berger J., 2010, Proceedings of the Python for Scientific Computing Conference (SciPy), number Scipy, P1
[6]  
Bowman S, 2016, P 20 SIGNLL C COMP N, P10
[7]  
Cho K., 2014, P 8 WORKSH SYNT SEM, P103, DOI DOI 10.3115/V1/W14-4012
[8]  
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[9]  
Cozman F.G., 2003, P 20 INT C MACHINE L, P99
[10]   Inverting the Generator of a Generative Adversarial Network [J].
Creswell, Antonia ;
Bharath, Anil Anthony .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (07) :1967-1974