Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations

被引:3
|
作者
Ruan, Yu-Ping [1 ]
Ling, Zhen-Hua [1 ]
Zhu, Xiaodan [2 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Speech & Language Informat Proc, 443 Huangshan Rd, Hefei, Peoples R China
[2] Queens Univ, Dept Elect & Comp Engn, Walter Light Hall,Union St, Kingston, ON K7L 2N1, Canada
关键词
Neural network; variational autoencoder; conversation; text generation;
D O I
10.1145/3402884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, conditional-transforming variational autoencoders (CTVAEs) are proposed for generating diverse short text conversations. In conditional variational autoencoders (CVAEs), the prior distribution of latent variable z follows a multivariate Gaussian distribution with mean and variance modulated by the input conditions. Previous work found that this distribution tended to become condition-independent in practical applications. Thus, this article designs CTVARs to enhance the influence of conditions in CVAEs. In a CTVAE model, the latent variable z is sampled by performing a non-linear transformation on the combination of the input conditions and the samples from a condition-independent prior distribution N(0.1). In our experiments using a Chinese Sina Weibo dataset, the CTVAE model derives z samples for decoding with better condition-dependency than that of the CVAE model. The earth mover's distance (EMD) between the distributions of the latent variable z at the training stage, and the testing stage is also reduced by using the CTVAE model. In subjective preference tests, our proposed CTVAE model performs significantly better than CVAE and sequence-to-sequence (Seq2Seq) models on generating diverse, informative, and topic-relevant responses.
引用
收藏
页数:13
相关论文
共 2 条
  • [1] CONDITION-TRANSFORMING VARIATIONAL AUTOENCODER FOR CONVERSATION RESPONSE GENERATION
    Ruan, Yu-Ping
    Ling, Zhen-Hua
    Liu, Quan
    Chen, Zhigang
    Indurkhya, Nitin
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7215 - 7219
  • [2] Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework
    Zhang, Yiming
    Du, Ruoyi
    Tan, Zheng-Hua
    Wang, Wenwu
    Ma, Zhanyu
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2520 - 2524