Variational model for low-resource natural language generation in spoken dialogue systems

被引：4

作者：

Van-Khanh Tran ^{[1
,2
]}

Le-Minh Nguyen ^{[1
]}

机构：

[1] Japan Adv Inst Sci & Technol, JAIST 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan

[2] ICTU Thai Nguyen Univ, Univ Informat & Commun Technol, Thai Nguyen, Vietnam

来源：

COMPUTER SPEECH AND LANGUAGE | 2021年 / 65卷

关键词：

Neural language generation; Domain adaptation; Low-resource data; Variational autoencoder; Deconvolutional neural network; CNN; RNN; LSTM; DOMAIN ADAPTATION;

D O I：

10.1016/j.csl.2020.101120

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Natural Language Generation (NLG) plays a critical role in Spoken Dialogue Systems (SDSs), aims at converting a meaning representation into natural language utterances. Recent deep learning-based generators have shown improving results irrespective of providing sufficient annotated data. Nevertheless, how to build a generator that can effectively utilize as much of knowledge from a low-resource setting data is a crucial issue for NLG in SDSs. This paper presents a variational-based NLG framework to tackle the NLG problem of having limited annotated data in two scenarios, domain adaptation and low-resource in-domain training data. Based on this framework, we propose a novel adversarial domain adaptation NLG taclking the former issue, while the latter issue is also handled by a second proposed dual variational model. We extensively conducted the experiments on four different domains in a variety of training scenarios, in which the experimental results show that the proposed methods not only outperform previous methods when having sufficient training dataset but also show its ability to work acceptably well when there is a small amount of in-domain data or adapt quickly to a new domain with only a low-resource target domain data. (C) 2020 Published by Elsevier Ltd.

引用

页数：25