MidGAN: Mutual information in GAN-based dialogue models

Cited by: 1
Authors
Najari, Shaghayegh [1]
Salehi, Mostafa [1, 2]
Farahbakhsh, Reza [3]
Tyson, Gareth [4]
Affiliations
[1] Univ Tehran, Fac New Sci & Technol, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, POB 193955746, Tehran, Iran
[3] Inst Polytech Paris, Telecom SudParis, Evry, France
[4] Queen Mary Univ London, London, England
Keywords
Conversational models; Mutual information; Generative adversarial networks; Text generation
DOI
10.1016/j.asoc.2023.110909
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Despite a large amount of research into task-oriented dialogue systems, open-ended dialogue agents have received little attention. Recently, researchers have explored using Generative Adversarial Networks (GANs) to build such models; however, these require extensive computation and data. This paper proposes MidGAN to address prior limitations of GAN-based dialogue models. It does this by trying to maximize the mutual information of the responses generated by the model. To this end, we propose a new metric, MMI-Like. This is based on Maximizing Mutual Information (MMI) yet, unlike MMI, does not rely on an auxiliary generative model. We evaluate MidGAN on the diversity and informativeness of the responses it generates, measuring their similarity and relevance with the BLEU metric. Our evaluation results, based on three benchmark datasets, show that MidGAN outperforms the existing state-of-the-art framework, ADV.
Pages: 11