MidGAN: Mutual information in GAN-based dialogue models

被引:1
作者
Najari, Shaghayegh [1 ]
Salehi, Mostafa [1 ,2 ]
Farahbakhsh, Reza [3 ]
Tyson, Gareth [4 ]
机构
[1] Univ Tehran, Fac New Sci & Technol, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, POB 193955746, Tehran, Iran
[3] Inst Polytech Paris, Telecom SudParis, Evry, France
[4] Queen Mary Univ London, London, England
关键词
Conversational models; Mutual information; Generative adversarial networks; Text generation;
D O I
10.1016/j.asoc.2023.110909
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite a large amount of research into task-oriented dialogue systems, open-ended dialogue agents have received little attention. Recently, researchers have explored using Generative Adversarial Network (GAN) to build such models, however, these require extensive computation and data. This paper propose MidGAN to address prior limitations of GAN-based dialogue models. It does this by trying to maximize the mutual information of the responses generated by the model. To this end, we propose a new metric, MMI-Like. This is based on Maximizing Mutual Information (MMI), yet unlike MMI, does not rely on an auxiliary generative model. We evaluate MidGAN based on the diversity, informativeness by measuring similarity and relevance of the responses it generates by BLEU metric. Our evaluation results, based on the three benchmark datasets, show that MidGAN outperforms the existing state-of-the-art framework, ADV.
引用
收藏
页数:11
相关论文
共 56 条
[31]  
Mitchell J., 2008, P ACL 08 HLT, P236
[32]   GANBOT: a GAN-based framework for social bot detection [J].
Najari, Shaghayegh ;
Salehi, Mostafa ;
Farahbakhsh, Reza .
SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
[33]   A framework for building closed-domain chat dialogue systems [J].
Nakano, Mikio ;
Komatani, Kazunori .
KNOWLEDGE-BASED SYSTEMS, 2020, 204
[34]  
Ni J., 2021, arXiv
[35]   Recent advances in deep learning based dialogue systems: a systematic survey [J].
Ni, Jinjie ;
Young, Tom ;
Pandelea, Vlad ;
Xue, Fuzhao ;
Cambria, Erik .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) :3055-3155
[36]  
Olabiyi O, 2020, NLP FOR CONVERSATIONAL AI, P54
[37]  
Rashkin H, 2019, Arxiv, DOI arXiv:1811.00207
[38]  
Rus Vasile, 2012, Intelligent Tutoring Systems. Proceedings 11th International Conference (ITS 2012), P675, DOI 10.1007/978-3-642-30950-2_116
[39]  
Sakaeda R, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, P76
[40]   A MATHEMATICAL THEORY OF COMMUNICATION [J].
SHANNON, CE .
BELL SYSTEM TECHNICAL JOURNAL, 1948, 27 (04) :623-656