Joint Dual Learning With Mutual Information Maximization for Natural Language Understanding and Generation in Dialogues

Cited by: 0
Authors
Su, Shang-Yu [1]
Chung, Yung-Sung [2]
Chen, Yun-Nung [3]
Affiliations
[1] Rakuten Inst Technol, Rakuten, Tokyo 1580094, Japan
[2] MIT, Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[3] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 106319, Taiwan
Keywords
Dual learning; natural language understanding; natural language generation; mutual information
DOI
10.1109/TASLP.2024.3364063
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Modular conversational systems heavily rely on the performance of their natural language understanding (NLU) and natural language generation (NLG) components. NLU focuses on extracting core semantic concepts from input texts, while NLG constructs coherent sentences based on these extracted semantics. Inspired by information theory in digital communication, we introduce a one-way communication model that mirrors human conversations, comprising two distinct phases: (1) the conversion of thoughts into messages, similar to NLG, and (2) the comprehension of received messages, similar to NLU. This paper presents a novel algorithm that trains NLU and NLG collaboratively by concatenating their models and maximizing mutual information between inputs and outputs. This approach efficiently facilitates the transmission of semantics, leading to enhanced learning performance for both components. Our experimental results, based on three benchmark datasets, consistently demonstrate significant improvements for both NLU and NLG tasks, highlighting the practical promise of our proposed method.
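As a rough illustration of the quantity the abstract refers to, the sketch below computes mutual information for a small discrete joint distribution. It is not the training algorithm from the paper: the function name `mutual_information` and the two toy distributions are assumptions made for this example, and a neural NLU/NLG pipeline would need an estimator rather than an exact sum.

```python
import math


def mutual_information(joint):
    """Return I(X;Y) in bits for a discrete joint distribution.

    joint[i][j] is P(X = i, Y = j); the entries are assumed to sum to 1.
    """
    # Marginals: sum over rows for P(X), over columns for P(Y).
    px = [sum(row) for row in joint]
    py = [sum(col) for col in zip(*joint)]

    mi = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0:  # zero-probability cells contribute nothing
                mi += p * math.log2(p / (px[i] * py[j]))
    return mi


# Toy one-way channel (hypothetical numbers): X is the intended meaning,
# Y is the meaning recovered after generation followed by understanding.
lossless = [[0.5, 0.0], [0.0, 0.5]]            # Y always equals X
uninformative = [[0.25, 0.25], [0.25, 0.25]]   # Y is independent of X

mi_lossless = mutual_information(lossless)
mi_uninformative = mutual_information(uninformative)
```

In this toy setting, a channel that always recovers the intended meaning has higher mutual information than one whose output is independent of the input, which is the intuition behind maximizing it across the concatenated NLG and NLU models.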
Pages: 2445-2452
Page count: 8