MULTITASK GENERATIVE ADVERSARIAL IMITATION LEARNING FOR MULTI-DOMAIN DIALOGUE SYSTEM

被引：10

作者：

Hsu, Chuan-En ^{[1
]}

Rohmatillah, Mahdin ^{[1
]}

Chien, Jen-Tzung ^{[1
]}

机构：

[1] Natl Yang Ming Chiao Tung Univ, Dept Elect & Comp Engn, Taipei, Taiwan

来源：

2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021年

关键词：

Dialogue policy optimization; generative adversarial imitation learning; multi-domain dialogues;

D O I：

10.1109/ASRU51503.2021.9688234

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the task-oriented dialogue system, dialog policy plays an important role since it determines the suitable actions based on the user's goals. However, in real situations, user's goals are varying so that the system needs to deal with the complex optimization problem for dialog policy. This paper presents a novel approach to build the multi-domain dialog system based on the multitask generative adversarial imitation learning (MGAIL). MGAIL combines hierarchical reinforcement learning and generative adversarial imitation learning where a mixture of generators are represented for multitask learning. Unlike the traditional imitation learning, this method decomposes each of complex tasks into several subtasks and builds the policy in a hierarchical way to relax the agent in handling multiple complex tasks. Experiments on a multi-domain dialogue system using MultiWOZ 2.1 under ConvLab-2 framework show that the proposed method outperforms the other reinforcement learning methods in system-wise evaluation in terms of complete rate, success rate and book rate.

引用

页码：954 / 961

页数：8

共 28 条

[1] Stochastic Curiosity Exploration for Dialogue Systems [J].

Chien, Jen-Tzung ;

Hsu, Po-Chien .

INTERSPEECH 2020, 2020, :3885-3889

[2]

Chien JT, 2020, ASIAPAC SIGN INFO PR, P1611

[3] Meta Learning for Hyperparameter Optimization in Dialogue System [J].

Chien, Jen-Tzung ;

Lieow, Wei Xiang .

INTERSPEECH 2019, 2019, :839-843

[4]

Chien Jen-Tzung, 2019, P EUR SIGN PROC C, P1

[5]

Chien Jen-Tzung, 2020, P EUR SIGN PROC C, P1527

[6]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[7]

Espeholt L, 2018, PR MACH LEARN RES, V80

[8]

Goodfellow I., 2020, ADV NEUR IN, V63, P139, DOI [DOI 10.1145/3422622, 10.1145/3422622]

[9]

Ho J., 2016, ADV NEURAL INFORM PR, V29, P4565

[10]

Hoang Quan, 2018, P INT C LEARN REPR

← 1 2 3 →