Advances and Challenges in Multi-Domain Task-Oriented Dialogue Policy Optimization

被引：5

作者：

Rohmatillah, Mahdin ^{[1
]}

Chien, Jen-Tzung ^{[2
]}

机构：

[1] Natl Yang Ming Chiao Tung Univ, EECS Int Grad Program, Hsinchu, Taiwan

[2] Natl Yang Ming Chiao Tung Univ, Inst Elect & Comp Engn, Hsinchu, Taiwan

来源：

APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING | 2023年 / 12卷 / 01期

关键词：

Multi-domain task-oriented dialogue system; dialogue policy optimization; reinforcement learning; imitation learning; dialogue act prediction; word-level policy learning; MODEL;

D O I：

10.1561/116.00000132

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Developing a successful dialogue policy for a multi-domain task-oriented dialogue (MDTD) system is a challenging task. Basically, a desirable dialogue policy acts as the decision-making agent who understands the user's intention to provide suitable responses within a short conversation. Furthermore, offering the precise answers to satisfy the user requirements makes the task even more challenging. This paper surveys recent advances in multi-domain task-oriented dialogue policy optimization and summarizes a number of solutions to policy learning. In particular, the case study on the task of travel assistance using the MDTD dataset based on MultiWOZ containing seven different domains is investigated. The dialogue policy optimization methods, categorized into dialogue act level and word level, are systematically presented. Moreover, this paper addresses a number of challenges and difficulties including the user simulator design and the dialogue policy evaluation which need to be resolved to further enhance the robustness and effectiveness in multi-domain dialogue policy representation.

引用

页数：52

共 118 条

[1]

Abel D., 2016, P NIPS WORKSH FUT IN

[2]

Bang Y, 2023, Arxiv, DOI [arXiv:2302.04023, DOI 10.48550/ARXIV.2302.04023]

[3]

Budzianowski P., 2019, P 3 WORKSHOP NEURAL, DOI DOI 10.18653/V1/D19-5602

[4]

Budzianowski P, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P5016

[5]

Byrne B, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P4516

[6]

Chang Jonathan D, 2021, Advances in Neural Information Processing Systems

[7]

Chen M.-Y., 2023, P IEEE INT C AC SPEE, P1

[8]

Chen WH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P3696

[9]

Cheng QY, 2022, Arxiv, DOI arXiv:2210.14529

[10] Bayesian Transformer Using Disentangled Mask Attention [J].

Chien, Jen-Tzung ;

Huang, Yu-Han .

INTERSPEECH 2022, 2022, :1761-1765

← 1 2 3 4 5 6 7 8 9 10 →