Optimizing pipeline task-oriented dialogue systems using post-processing networks

被引:0
|
作者
Ohashi, Atsumoto [1 ]
Higashinaka, Ryuichiro [1 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Chikusa, Aichi 4648601, Japan
关键词
Dialogue system; Task-oriented dialogue; Reinforcement learning; Post-processing;
D O I
10.1016/j.csl.2024.101742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many studies have proposed methods for optimizing the dialogue performance of an entire pipeline task-oriented dialogue system by jointly training modules in the system using reinforcement learning. However, these methods are limited in that they can only be applied to modules implemented using trainable neural-based methods. To solve this problem, we propose a method for optimizing the dialogue performance of a pipeline system that consists of modules implemented with arbitrary methods for dialogue. With our method, neural- based components called post-processing networks (PPNs) are installed inside such a system to post-process the output of each module. All PPNs are updated to improve the overall dialogue performance of the system by using reinforcement learning, not necessitating that each module be differentiable. Through dialogue simulations and human evaluations on two well- studied task-oriented dialogue datasets, CamRest676 and MultiWOZ, we show that our method can improve the dialogue performance of pipeline systems consisting of various modules. In addition, a comprehensive analysis of the results of the MultiWOZ experiments reveals the patterns of post-processing by PPNs that contribute to the overall dialogue performance of the system.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A Survey of Task-Oriented Dialogue Policies Based on Reinforcement Learning
    Xu K.
    Wang Z.-Y.
    Wang X.
    Qin H.
    Long Y.-X.
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (06): : 1201 - 1231
  • [22] Building End-to-End Task-oriented Dialogue Systems Via CNNs And Attention Mechanisms
    Song, Meina
    Chen, Chongfu
    Niu, Peiqing
    Haihong, E.
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI-2019), 2019,
  • [23] HaGAN: Hierarchical Attentive Adversarial Learning for Task-Oriented Dialogue System
    Fang, Ting
    Qiao, Tingting
    Xu, Duanqing
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 98 - 109
  • [24] Topic model for personalized end-to-end task-oriented dialogue
    Gou, Zhinan
    Li, Yan
    Liu, Yuanzhen
    Gao, Kai
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [25] Cold-started Curriculum Learning for Task-oriented Dialogue Policy
    Zhu, Hui
    Zhao, Yangyang
    Qin, Hua
    2021 IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE 2021), 2021, : 100 - 105
  • [26] Variational Reward Estimator Bottleneck: Towards Robust Reward Estimator for Multidomain Task-Oriented Dialogue
    Park, Jeiyoon
    Lee, Chanhee
    Park, Chanjun
    Kim, Kuekyeng
    Lim, Heuiseok
    APPLIED SCIENCES-BASEL, 2021, 11 (14):
  • [27] A Pipeline Optimization Model for QKD Post-processing System
    Zhou, Jianyi
    Liu, Bo
    Zhao, Baokang
    Liu, Bo
    INFORMATION AND COMMUNICATION TECHNOLOGY, 2014, 8407 : 472 - 481
  • [28] Post-Processing Association Rules using Networks and Transductive Learning
    de Padua, Renan
    Rezende, Solange Oliveira
    de Carvalho, Veronica Oliveira
    2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 318 - 323
  • [29] Advances and Challenges in Multi-Domain Task-Oriented Dialogue Policy Optimization
    Rohmatillah, Mahdin
    Chien, Jen-Tzung
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (01)
  • [30] USET : A network based on Utterance hidden State transfEr for Task-oriented dialogue
    Song, Shezheng
    Wang, Changjian
    Zhang, Dongsong
    Peng, Yuxing
    Yuan, Yuan
    Luo, Li
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,