Performance Improvement on Traditional Chinese Task-Oriented Dialogue Systems With Reinforcement Learning and Regularized Dropout Technique

Cited by: 3
Authors
Sheu, Jeng-Shin [1]
Wu, Siang-Ru [1]
Wu, Wen-Hung [2]
Affiliations
[1] Natl Yunlin Univ Sci & Technol, Dept Comp Sci & Informat Engn, Yunlin 640002, Taiwan
[2] Ponddy Educ Taiwan Ltd, New Taipei 231, Taiwan
Keywords
Task analysis; Reinforcement learning; Computational modeling; Artificial intelligence; Tokenization; Data models; NLP; regularized dropout; reinforcement learning; task-oriented dialogue
DOI
10.1109/ACCESS.2023.3248796
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Conversational voice assistant applications are being developed rapidly around the world. This paper aims to develop traditional Chinese multi-domain task-oriented dialogue (TOD) systems. Such systems are typically implemented using a pipeline approach, in which submodules are optimized independently, leading to inconsistencies among them. Instead, this paper implements end-to-end multi-domain TOD models using pre-trained deep neural networks (DNNs). This allows all the submodules to be integrated into a single DNN model, which resolves the inconsistencies. Data shortages are common in conversational natural language processing (NLP) tasks that use DNN models. Dropout regularization has been widely used to mitigate the overfitting caused by insufficient training data. However, the randomness it introduces leads to non-negligible discrepancies between training and inference. Pre-trained language models, for their part, have provided effective regularization for NLP tasks, but fine-tuning them has an inherent disadvantage: it suffers from exposure bias and loss-evaluation mismatch. We propose a reinforcement learning (RL) approach to address both issues. Furthermore, we adopt a method called regularized dropout (R-Drop) to reduce the inconsistency introduced by the dropout layers of DNNs. Experimental results show that both the proposed RL approach and the R-Drop technique significantly improve the joint goal accuracy (JGA) score and the combined score of the traditional Chinese TOD system in the tasks of dialogue state tracking (DST) and end-to-end sentence prediction, respectively.
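As an illustration of the R-Drop idea mentioned in the abstract, the sketch below computes the commonly described R-Drop objective for one classification example: the same input is passed through the network twice with different dropout masks, and the loss adds a symmetric KL-divergence term between the two output distributions to the usual negative log-likelihood. This is a minimal sketch and not the authors' implementation; the function names, the weight `alpha`, and the example logits are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Convert raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same classes."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def r_drop_loss(logits_a, logits_b, target, alpha=1.0):
    """R-Drop style loss for one example (hypothetical helper).

    logits_a, logits_b: outputs of two forward passes over the same input,
        each with an independently sampled dropout mask.
    target: index of the correct class.
    alpha: assumed weight of the consistency term.
    """
    p = softmax(logits_a)
    q = softmax(logits_b)
    # Negative log-likelihood summed over both passes.
    nll = -(math.log(p[target]) + math.log(q[target]))
    # Symmetric KL term penalizes disagreement between the two passes.
    sym_kl = 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))
    return nll + alpha * sym_kl


if __name__ == "__main__":
    # Two passes that disagree slightly because of dropout (made-up logits).
    print(r_drop_loss([2.0, 0.5, -1.0], [1.0, 1.5, -1.0], target=0, alpha=1.0))
```

When the two passes produce identical outputs, the consistency term is zero and the loss reduces to twice the ordinary cross-entropy, so the extra term only penalizes dropout-induced disagreement.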
Pages: 19849-19862
Number of pages: 14