Unsupervised learning of kb queries in task-oriented dialogs

被引:0
|
作者
Raghu D. [1 ,2 ]
Gupta N. [3 ]
Mausam [1 ]
机构
[1] IIT Delhi, New Delhi
[2] IBM Research, New Delhi
[3] LimeChat, Gurgaon
关键词
D O I
10.1162/tacl_a_00372/2021.
中图分类号
学科分类号
摘要
Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses. Existing approaches require dialog datasets to explicitly annotate these KB queries—these annotations can be time consuming, and expensive. In response, we define the novel problems of predicting the KB query and training the dialog agent, without explicit KB query annotation. For query prediction, we propose a reinforcement learning (RL) baseline, which rewards the generation of those queries whose KB results cover the entities mentioned in subsequent dialog. Further analysis reveals that correlation among query attributes in KB can significantly confuse memory augmented policy optimization (MAPO), an existing state of the art RL agent. To address this, we improve the MAPO baseline with simple but important modifications suited to our task. To train the full TOD system for our setting, we propose a pipelined approach: it independently predicts when to make a KB query (query position predictor), then predicts a KB query at the predicted position (query predictor), and uses the results of predicted query in subsequent dialog (next response predictor). Overall, our work proposes first solutions to our novel problem, and our analysis highlights the research challenges in training TOD systems without query annotation. © 2021, MIT Press Journals. All rights reserved.
引用
收藏
页码:374 / 390
页数:16
相关论文
共 50 条
  • [21] Task-oriented developmental learning for humanoid robots
    Tan, KC
    Chen, YJ
    Tan, KK
    Lee, TH
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2005, 52 (03) : 906 - 914
  • [22] Continual Learning in Task-Oriented Dialogue Systems
    Madotto, Andrea
    Lin, Zhaojiang
    Zhou, Zhenpeng
    Moon, Seungwhan
    Crook, Paul
    Liu, Bing
    Yu, Zhou
    Cho, Eunjoon
    Fung, Pascale
    Wang, Zhiguang
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7452 - 7467
  • [23] Learning Folksonomies from Task-Oriented Dialogues
    Puppi Wanderley, Gregory Moro
    Paraiso, Emerson Cabrera
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 360 - 367
  • [24] Task-Oriented Reinforcement Learning with Interest State Representation
    Li, Ziyi
    Hu, Xiangtao
    Zhang, Yongle
    Zhou, Fujie
    2024 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS, ICARM 2024, 2024, : 721 - 728
  • [25] A Task-oriented Chatbot Based on LSTM and Reinforcement Learning
    Hsueh, Yu-Ling
    Chou, Tai-Liang
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [26] A Task-oriented Chatbot Based on LSTM and Reinforcement Learning
    Chou, Tai-Liang
    Hsueh, Yu-Ling
    NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 87 - 91
  • [27] TASK-ORIENTED ARCHITECTURES
    BISIANI, R
    MAUERSBERG, H
    REDDY, R
    PROCEEDINGS OF THE IEEE, 1983, 71 (07) : 885 - 898
  • [28] Unifying Task-Oriented Knowledge Graph Learning and Recommendation
    Li, Qianyu
    Tang, Xiaoli
    Wang, Tengyun
    Yang, Haizhi
    Song, Hengjie
    IEEE ACCESS, 2019, 7 : 115816 - 115828
  • [29] Adversarial Learning of Task-Oriented Neural Dialog Models
    Liu, Bing
    Lane, Ian
    19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 350 - 359
  • [30] Comparing Cascaded LSTM Architectures for Generating Head Motion from Speech in Task-Oriented Dialogs
    Nguyen, Duc-Canh
    Bailly, Gerard
    Elisei, Frederic
    HUMAN-COMPUTER INTERACTION: INTERACTION TECHNOLOGIES, HCI INTERNATIONAL 2018, PT III, 2018, 10903 : 164 - 175