Multi-domain gate and interactive dual attention for multi-domain dialogue state tracking

Cited by: 1
Authors
Jia, Xu [1 ]
Zhang, Ruochen [2 ]
Peng, Min [1 ]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Lappeenranta Univ Technol, Sch Engn Sci, Lahti, Finland
Keywords
Multi-domain dialogue state tracking; Multi-domain gate; Interactive dual attention
DOI
10.1016/j.knosys.2024.111383
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-domain dialogue state tracking (MDST) is a crucial component of task-oriented dialogue systems. In multi-turn dialogues between the user and the system, MDST must continuously track dialogue states based on the information in the current dialogue utterance and the dialogue states from the preceding turn. Recent work handles multi-domain dialogue tasks by treating each state as an individual label, but neglects the potential benefits of the domain-specific information associated with these states. In addition, existing models do not effectively model the explicit correlations between dialogue contextual semantics and dialogue states. In this paper, we introduce two modules, the multi-domain gate and interactive dual attention, to address these concerns. To exploit the domain-specific information within states efficiently, we use the multi-domain gate as indices to amplify the states pertinent to the current utterance domain while filtering out irrelevant states. Interactive dual attention comprises utterance attention and slot attention and models the correlation between dialogue utterances and slots. It also ensures that each dialogue utterance interacts with the slots only once to derive all state updates, which keeps computation efficient. Specifically, slot attention models the associations between slots by incorporating semantic features to predict updates in slot values. Meanwhile, utterance attention captures the semantics of the dialogue context and integrates it with slot name features to generate dialogue states. All of these modules are built on a slot-independent framework, which allows slots to scale efficiently and avoids issues related to model input limitations.
Experimental results on the multi-domain dialogue dataset MultiWOZ 2.4 show that our model outperforms the baselines. We also analyse the effectiveness of the multi-domain gate and interactive dual attention modules, illustrating their contribution to model performance through visualization and case studies.
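The gating idea described in the abstract (amplify states from the current utterance's domain, filter out the rest) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `multi_domain_gate`, the per-domain logits, the sigmoid gate, the 0.5 threshold, and the `domain-slot` naming are all assumptions made for illustration only.

```python
import math


def sigmoid(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def multi_domain_gate(domain_logits, slot_features, threshold=0.5):
    """Hypothetical domain gate (illustrative, not the paper's method).

    domain_logits: dict mapping a domain name to an utterance-level score.
    slot_features: dict mapping "domain-slot" to a list of floats.
    Each slot's features are scaled by its domain's gate value; slots
    whose gate falls below `threshold` are zeroed out (filtered).
    """
    gates = {d: sigmoid(z) for d, z in domain_logits.items()}
    gated = {}
    for slot, feats in slot_features.items():
        domain = slot.split("-", 1)[0]
        g = gates.get(domain, 0.0)
        if g < threshold:
            g = 0.0  # treat the slot as irrelevant to this utterance
        gated[slot] = [g * v for v in feats]
    return gates, gated


# Assumed example: the utterance is mostly about hotels.
gates, gated = multi_domain_gate(
    {"hotel": 2.0, "train": -2.0},
    {"hotel-area": [1.0, 2.0], "train-day": [1.0, 1.0]},
)
```

In this sketch the `train-day` features are zeroed because the assumed train gate falls below the threshold, while the `hotel-area` features pass through scaled by the hotel gate.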
Pages: 14
Related papers
50 in total
  • [31] GCDST: A Graph-based and Copy-augmented Multi-domain Dialogue State Tracking
    Wu, Peng
    Zou, Bowei
    Jiang, Ridong
    Aw, Ai Ti
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1063 - 1073
  • [32] MULTI-DOMAIN DIALOGUE SUCCESS CLASSIFIERS FOR POLICY TRAINING
    Vandyke, David
    Su, Pei-Hao
    Gasic, Milica
    Mrksic, Nikola
    Wen, Tsung-Hsien
    Young, Steve
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 763 - 770
  • [33] PyDial: A Multi-domain Statistical Dialogue System Toolkit
    Ultes, Stefan
    Rojas-Barahona, Lina
    Su, Pei-Hao
    Vandyke, David
    Kim, Dongho
    Casanueva, Inigo
    Budzianowski, Pawel
    Mrksic, Nikola
    Wen, Tsung-Hsien
    Gasic, Milica
    Young, Steve
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 73 - 78
  • [34] ClippyScript: A Programming Language for Multi-Domain Dialogue Systems
    Seide, Frank
    McDirmid, Sean
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 242 - 245
  • [35] Multi-domain Dialog State Tracking using Recurrent Neural Networks
    Mrksic, Nikola
    Seaghdha, Diarmuid O.
    Thomson, Blaise
    Gasic, Milica
    Su, Pei-Hao
    Vandyke, David
    Wen, Tsung-Hsien
    Young, Steve
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 794 - 799
  • [36] Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems
    Wu, Chien-Sheng
    Madotto, Andrea
    Hosseini-Asl, Ehsan
    Xiong, Caiming
    Socher, Richard
    Fung, Pascale
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 808 - 819
  • [37] Semi-supervised single- and multi-domain regression with multi-domain training
    Michaeli, Tomer
    Eldar, Yonina C.
    Sapiro, Guillermo
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2012, 1 (01) : 68 - 97
  • [38] Multi-domain Attention Fusion Network For Language Recognition
    Ju M.
    Xu Y.
    Ke D.
    Su K.
    SN Computer Science, 4 (1)
  • [39] Domain-Slot Relationship Modeling Using a Pre-Trained Language Encoder for Multi-Domain Dialogue State Tracking
    An, Jinwon
    Cho, Sungzoon
    Bang, Junseong
    Kim, Misuk
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 2091 - 2102
  • [40] Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system
    Wootton, Craig
    McTear, Michael
    Anderson, Terry
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 693 - 696