Multi-domain gate and interactive dual attention for multi-domain dialogue state tracking

被引：1

作者：

Jia, Xu ^{[1
]}

Zhang, Ruochen ^{[2
]}

Peng, Min ^{[1
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[2] Lappeenranta Univ Technol, Sch Engn Sci, Lahti, Finland

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 286卷

关键词：

Multi-domain dialogue state tracking; Multi-domain gate; Interactive dual attention;

D O I：

10.1016/j.knosys.2024.111383

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi -domain dialogue state tracking (MDST) is a crucial component of task -oriented dialogue systems. In the context of multi -turn dialogues between the user and the system, MDST necessitates the continuous keeping track of dialogue states based on the information present in the current dialogue utterance and the dialogue states from the preceding turn. Recent work achieves the successful execution of multi -domain dialogue tasks by adopting an approach that treats each state as an individual label, while regrettably neglecting the potential benefits of incorporating domain -specific information associated with these states. Simultaneous, existing models exhibit a deficiency in effectively modelling the explicit correlations between dialogue contextual semantics and dialogue states. In this paper, we introduce the modules of multi -domain gate and interactive dual attention as novel solutions to address the aforementioned concerns. For the efficient exploitation of domain -specific information within states, we leverage the multi -domain gate as indices to amplify the states pertinent to the current utterance domain while filtering out irrelevant states. Interactive dual attention comprises utterance attention and slot attention, effectively modelling the correlation between dialogue utterances and slots. Additionally, interactive dual attention ensures that each dialogue utterance interacts with the slots once to derive all state updates, thereby ensuring computational efficiency. Specifically, slot attention models the associations between slots by incorporating semantic features to forecast updates in slot values. Meanwhile, utterance attention captures the semantics of dialogue context and integrates it with slot name features to generate dialogue states. All the aforementioned modules are designed based on a slot -independent framework, enabling efficient scalability of slots and circumventing issues related to model input limitations. The experimental results on the multi -domain dialogues dataset MultiWOZ 2.4 demonstrate the superior performance of our model compared to the baselines. Additionally, we conduct a comprehensive analysis of the effectiveness of the multi -domain gate and interactive dual attention modules, elucidating their contribution to the performance of the model through visualization and case studies.

引用

页数：14

共 50 条

[31] GCDST: A Graph-based and Copy-augmented Multi-domain Dialogue State Tracking
Wu, Peng
Zou, Bowei
Jiang, Ridong
Aw, Ai Ti
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1063 - 1073
[32] MULTI-DOMAIN DIALOGUE SUCCESS CLASSIFIERS FOR POLICY TRAINING
Vandyke, David
Su, Pei-Hao
Gasic, Milica
Mrksic, Nikola
Wen, Tsung-Hsien
Young, Steve
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 763 - 770
[33] PyDial: A Multi-domain Statistical Dialogue System Toolkit
Ultes, Stefan
Rojas-Barahona, Lina
Su, Pei-Hao
Vandyke, David
Kim, Dongho
Casanueva, Inigo
Budzianowski, Pawel
Mrksic, Nikola
Wen, Tsung-Hsien
Gasic, Milica
Young, Steve
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 73 - 78
[34] ClippyScript: A Programming Language for Multi-Domain Dialogue Systems
Seide, Frank
McDirmid, Sean
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 242 - 245
[35] Multi-domain Dialog State Tracking using Recurrent Neural Networks
Mrksic, Nikola
Seaghdha, Diarmuid O.
Thomson, Blaise
Gasic, Milica
Su, Pei-Hao
Vandyke, David
Wen, Tsung-Hsien
Young, Steve
PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 794 - 799
[36] Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems
Wu, Chien-Sheng
Madotto, Andrea
Hosseini-Asl, Ehsan
Xiong, Caiming
Socher, Richard
Fung, Pascale
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 808 - 819
[37] Semi-supervised single- and multi-domain regression with multi-domain training
Michaeli, Tomer
Eldar, Yonina C.
Sapiro, Guillermo
INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2012, 1 (01) : 68 - 97
[38] Multi-domain Attention Fusion Network For Language Recognition
Ju M.
Xu Y.
Ke D.
Su K.
SN Computer Science, 4 (1)
[39] Domain-Slot Relationship Modeling Using a Pre-Trained Language Encoder for Multi-Domain Dialogue State Tracking
An, Jinwon
Cho, Sungzoon
Bang, Junseong
Kim, Misuk
IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 2091 - 2102
[40] Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system
Wootton, Craig
McTear, Michael
Anderson, Terry
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 693 - 696

← 1 2 3 4 5 →