Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

Cited by: 0

Authors:

Lin, Hsien-chin [1]

Lubis, Nurul [1]

Hu, Songbo [2]

van Niekerk, Carel [1]

Geishauser, Christian [1]

Heck, Michael [1]

Feng, Shutong [1]

Gasic, Milica [1]

Affiliations:

[1] Heinrich Heine Univ Dusseldorf, Dusseldorf, Germany

[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England

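Source:

SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021) | 2021

Funding:

European Research Council

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract: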

Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time-consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for complex domains the number of rules quickly becomes intractable. State-of-the-art data-driven user simulators, on the other hand, are still domain-dependent. This means that adaptation to each new domain requires redesigning and retraining. In this work, we propose a domain-independent transformer-based user simulator (TUS). The structure of our TUS is not tied to a specific domain, enabling domain generalisation and learning of cross-domain user behaviour from data. We compare TUS with the state of the art using automatic as well as human evaluations. TUS can compete with rule-based user simulators on pre-defined domains and is able to generalise to unseen domains in a zero-shot fashion.


Pages: 445-456

Page count: 12
