Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

被引:0
作者
Lin, Hsien-chin [1 ]
Lubis, Nurul [1 ]
Hu, Songbo [2 ]
van Niekerk, Carel [1 ]
Geishauser, Christian [1 ]
Heck, Michael [1 ]
Feng, Shutong [1 ]
Gasic, Milica [1 ]
机构
[1] Heinrich Heine Univ Dusseldorf, Dusseldorf, Germany
[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England
来源
SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021) | 2021年
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for complex domains the number of rules quickly becomes intractable. State-of-the-art datadriven user simulators, on the other hand, are still domain-dependent. This means that adaptation to each new domain requires redesigning and retraining. In this work, we propose a domain-independent transformer-based user simulator (TUS). The structure of our TUS is not tied to a specific domain, enabling domain generalisation and learning of cross-domain user behaviour from data. We compare TUS with the state of the art using automatic as well as human evaluations. TUS can compete with rule-based user simulators on pre-defined domains and is able to generalise to unseen domains in a zero-shot fashion.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 30 条
  • [1] Budzianowski P, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P5016
  • [2] Cuayáhuitl H, 2005, 2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P290
  • [3] A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization
    Daubigney, Lucie
    Geist, Matthieu
    Chandramohan, Senthilkumar
    Pietquin, Olivier
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2012, 6 (08) : 891 - 902
  • [4] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [5] User modeling for spoken dialogue system evaluation
    Eckert, W
    Levin, E
    Pieraccini, R
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 80 - 87
  • [6] A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
    El Asri, Layla
    He, Jing
    Suleman, Kaheer
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1151 - 1155
  • [7] Eric M, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P422
  • [8] Eshky A., 2012, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, P71
  • [9] Gasic M., 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), P312, DOI 10.1109/ASRU.2011.6163950
  • [10] Georgila K., 2006, INTERSPEECH 2006 ICS