CONTEXTUAL OUT-OF-DOMAIN UTTERANCE HANDLING WITH COUNTERFEIT DATA AUGMENTATION

被引:0
作者
Lee, Sungjin [1 ]
Shalyminov, Igor [1 ,2 ]
机构
[1] Microsoft Res, One Microsoft Way, Redmond, WA 98052 USA
[2] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
Out-of-domain utterance; Neural dialog model; Counterfeit data augmentation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Neural dialog models often lack robustness to anomalous user input and produce inappropriate responses which leads to frustrating user experience. Although there are a set of prior approaches to out-of-domain (OOD) utterance detection, they share a few restrictions: they rely on OOD data or multiple sub-domains, and their OOD detection is context-independent which leads to suboptimal performance in a dialog. The goal of this paper is to propose a novel OOD detection method that does not require OOD data by utilizing counterfeit OOD turns in the context of a dialog. For the sake of fostering further research, we also release new dialog datasets which are 3 publicly available dialog corpora augmented with OOD turns in a controllable way. Our method outperforms state-of-the-art dialog models equipped with a conventional OOD detection mechanism by a large margin in the presence of OOD utterances.
引用
收藏
页码:7205 / 7209
页数:5
相关论文
共 16 条
[1]  
[Anonymous], 2017, ARXIV170203274
[2]  
[Anonymous], 2011, P SIGDIAL 2011 C
[3]  
Bordes A., 2016, Learning end-to-end goal-oriented dialog
[4]  
Cho K., 2014, P SSST 8 8 WORKSH SY, P103, DOI DOI 10.3115/V1/W14-4012
[5]  
El Asri L, 2017, 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017)
[6]  
Eric M, 2017, 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), P37
[7]  
Hochreiter S, 1997, Neural Computation, V9, P1735
[8]  
Kim Y., 2014, P 2014 C EMP METH NA, P1746, DOI [10.3115/v1/D14-1181, DOI 10.3115/V1/D14-1181]
[9]  
Kingma D.P., 2015, INT C LEARN REPR, DOI DOI 10.1002/9781118900772.ETRDS0277
[10]   Out-of-domain utterance detection using classification confidences of multiple topics [J].
Lane, Ian ;
Kawahara, Tatsuya ;
Matsui, Tomoko ;
Nakamura, Satoshi .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :150-161