Robust Spoken Language Understanding with Acoustic and Domain Knowledge

被引:6
作者
Li, Hao [1 ]
Liu, Chen [1 ]
Zhu, Su [1 ]
Yu, Kai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
来源
ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION | 2019年
关键词
Spoken Language Understanding; Robustness; NETWORKS;
D O I
10.1145/3340555.3356100
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Spoken language understanding (SLU) converts user utterances into structured semantic forms. There are still two main issues for SLU: robustness to ASR-errors and the data sparsity of new and extended domains. In this paper, we propose a robust SLU system by leveraging both acoustic and domain knowledge. We extract audio features by training ASR models on a large number of utterances without semantic annotations. For exploiting domain knowledge, we design lexicon features from the domain ontology and propose an error elimination algorithm to help predicted values recovered from ASR-errors. The results of CATSLU challenge show that our systems can outperform all of the other teams across four domains.
引用
收藏
页码:531 / 535
页数:5
相关论文
共 24 条
[1]  
[Anonymous], 2015, 16 ANN C INT SPEECH
[2]  
[Anonymous], ARXIV180700267
[3]  
[Anonymous], 2007, INT P
[4]  
[Anonymous], 2016, 17 ANN C INT SPEECH
[5]  
[Anonymous], 2014, ARXIV
[6]  
[Anonymous], 2019, 2019 INT C MULT INT
[7]  
[Anonymous], 2019, ARXIV190505526
[8]  
[Anonymous], 2012, SUPERVISED SEQUENCE
[9]   Towards Zero-Shot Frame Semantic Parsing for Domain Scaling [J].
Bapna, Ankur ;
Tur, Gokhan ;
Hakkani-Tur, Dilek ;
Heck, Larry .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :2476-2480
[10]   Spoken language understanding - Interpreting the signs given by a speech signal [J].
De Mori, Renato ;
Bechet, Frederic ;
Hakkani-Tuer, Dilek ;
McTear, Michael ;
Riccardi, Giuseppe ;
Tu, Gokhan .
IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (03) :50-58