Unsupervised Spoken Language Understanding for a Multi-Domain Dialog System

被引:8
作者
Lee, Donghyeon [1 ]
Jeong, Minwoo [1 ]
Kim, Kyungduk [1 ]
Ryu, Seonghan [1 ]
Lee, Gary Geunbae [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Dept Comp Sci & Engn, Pohang 790784, South Korea
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 11期
关键词
Dialog system; spoken language understanding; unsupervised learning; MANAGEMENT;
D O I
10.1109/TASL.2013.2280212
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes an unsupervised spoken language understanding (SLU) framework for a multi-domain dialog system. Our unsupervised SLU framework applies a non-parametric Bayesian approach to dialog acts, intents and slot entities, which are the components of a semantic frame. The proposed approach reduces the human effort necessary to obtain a semantically annotated corpus for dialog system development. In this study, we analyze clustering results using various evaluation metrics for four dialog corpora. We also introduce a multi-domain dialog system that uses the unsupervised SLU framework. We argue that our unsupervised approach can help overcome the annotation acquisition bottleneck in developing dialog systems. To verify this claim, we report a dialog system evaluation, in which our method achieves competitive results in comparison with a system that uses a manually annotated corpus. In addition, we conducted several experiments to explore the effect of our approach on reducing development costs. The results show that our approach be helpful for the rapid development of a prototype system and reducing the overall development costs.
引用
收藏
页码:2451 / 2464
页数:14
相关论文
共 55 条
  • [1] Allen J., 2000, Natural Language Engineering, V6, P213, DOI 10.1017/S135132490000245X
  • [2] [Anonymous], 1993, 31 ANN M ASS COMP LI
  • [3] [Anonymous], 2002, P 3 SIGDIAL WORKSH D
  • [4] [Anonymous], 2006, Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Posters Demonstrations, EACL '06
  • [5] [Anonymous], P EUR
  • [6] [Anonymous], 2006, PROCEEDING INT C COM
  • [7] [Anonymous], 2001, PROC 18 INT C MACH L
  • [8] Barzilay R, 2004, HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P113
  • [9] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [10] The RavenClaw dialog management framework: Architecture and systems
    Bohus, Dan
    Rudnicky, Alexander I.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2009, 23 (03) : 332 - 361