Labeled Data Generation with Encoder-decoder LSTM for Semantic Slot Filling

被引:23
作者
Kurata, Gakuto [1 ]
Xiang, Bing [1 ]
Zhou, Bowen [1 ]
机构
[1] IBM Watson, Yorktown Hts, NY 10598 USA
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
spoken language understanding; semantic slot filling; labeled data generation; encoder-decoder LSTM; ATIS; NEURAL-NETWORKS;
D O I
10.21437/Interspeech.2016-727
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To train a model for semantic slot filling, manually labeled data in which each word is annotated with a semantic slot label is necessary while manually preparing such data is costly. Starting from a small amount of manually labeled data, we propose a method to generate the labeled data with using the encoder decoder LSTM. We first train the encoder-decoder LSTM that accepts and generates the same manually labeled data. Then, to generate a wide variety of labeled data, we add perturbations to the vector that encodes the manually labeled data and generate labeled data with the decoder LSTM based on the perturbated encoded vector. We also try to enhance the encoder decoder LSTM to generate the word sequences and their label sequences separately to obtain new pairs of words and their labels. Through the experiments with the standard ATIS slot filling task, by using the generated data, we obtained improvement in slot filling accuracy over the strong baseline with the NN-based slot filling model.
引用
收藏
页码:725 / 729
页数:5
相关论文
共 35 条
[1]  
[Anonymous], 2014, Generating sequences with recurrent neural networks
[2]  
[Anonymous], 2013, P 2013 C EMPIRICAL M
[3]  
[Anonymous], 2013, P ANN C INT SPEECH C
[4]  
[Anonymous], 2015, ARXIV150601057
[5]  
[Anonymous], 2016, P 2016 C EMP METH NA
[6]  
[Anonymous], 1997, Neural Computation
[7]  
[Anonymous], P INTERSPEECH
[8]  
[Anonymous], 2006, PROCEEDING INT C COM
[9]  
[Anonymous], 2015, ARXIV150600195
[10]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127