On the effects of using word2vec representations in neural networks for dialogue act recognition

被引:42
作者
Cerisara, Christophe [1 ]
Kral, Pavel [2 ,3 ]
Lenc, Ladislav [2 ,3 ]
机构
[1] LORIA, UMR7503, Campus Sci, F-54506 Vandoeuvre Les Nancy, France
[2] Univ West Bohemia, Fac Sci Appl, Dept Comp Sci & Engn, Plzen, Czech Republic
[3] Univ West Bohemia, Fac Sci Appl, NTIS, Plzen, Czech Republic
关键词
Dialogue act; Deep learning; LSTM; Word embeddings; Word2vec;
D O I
10.1016/j.csl.2017.07.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dialogue act recognition is an important component of a large number of natural language processing pipelines. Many research works have been carried out in this area, but relatively few investigate deep neural networks and word embeddings. This is surprising, given that both of these techniques have proven exceptionally good in most other language-related domains. We propose in this work a new deep neural network that explores recurrent models to capture word sequences within sentences, and further study the impact of pretrained word embeddings. We validate this model on three languages: English, French and Czech. The performance of the proposed approach is consistent across these languages and it is comparable to the state-of-the-art results in English. More importantly, we confirm that deep neural networks indeed outperform a Maximum Entropy classifier, which was expected. However, and this is more surprising, we also found that standard word2vec embeddings do not seem to bring valuable information for this task and the proposed model, whatever the size of the training corpus is. We thus further analyse the resulting embeddings and conclude that a possible explanation may be related to the mismatch between the type of lexical-semantic information captured by the word2vec embeddings, and the kind of relations between words that is the most useful for the dialogue act recognition task. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:175 / 193
页数:19
相关论文
共 44 条
[1]  
Allen J., 1997, DRAFT DAMSL DIALOG A
[2]  
Ang J, 2005, INT CONF ACOUST SPEE, P1061
[3]  
[Anonymous], 1997, SWITCHBOARD SWBD DAM
[4]  
[Anonymous], P 4 SIGDIAL WORKSH D
[5]  
[Anonymous], P 2 INT C HYBR INT S
[6]  
[Anonymous], TEXT SPEECH DIALOGUE
[7]  
[Anonymous], P IEEE WORKSH SPEECH
[8]  
[Anonymous], 2002, 3 ACL SIGDIAL WORKSH
[9]  
[Anonymous], 2004, TR04002 INT COMP SCI
[10]  
[Anonymous], ASS COMPUTATIONAL LI