Intent Detection-Based Lithuanian Chatbot Created via Automatic DNN Hyper-Parameter Optimization

被引:3
作者
Kapociute-Dzikiene, Jurgita [1 ,2 ]
机构
[1] JSC Tilde Informat Technol, Naugarduko Str 100, LT-03160 Vilnius, Lithuania
[2] Vytautas Magnus Univ, Fac Informat, Kaunas, Lithuania
来源
HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE (HLT 2020) | 2020年 / 328卷
关键词
NLU; Intent detection; LSTM; BiLSTM; CNN; hyper-parameter optimization; fastText and BERT embeddings; the Lithuanian language;
D O I
10.3233/FAIA200608
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle an intent detection problem for the Lithuanian language with the real supervised data. Our main focus is on the enhancement of the Natural Language Understanding (NLU) module, responsible for the comprehension of user's questions. The NLU model is trained with a properly selected word vectorization type and Deep Neural Network (DNN) classifier. During our experiments, we have experimentally investigated fastText and BERT embeddings. Besides, we have automatically optimized different architectures and hyper-parameters of the following DNN approaches: Long Short-Term Memory (LSTM), Bidirectional LSTM (BiLSTM) and Convolutional Neural Network (CNN). The highest accuracy=similar to 0.715 (similar to 0.675 and similar to 0.625 over random and majority baselines, respectively) was achieved with the CNN classifier applied on a top of BERT embeddings. The detailed error analysis revealed that prediction accuracies degrade for the least covered intents and due to intent ambiguities; therefore, in the future, we are planning to make necessary adjustments to boost the intent detection accuracy for the Lithuanian language even more.
引用
收藏
页码:95 / 102
页数:8
相关论文
共 23 条
[1]  
[Anonymous], 2011, NeurIPS
[2]   FastText-Based Intent Detection for Inflected Languages [J].
Balodis, Kaspars ;
Deksne, Daiga .
INFORMATION, 2019, 10 (05)
[3]  
Braun D, 2017, 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), P174
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Du N, 2018, ABS181209471 CORR
[6]   Framewise phoneme classification with bidirectional LSTM and other neural network architectures [J].
Graves, A ;
Schmidhuber, J .
NEURAL NETWORKS, 2005, 18 (5-6) :602-610
[7]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[8]  
Kim JK, 2016, IEEE W SP LANG TECH, P414, DOI 10.1109/SLT.2016.7846297
[9]  
Kim Y., 2014, P EMNLP 19, P1746, DOI DOI 10.3115/V1/D14-1181
[10]  
Larson S, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P1311