Toward Universal Word Sense Disambiguation Using Deep Neural Networks

被引:9
作者
Calvo, Hiram [1 ]
Rocha-Ramirez, Arturo P. [1 ]
Moreno-Armendariz, Marco A. [1 ]
Duchanoy, Carlos A. [1 ,2 ]
机构
[1] Inst Politecn Nacl JD Batiz E MO Mendizabal, Ctr Invest Comp, Mexico City 07738, DF, Mexico
[2] Catedra CONACyT, Mexico City 03940, DF, Mexico
关键词
Word sense disambiguation; recurrent neural networks; LSTM; multilayer perceptron; senseval english lexical sample test;
D O I
10.1109/ACCESS.2019.2914921
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditionally, approaches based on neural networks to solve the problem of disambiguation of the meaning of words (WSD) use a set of classidiers at the end, which results in a specialization in a single set of words-those for which they were trained. This makes impossible to apply the learned models to words not previously seen in the training corpus. This paper seeks to address a generalization of the problem of WSD in order to solve it through deep neural networks without limiting the method to a fixed set of words, with a performance close to the state-of-the-art, and an acceptable computational cost. We explore different architectures based on multilayer perceptrons, recurrent cells (Long Short-Term Memory-LSTM and Gated Recurrent Units-GRU), and a classifier model. Different sources and dimensions of embeddings were tested as well. The main evaluation was performed on the Senseval 3 English Lexical Sample. To evaluate the application to an unseen set of words, learned models are evaluated in the completely unseen words of a different corpus (Senseval 2 English Lexical Sample), overcoming the random baseline.
引用
收藏
页码:60264 / 60275
页数:12
相关论文
共 21 条
[1]  
[Anonymous], 2018, Proceedings of the 27th International Conference on Computational Linguistics
[2]  
[Anonymous], 2016, Word sense disambiguation using a bidirectional lstm
[3]  
[Anonymous], 2014, P 18 C COMP NAT LANG
[4]  
Bird S., 2009, Natural language processing with Python: analyzing text with the natural language toolkit
[5]  
Grozea C., 2004, P 3 INT WORKSH EV SY, P1
[6]  
Jozefowicz R, 2015, PR MACH LEARN RES, V37, P2342
[7]  
Kilgarriff Adam., 2001, Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems, P17, DOI DOI 10.5555/2387364.2387368
[8]  
Melamud O., 2016, Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning, DOI DOI 10.18653/V1/K16-1006
[9]  
Mihalcea R.F., 2004, Procs, of the SENSEVAL-3: Third Inti Workshop on the Evaluation of Systemsfor the Semantic Analysis ofText, Barcelona, Spain, P25
[10]  
Mikolov T., 2013, P NAACL 2013, P746, DOI DOI 10.3109/10826089109058901