Toward Universal Word Sense Disambiguation Using Deep Neural Networks

被引：9

作者：

Calvo, Hiram ^{[1
]}

Rocha-Ramirez, Arturo P. ^{[1
]}

Moreno-Armendariz, Marco A. ^{[1
]}

Duchanoy, Carlos A. ^{[1
,2
]}

机构：

[1] Inst Politecn Nacl JD Batiz E MO Mendizabal, Ctr Invest Comp, Mexico City 07738, DF, Mexico

[2] Catedra CONACyT, Mexico City 03940, DF, Mexico

来源：

IEEE ACCESS | 2019年 / 7卷

关键词：

Word sense disambiguation; recurrent neural networks; LSTM; multilayer perceptron; senseval english lexical sample test;

D O I：

10.1109/ACCESS.2019.2914921

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Traditionally, approaches based on neural networks to solve the problem of disambiguation of the meaning of words (WSD) use a set of classidiers at the end, which results in a specialization in a single set of words-those for which they were trained. This makes impossible to apply the learned models to words not previously seen in the training corpus. This paper seeks to address a generalization of the problem of WSD in order to solve it through deep neural networks without limiting the method to a fixed set of words, with a performance close to the state-of-the-art, and an acceptable computational cost. We explore different architectures based on multilayer perceptrons, recurrent cells (Long Short-Term Memory-LSTM and Gated Recurrent Units-GRU), and a classifier model. Different sources and dimensions of embeddings were tested as well. The main evaluation was performed on the Senseval 3 English Lexical Sample. To evaluate the application to an unseen set of words, learned models are evaluated in the completely unseen words of a different corpus (Senseval 2 English Lexical Sample), overcoming the random baseline.

引用

页码：60264 / 60275

页数：12

共 21 条

[1]

[Anonymous], 2018, Proceedings of the 27th International Conference on Computational Linguistics

[2]

[Anonymous], 2016, Word sense disambiguation using a bidirectional lstm

[3]

[Anonymous], 2014, P 18 C COMP NAT LANG

[4]

Bird S., 2009, Natural language processing with Python: analyzing text with the natural language toolkit

[5]

Grozea C., 2004, P 3 INT WORKSH EV SY, P1

[6]

Jozefowicz R, 2015, PR MACH LEARN RES, V37, P2342

[7]

Kilgarriff Adam., 2001, Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems, P17, DOI DOI 10.5555/2387364.2387368

[8]

Melamud O., 2016, Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning, DOI DOI 10.18653/V1/K16-1006

[9]

Mihalcea R.F., 2004, Procs, of the SENSEVAL-3: Third Inti Workshop on the Evaluation of Systemsfor the Semantic Analysis ofText, Barcelona, Spain, P25

[10]

Mikolov T., 2013, P NAACL 2013, P746, DOI DOI 10.3109/10826089109058901

← 1 2 3 →