A Comparative Evaluation of Word Sense Disambiguation Algorithms for German

Citations: 0
Authors
Henrich, Verena [1]
Hinrichs, Erhard [1]
Affiliation
[1] University of Tübingen, Department of Linguistics, D-72074 Tübingen, Germany
Source
LREC 2012 - Eighth International Conference on Language Resources and Evaluation | 2012
Keywords
Word sense disambiguation; German; combined classifiers
DOI
Not available
Chinese Library Classification
H0 [Linguistics];
Subject classification codes
030303 ; 0501 ; 050102 ;
Abstract
The present paper explores a wide range of word sense disambiguation (WSD) algorithms for German. These WSD algorithms are based on a suite of semantic relatedness measures, including path-based, information-content-based, and gloss-based methods. Since the individual algorithms produce diverse results in terms of precision and thus complement each other well in terms of coverage, a set of combined algorithms is investigated and compared in performance to the individual algorithms. Among the single algorithms considered, a word overlap method derived from the Lesk algorithm that uses Wiktionary glosses and GermaNet lexical fields yields the best F-score of 56.36. This result is outperformed by a combined WSD algorithm that uses weighted majority voting and obtains an F-score of 63.59. The WSD experiments utilize the German wordnet GermaNet as a sense inventory as well as WebCAGe (short for: Web-Harvested Corpus Annotated with GermaNet Senses), a newly constructed, sense-annotated corpus for this language. The WSD experiments also confirm that WSD performance is lower for words with fine-grained sense distinctions compared to words with coarse-grained senses.
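The two techniques named in the abstract, a Lesk-style word overlap measure and weighted majority voting over several algorithms, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy glosses, the sense labels, and the weights below are all hypothetical, and real glosses would come from resources such as Wiktionary and GermaNet rather than from hand-written token lists.

```python
from collections import defaultdict


def overlap_score(context_tokens, gloss_tokens):
    """Count the distinct tokens shared by the context and a sense gloss."""
    return len(set(context_tokens) & set(gloss_tokens))


def lesk_choose(context_tokens, sense_glosses):
    """Return the sense whose gloss overlaps most with the context.

    Returns None when no gloss shares any token with the context,
    i.e. the algorithm abstains (this affects coverage).
    """
    best_sense, best_score = None, 0
    for sense, gloss_tokens in sense_glosses.items():
        score = overlap_score(context_tokens, gloss_tokens)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense


def weighted_majority_vote(votes, weights):
    """Combine per-algorithm sense choices using per-algorithm weights.

    votes:   {algorithm_name: chosen_sense or None}
    weights: {algorithm_name: float}; hypothetical values, e.g. derived
             from each algorithm's performance on held-out data.
    """
    totals = defaultdict(float)
    for algorithm, sense in votes.items():
        if sense is not None:  # abstaining algorithms cast no vote
            totals[sense] += weights.get(algorithm, 0.0)
    if not totals:
        return None
    # On ties, the sense that received a vote first wins.
    return max(totals, key=totals.get)


# Toy example for the ambiguous German noun "Bank" (hypothetical sense labels).
context = ["geld", "konto", "zinsen"]
glosses = {
    "bank_finance": ["institut", "geld", "konto"],
    "bank_seat": ["sitz", "park", "holz"],
}
votes = {
    "lesk": lesk_choose(context, glosses),
    "path": "bank_seat",
    "ic": "bank_seat",
}
weights = {"lesk": 0.6, "path": 0.25, "ic": 0.25}
combined = weighted_majority_vote(votes, weights)
```

With these made-up weights the single higher-weighted overlap vote outweighs the two lower-weighted votes, whereas an unweighted majority would pick the other sense. This is the kind of difference a weighting scheme is meant to capture.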
Pages: 576-583
Page count: 8