Automatic Construction of a Morphological Dictionary of Multi-Word Units

被引:0
作者
Krstev, Cvetana [1 ]
Stankovic, Ranka [2 ]
Obradovic, Ivan [2 ]
Vitas, Dusko [3 ]
Utvic, Milos [1 ]
机构
[1] Univ Belgrade, Fac Philol, Belgrade 11001, Serbia
[2] Univ Belgrade, Fac Min & Geol, Belgrade, Serbia
[3] Univ Belgrade, Fac Math, Belgrade, Serbia
来源
ADVANCES IN NATURAL LANGUAGE PROCESSING | 2010年 / 6233卷
关键词
electronic dictionary; Serbian; morphology; inflection; multi-word units; noun phrases; query expansion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation of the proposed procedure on several different sets of data. Finally, we discuss some implementation issues and present how the same procedure is used for other languages.
引用
收藏
页码:226 / +
页数:2
相关论文
共 15 条
[1]  
Courtois B, 1990, DICT ELECT FRANCAIS
[2]  
COURTOIS B, 1997, 55 LADL U PAR 7
[3]  
ELIA A, ATLAS DICOMP DIZIONA
[4]  
GRASS T, 2002, LECT NOTES COMPUTER, V2389, P137
[5]  
Jacquemin C, 2001, Spotting and discovering terms through natural language processing
[6]  
KRSTEV C, 2008, 6 LREC MARR MAR
[7]  
KRSTEV C, 2006, IS LTC 2006 LJUBLJ S, P192
[8]  
Krstev C, 2006, LECT NOTES ARTIF INT, V4139, P552
[9]  
Krstev Cvetana, 2008, Processing of Serbian: Automata, Texts and Electronic Dictionaries
[10]  
LAPORTE E, 2009, TRILHAS LINGUISTICAS, V16, P51