Accessing and Standardizing Wiktionary Lexical Entries for Supporting the Translation of Labels in Taxonomies for Digital Humanities

Cited by: 0
Authors
Declerck, Thierry [1 ,2 ]
Moerth, Karlheinz [2 ]
Lendvai, Piroska [3 ]
Affiliations
[1] Deutsches Forschungszentrum für Künstliche Intelligenz, Stuhlsatzenhausweg 3, D-66123 Saarbrücken, Germany
[2] Austrian Acad Sci, ICLTT, A-1010 Vienna, Austria
[3] Hungarian Acad Sci, Res Inst Linguist, H-1068 Budapest, Hungary
Source
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012
Keywords
multilingual lexical resources; standards; digital humanities
DOI
Not available
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
We describe the usefulness of Wiktionary, the freely available web-based lexical resource, in providing multilingual extensions to catalogues used for content-based indexing of folktales and related narratives. We develop conversion tools between Wiktionary and TEI, using ISO standards (LMF, MAF), to make such resources available to both the Digital Humanities community and the Language Resources community. The converted data can be queried via a web interface, and the tools of the workflow are to be released under an open-source license. We report on the current state and functionality of our tools and analyse some shortcomings of Wiktionary, as well as potential domains of application.
Pages: 2511-2514
Page count: 4