Providing corpus data for a dictionary for German juridical phraseology

Cited by: 1
Authors
Heid, Ulrich
Fritzinger, Fabienne
Hauptmann, Susanne
Weidenkaff, Julia
Weller, Marion
Source
TEXT RESOURCES AND LEXICAL KNOWLEDGE | 2008 / Vol. 8
DOI
10.1515/9783110211818.2.131
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We report on a set of procedures for extracting juridical terminology, in particular collocations and chains of collocations, from German texts in the field of intellectual property protection. The extraction work is based on standard corpus technology and produces several types of output that can be made available to lexicographers: lists of term candidates, lists of collocation candidates, and example sentences that show these items in context. German juridical language is a particularly interesting domain for the analysis of multiword expressions, as it is very rich in collocations and in chains of collocations. Many of these multiword expressions correspond to juridical concepts and thus merit being extracted and described in an electronic dictionary. We describe our sources, our tools, and examples of the output of the extraction procedures, and we discuss options for presenting these results to lexicographers.
Pages: 131-144
Number of pages: 14