Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based Study

被引:0
|
作者
Leon-Arauz, Pilar [1 ]
Cabezas-Garcia, Melania [1 ]
Reimerink, Arianne [1 ]
机构
[1] Univ Granada, Dept Translat & Interpreting, Buensuceso 11, Granada, Spain
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年
关键词
multiword expressions and collocations; information extraction; lexical database; TRANSLATION; CORPORA;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In scientific and technical communication, multiword terms are the most frequent type of lexical units. Rendering them in another language is not an easy task due to their cognitive complexity, the proliferation of different forms, and their unsystematic representation in terminographic resources. This often results in a broad spectrum of translations for multiword terms, which also foment term variation since they consist of two or more constituents. In this study we carried out a quantitative and qualitative analysis of Spanish translation variants of a set of environment-related concepts by evaluating equivalents in three parallel corpora, two comparable corpora and two terminological resources. Our results showed that MWTs exhibit a significant degree of term variation of different characteristics, which were used to establish a set of criteria according to which term variants should be selected, organized and described in terminological knowledge bases.
引用
收藏
页码:2358 / 2367
页数:10
相关论文
共 50 条