On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market

被引:0
作者
Lewis, David [1 ]
O'Connor, Alexander [1 ]
Zydron, Andrzej [2 ]
Sjoegren, Gerd [3 ]
Choudhury, Rahzeb [4 ]
机构
[1] Trinity Coll Dublin, Knowledge & Data Engn Grp, Ctr Next Generat Localisat, Dublin, Ireland
[2] XTM Int, Rochester, NY USA
[3] Interverbum Technol, Stockholm, Sweden
[4] TAUS, Plzen, Czech Republic
来源
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年
基金
爱尔兰科学基金会;
关键词
Localisation; Linked Data; Language resource sharing; RDF;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Innovations in localisation have focused on the collection and leverage of language resources. However, smaller localisation clients and Language Service Providers are poorly positioned to exploit the benefits of language resource reuse in comparison to larger companies. Their low throughput of localised content means they have little opportunity to amass significant resources, such as Translation memories and Terminology databases, to reuse between jobs or to train statistical machine translation engines tailored to their domain specialisms and language pairs. We propose addressing this disadvantage via the sharing and pooling of language resources. However, the current localisation standards do not support multiparty sharing, are not well integrated with emerging language resource standards and do not address key requirements in determining ownership and license terms for resources. We survey standards and research in the area of Localisation, Language Resources and Language Technologies to leverage existing localisation standards via Linked Data methodologies. This points to the potential of using semantic representation of existing data models for localisation workflow metadata, terminology, parallel text, provenance and access control, which we illustrate with an RDF example.
引用
收藏
页码:1403 / 1409
页数:7
相关论文
共 31 条
[1]  
Abel F, 2007, LECT NOTES COMPUT SC, V4825, P1
[2]  
[Anonymous], 2006, LEXICAL MARKUP FRAME
[3]  
Auer S., 2007, J SEMANTIC WEB LNCS, V4825
[4]  
Banerjee P., 2011, P 13 MACH TRANSL SUM
[5]  
Bizer C., 2011, STATE LOD CLOUD V0 3
[6]  
Buitelaar P, 2009, LECT NOTES COMPUT SC, V5554, P111, DOI 10.1007/978-3-642-02121-3_12
[7]  
Clark Jonathan H., 2010, Prague Bulletin of Mathematical Linguistics, P117, DOI 10.2478/v10108-010-0002-x
[8]  
Cruz-Lara S., 2004, TOPICS LANGUAGE RESO, P151
[9]  
Declerck T., 2010, W3C WORKSH MULT WEB
[10]  
Falk I., 2010, LREC WORKSH LANG RES, P19