A Comparative Evaluation of Cross-Lingual Text Annotation Techniques

被引:0
作者
Zhang, Lei [1 ]
Rettinger, Achim [1 ]
Faerber, Michael [1 ]
Tadic, Marko [2 ]
机构
[1] Karlsruhe Inst Technol, Inst AIFB, D-76021 Karlsruhe, Germany
[2] Univ Zagreb, Fac Humanities & Social Sci, Zagreb, Croatia
来源
INFORMATION ACCESS EVALUATION: MULTILINGUALITY, MULTIMODALITY, AND VISUALIZATION | 2013年 / 8138卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the problem of extracting knowledge from textual documents written in different languages by annotating the text on the basis of a cross-lingual knowledge base, namely Wikipedia. Our contribution is twofold. First, we propose a novel framework for evaluating cross-lingual text annotation techniques, based on annotation of a parallel corpus to a hub-language in a cross-lingual knowledge base. Second, we investigate the performance of different cross-lingual text annotation techniques according to our proposed evaluation framework. We perform experiments for an empirical comparison of three approaches: (i) Cross-lingual Named Entity Annotation (CL-NEA), (ii) Cross-lingual Wikifier Annotation (CL-WIFI), and (iii) Cross-lingual Explicit Semantic Analysis (CL-ESA). Besides establishing an evaluation framework, our results show the differences between the three investigated approaches and demonstrate their advantages and disadvantages.
引用
收藏
页码:124 / 135
页数:12
相关论文
共 15 条
[1]  
[Anonymous], 1997, P 5 APPL NAT LANG PR, DOI DOI 10.3115/974557.974586
[2]  
[Anonymous], 2007, Proceedings of the 16th ACM Conference on Con- ference on Information and Knowledge Management, DOI DOI 10.1145/1321440.1321475.19
[3]  
[Anonymous], 2008, Proceedings of the 17th ACM conference on Information and knowledge management
[4]  
[Anonymous], 2003, Proceedings of CoNLL-2003
[5]  
Asahara M, 2003, HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P8
[6]  
Borthwick A., 1998, P MESS UND C
[7]   The Google similarity distance [J].
Cilibrasi, Rudi L. ;
Vitanyi, Paul M. B. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (03) :370-383
[8]  
Faruqui M., KONF VER NAT SPRACH
[9]  
Gabrilovich E., 2007, P 20 INT JOINT C ART, V6, P12
[10]  
Gabrilovich E., 2006, AAAI, P1301