Orwell's 1984-From Simple to Multi-word Units

被引:1
作者
Krstev, Cvetana [1 ]
Vitas, Dusko [2 ]
Trtovac, Aleksandra [3 ]
机构
[1] Univ Belgrade, Fac Philol, Studentski Trg 1, Belgrade, Serbia
[2] Univ Belgrade, Fac Math, YU-11001 Belgrade, Serbia
[3] Univ Belgrade, Univ Library, Belgrade, Serbia
来源
HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS | 2014年 / 8387卷
关键词
Morphosyntactic annotation; Multi-word units; Finite-state transducers; MULTEXT-East;
D O I
10.1007/978-3-319-08958-4_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an alternative version of the morphosyntactically annotated Serbian translation of 1984. This version follows the basic principles of the MULTEXT-East version, except for one addition-the text will be annotated with multi-word units as well. We will present the resources used for annotation with multi-word units and explain how these resources were enriched with multi-word units extracted from the processed text. Finally, we will present the format of this alternative version and the benefits obtained both from preparing the new resource and from the resource itself.
引用
收藏
页码:276 / 287
页数:12
相关论文
共 27 条
[11]  
Ermolaev N., 2012, P 12 INT C LIB DIG A
[12]  
GESMUNDO A., 2012, P 50 ANN M ASS COMP, V2, P368
[13]  
Gross M., 1986, 11th International Conference on Computational Linguistics. Proceedings of Coling '86, P1
[14]  
Krstev C., 2004, Informatica, V28, P431
[15]   A system for named entity recognition based on local grammars [J].
Krstev, Cvetana ;
Obradovic, Ivan ;
Utvic, Milos ;
Vitas, Dusko .
JOURNAL OF LOGIC AND COMPUTATION, 2014, 24 (02) :473-489
[16]  
Krstev C, 2013, STUD COMPUT INTELL, V458, P109, DOI 10.1007/978-3-642-34399-5_6
[17]  
Krstev Cvetana., 2006, Proceedings of the 5th Slovenian and 1st International Conference Language Technologies, IS-LTC 2006, P192
[18]  
Laporte E., 2008, Towards a Shared Task for Multiword Expressions (MWE 2008), P27
[19]  
Paumier S, 2013, UNITEX 3 1BETA USER
[20]  
Popovic Z.., 2010, INFOTHECA J DIGIT HU, V11, p21a