Using a Database of Multiword Expressions in Dependency Parsing

被引:0
作者
Jelinek, Tomas [1 ]
机构
[1] Charles Univ Prague, Fac Arts, Inst Theoret & Computat Linguist, Prague, Czech Republic
来源
TEXT, SPEECH, AND DIALOGUE (TSD 2019) | 2019年 / 11697卷
关键词
Multiword expressions; Dependency parsing; MWE database;
D O I
10.1007/978-3-030-27947-9_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying and correctly handling multiword expressions is critical for understanding a language system and for properly functioning NLP tools. This paper presents a database of multiword expressions (MWE) we build for the Czech language which currently contains more than 7,000 entries. It contains detailed information about the properties of MWEs, e.g. about their idiomaticity and variability. The database also contains manually verified dependency structures of MWEs. We show one of the possible uses of the database: identification and correction of parsing errors in sentences containing MWEs.
引用
收藏
页码:19 / 31
页数:13
相关论文
共 21 条
[1]  
[Anonymous], 2016, CORR
[2]  
[Anonymous], 2016, P 10 INT C LANG RES
[3]  
[Anonymous], 2014, P 9 INT C LANG RES E
[4]  
Baldwin T, 2010, CH CRC MACH LEARN PA, P267
[5]   Annotation of multiword expressions in the Prague dependency treebank [J].
Bejcek, Eduard ;
Stranak, Pavel .
LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) :7-21
[6]  
Cermak F., 2016, SLOVNIK CESKE FRAZEO
[7]   Multiword Expression Processing: A Survey [J].
Constant, Mathieu ;
Eryigit, Gulsen ;
Monti, Johanna ;
van der Plas, Lonneke ;
Ramisch, Carlos ;
Rosner, Michael ;
Todirascu, Amalia .
COMPUTATIONAL LINGUISTICS, 2017, 43 (04) :837-892
[8]  
Czerepowicka M., 2018, LNCS LNAI, P59, DOI [10.1007/978-3-319-93782-3_5, DOI 10.1007/978-3-319-93782-3_5]
[9]  
Geyken A., 2004, P 4 INT C LANG RES E
[10]   DuELME: a Dutch electronic lexicon of multiword expressions [J].
Gregoire, Nicole .
LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) :23-39