Multiword expressions processing in Galician using Deep Learning

被引:1
|
作者
Darriba, Victor [1 ]
Doval, Yerai [1 ]
Kuriyozov, Elmurod [2 ]
机构
[1] Univ Vigo, Dept Informat, Vigo, Spain
[2] Univ A Coruna, CITIC, La Coruna, Spain
来源
PROCESAMIENTO DEL LENGUAJE NATURAL | 2021年 / 67期
关键词
Multiword expressions; machine learning; transformers; Galician;
D O I
10.26342/2021-67-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Treatment of Multiword Expressions is still a pending task in Natural Language Processing. In this work, we want to experimentally determine the usefulness of Machine Learning models for Multiword Expression processing in Galician. With that aim, we use CORGA, a 40 million word corpus, with which we train Deep Learning-based transformers, comparing their performances with those of more traditional conditional random fields.
引用
收藏
页码:45 / 57
页数:13
相关论文
共 50 条
  • [1] Using a Database of Multiword Expressions in Dependency Parsing
    Jelinek, Tomas
    TEXT, SPEECH, AND DIALOGUE (TSD 2019), 2019, 11697 : 19 - 31
  • [2] Multiword Expressions and Lexicalism
    Findlay, Jamie Y.
    PROCEEDINGS OF LFG'17 CONFERENCE, 2017, : 209 - 229
  • [3] Discovering multiword expressions
    Villavicencio, Aline
    Idiart, Marco
    NATURAL LANGUAGE ENGINEERING, 2019, 25 (06) : 715 - 733
  • [4] Prepositional multiword expressions
    Ivankovic, Ivana Matas
    RASPRAVE, 2016, 42 (02): : 543 - 562
  • [5] Identifying Bengali Multiword Expressions using semantic clustering
    Chakraborty, Tanmoy
    Das, Dipankar
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2014, 37 (01): : 106 - 128
  • [6] Using Semantic Clustering for Detecting Bengali Multiword Expressions
    Chakraborty, Tanmoy
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2014, 38 (02): : 103 - 113
  • [7] Identification of Nominal Multiword Expressions in Bengali Using CRF
    Chakraborty, Tanmoy
    4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012), 2012,
  • [8] Identification of Multiword Expressions in the brWaC
    Scheller Boos, Rodrigo Augusto
    Prestes, Kassius Vargas
    Villavicencio, Aline
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 728 - 735
  • [9] Multiword Expressions in Child Language
    Wilkens, Rodrigo
    Idiart, Marco
    Villavicencio, Aline
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2307 - 2311
  • [10] Multiword Expressions in Machine Translation
    Kordoni, Valia
    Simova, Iliana
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1208 - 1211