Multiword expressions processing in Galician using Deep Learning

Cited by: 1
Authors
Darriba, Victor [1]
Doval, Yerai [1]
Kuriyozov, Elmurod [2]
Affiliations
[1] Univ Vigo, Dept Informat, Vigo, Spain
[2] Univ A Coruna, CITIC, La Coruna, Spain
Source
PROCESAMIENTO DEL LENGUAJE NATURAL | 2021 / Issue 67
Keywords
Multiword expressions; machine learning; transformers; Galician
DOI
10.26342/2021-67-4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The treatment of Multiword Expressions remains an open problem in Natural Language Processing. In this work, we experimentally assess the usefulness of Machine Learning models for Multiword Expression processing in Galician. To that end, we use CORGA, a 40-million-word corpus, to train Deep Learning-based transformers, and we compare their performance with that of more traditional conditional random fields.
Pages: 45-57
Page count: 13