Syntax-aware Neural Semantic Role Labeling for Morphologically Rich Languages

被引:0
作者
Vasic, Daniel [1 ]
Vasic, Mirela Kundid [1 ]
机构
[1] Univ Mostar, Mostar 88000, Bosnia & Herceg
来源
2020 28TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM) | 2020年
关键词
semantic role labeling; deep learning; semantic parsing; morphologically rich languages;
D O I
10.23919/softcom50211.2020.9238179
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic Role Labeling (SRL) is one of the most challenging tasks in Natural Language Processing (NLP). SRL is a task that consists of predicate identification, argument identification and argument classification. In this article we present novel approach for argument classification that is based on deep neural network architecture. Traditional discrete based SRL relies heavily on feature engineering that uses syntactic structures in contrast to deep learning approaches that encode whole sentences without taking into account syntactic features. We present an approach that uses combination of syntactic features and external word representations from FastText. The advantages of using FastText embeddings is the generation of better vector representations for rare words and FastText gives better results for words that are not within the dictionary. These attributes for vector representations give good results for morphologically rich languages. Most of the SRL approaches today are trained on resource rich languages. In this article we present novel neural architecture for SRL that is suitable for resource poor morphology rich languages. Experiments on hr500k corpus shows that our syntax-aware approach shows competitive results for argument classification. We present architecture for argument classification that is based on Bidirectional Long-Short Term Memory (Bi-LSTM) and Conditional Random Field (CRF) decoding for finding optimal sequence. Our approach showed results that are very close to benchmark results with F1 score of 72%.
引用
收藏
页码:327 / 332
页数:6
相关论文
共 25 条
  • [1] Bojanowski P., 2016, Trans. Assoc. Comput. Linguist., V5, P135, DOI [10.1162/tacla00051, DOI 10.1162/TACL_A_00051]
  • [2] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [3] FitzGerald Nicholas, 2015, P C EMPIRICAL MET NA, P960, DOI DOI 10.18653/V1/D15-1112
  • [4] VITERBI ALGORITHM
    FORNEY, GD
    [J]. PROCEEDINGS OF THE IEEE, 1973, 61 (03) : 268 - 278
  • [5] Gantar P, 2018, P C LANG TECHN DIG H, P93
  • [6] Glorot X., 2010, P 13 INT C ARTIFICIA, P249, DOI DOI 10.1109/LGRS.2016.2565705
  • [7] Hajic J., 2018, Prague dependency Treebank 3.5
  • [8] He L., P 55 ANN M ASS COMPU, P473
  • [9] Deep Semantic Role Labeling: What Works and What's Next
    He, Luheng
    Lee, Kenton
    Lewis, Mike
    Zettlemoyer, Luke
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 473 - 483
  • [10] Kingma DP., 2017, A method for stochastic optimization, DOI DOI 10.48550/ARXIV.1412.6980