Semi-automatic construction of word-formation networks

Cited by: 1
Authors
Lango, Mateusz [1]
Zabokrtsky, Zdenek [2]
Sevcikova, Magda [2]
Affiliations
[1] Poznan Univ Tech, Fac Comp, Inst Comp Sci, Poznan, Poland
[2] Charles Univ Prague, Fac Math & Phys, Inst Formal & Appl Linguist, Prague, Czech Republic
Keywords
Derivation; Derivational morphology; Word-formation; Lexical network; Learning to rank; Sequential pattern mining; Active learning
DOI
10.1007/s10579-019-09484-2
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
The article presents a semi-automatic method for constructing word-formation networks, with a particular focus on derivation. The proposed approach applies a sequential pattern mining technique to construct useful morphological features in an unsupervised manner. The features take the form of regular expressions and are later fed to a machine-learned ranking model. The network is constructed by applying the learned model to sort the lists of possible base words and selecting the most probable ones. Apart from a relatively small training set and a lexicon, the approach does not require any additional language resources, such as lists of vowel and consonant alternations or part-of-speech tags. It is evaluated on lexeme sets of four languages: Polish, Spanish, Czech, and French. The experiments demonstrate that the method can construct linguistically adequate word-formation networks from small training sets. Furthermore, a feasibility study shows that the method can benefit further from interaction with a human language expert within an active learning framework.
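The ranking step described in the abstract (sort the possible base words of a derived word and keep the most probable one) can be illustrated with a toy sketch. This is not the authors' implementation: simple suffix-substitution counts stand in for both the mined regular-expression features and the learned ranking model, and the English word pairs, the lexicon, and the function names `suffix_rule` and `rank_bases` are all hypothetical.

```python
from collections import Counter


def common_prefix_len(a, b):
    """Length of the longest common prefix of two strings."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def suffix_rule(base, derived):
    """Describe a (base, derived) pair as a suffix substitution.

    Example: ("happy", "happiness") -> ("y", "iness").
    Assumption: this stands in for the regex features of the paper.
    """
    k = common_prefix_len(base, derived)
    return base[k:], derived[k:]


def rank_bases(derived, lexicon, rule_counts):
    """Sort candidate base words for `derived`, most plausible first.

    Assumption: the score is the training frequency of the suffix rule,
    a crude replacement for a learned ranking model. Ties are broken by
    longer shared prefix, then alphabetically.
    """
    candidates = [w for w in lexicon if w != derived]
    return sorted(
        candidates,
        key=lambda w: (
            -rule_counts[suffix_rule(w, derived)],
            -common_prefix_len(w, derived),
            w,
        ),
    )


# Hypothetical training pairs (base, derived) and a toy lexicon.
train_pairs = [
    ("teach", "teacher"),
    ("read", "reader"),
    ("write", "writer"),
    ("happy", "happiness"),
]
rule_counts = Counter(suffix_rule(b, d) for b, d in train_pairs)

lexicon = ["sing", "singer", "sin", "single", "song"]
ranking = rank_bases("singer", lexicon, rule_counts)
best_base = ranking[0]
```

Here the rule "append -er" was seen twice in the toy training pairs, so `sing` is ranked first for `singer`; repeating this for every lexeme and linking each to its top-ranked base would yield a network of the kind the abstract describes.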
Pages: 3-32
Number of pages: 30
Related papers
50 in total
  • [1] Semi-automatic construction of word-formation networks
    Mateusz Lango
    Zdeněk Žabokrtský
    Magda Ševčíková
    Language Resources and Evaluation, 2021, 55 : 3 - 32
  • [2] Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish)
    Lango, Mateusz
    Sevcikova, Magda
    Zabokrtsky, Zdenek
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1853 - 1860
  • [3] Word-Formation Network for Czech
    Sevcikova, Magda
    Zabokrtsky, Zdenek
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1087 - 1093
  • [4] Next Step in Online Querying and Visualization of Word-Formation Networks
    Vidra, Jonas
    Zabokrtsky, Zdenek
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 144 - 152
  • [5] WORD-FORMATION ASPECTS OF PROPER NAMES - WORD-FORMATION OR NAME-FORMATION?
    Harvalik, Milan
    NAME AND NAMING: CONVENTIONAL / UNCONVENTIONAL IN ONOMASTICS, 2015, : 37 - 43
  • [6] Word-formation & Word Memorization
    马红英
    民营科技, 2010, (03) : 51+109 - 51
  • [7] Word-Formation and Contextualism
    Meibauer, Joerg
    INTERNATIONAL REVIEW OF PRAGMATICS, 2014, 6 (01) : 103 - 126
  • [8] Metonymy in word-formation
    Janda, Laura A.
    COGNITIVE LINGUISTICS, 2011, 22 (02) : 359 - 392
  • [9] An Introduction of word-formation
    Shi Dongdong
    环球人文地理, 2014, (20) : 177 - 177
  • [10] Optimality Theory and word-formation
    Stichauer, Pavel
    SLOVO A SLOVESNOST, 2009, 70 (01) : 36 - 48