Modeling Word Learning and Processing with Recurrent Neural Networks

被引:1
作者
Marzi, Claudia [1 ]
机构
[1] Italian Natl Res Council, Inst Computat Linguist, I-56124 Pisa, Italy
关键词
word-learning; serial word processing; recurrent neural networks; long short-term memories; temporal self-organizing memories; MEMORY;
D O I
10.3390/info11060320
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper focuses on what two different types of Recurrent Neural Networks, namely a recurrent Long Short-Term Memory and a recurrent variant of self-organizing memories, a Temporal Self-Organizing Map, can tell us about speakers' learning and processing a set of fully inflected verb forms selected from the top-frequency paradigms of Italian and German. Both architectures, due to the re-entrant layer of temporal connectivity, can develop a strong sensitivity to sequential patterns that are highly attested in the training data. The main goal is to evaluate learning and processing dynamics of verb inflection data in the two neural networks by focusing on the effects of morphological structure on word production and word recognition, as well as on word generalization for untrained verb forms. For both models, results show that production and recognition, as well as generalization, are facilitated for verb forms in regular paradigms. However, the two models are differently influenced by structural effects, with the Temporal Self-Organizing Map more prone to adaptively find a balance between processing issues of learnability and generalization, on the one side, and discriminability on the other side.
引用
收藏
页数:14
相关论文
共 31 条
  • [1] [Anonymous], 2016, ARXIV PREPRINT ARXIV
  • [2] Aronoff M., 1994, Morphology by Itself
  • [3] Baayen H., 1995, The CELEX lexical database (Release 2)
  • [4] Probability and surprisal in auditory comprehension of morphologically complex words
    Balling, Laura Winther
    Baayen, R. Harald
    [J]. COGNITION, 2012, 125 (01) : 80 - 106
  • [5] Morphological effects in auditory word recognition: Evidence from Danish
    Balling, Laura Winther
    Baayen, R. Harald
    [J]. LANGUAGE AND COGNITIVE PROCESSES, 2008, 23 (7-8): : 1159 - 1190
  • [6] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
    BENGIO, Y
    SIMARD, P
    FRASCONI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
  • [7] Bittner D., 2003, DEV VERB INFLECTION
  • [8] Short-term memory for serial order: A recurrent neural network model
    Botvinick, MM
    Plaut, DC
    [J]. PSYCHOLOGICAL REVIEW, 2006, 113 (02) : 201 - 233
  • [9] A Fundamental Limitation of the Conjunctive Codes Learned in PDP Models of Cognition: Comment on Botvinick and Plaut (2006)
    Bowers, Jeffrey S.
    Damian, Markus F.
    Davis, Colin J.
    [J]. PSYCHOLOGICAL REVIEW, 2009, 116 (04) : 986 - 995
  • [10] Cardillo F. A., 2018, J COMPUTATIONAL LING, V4, P57, DOI 10.4000/ijcol.540