Implementing Neural Turing Machines

被引:26
作者
Collier, Mark [1 ]
Beel, Joeran [1 ]
机构
[1] Trinity Coll Dublin, Dublin, Ireland
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷
关键词
Neural Turing Machines; Memory Augmented; Neural Networks;
D O I
10.1007/978-3-030-01424-7_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Turing Machines (NTMs) are an instance of Memory Augmented Neural Networks, a new class of recurrent neural networks which decouple computation from memory by introducing an external memory unit. NTMs have demonstrated superior performance over Long Short- Term Memory Cells in several sequence learning tasks. A number of open source implementations of NTMs exist but are unstable during training and/ or fail to replicate the reported performance of NTMs. This paper presents the details of our successful implementation of a NTM. Our implementation learns to solve three sequential learning tasks from the original NTM paper. We find that the choice of memory contents initialization scheme is crucial in successfully implementing a NTM. Networks with memory contents initialized to small constant values converge on average 2 times faster than the next best memory contents initialization scheme.
引用
收藏
页码:94 / 104
页数:11
相关论文
共 12 条
  • [1] [Anonymous], 2016, ARXIV160700036
  • [2] [Anonymous], 1997, Neural Computation
  • [3] [Anonymous], 2014, ABS14105401 CORR
  • [4] [Anonymous], 2016, P C ASS MACH TRANSL
  • [5] [Anonymous], 2015, ADV NEURAL INFORM PR
  • [6] Bahdanau D., 2015, Neural machine translation
  • [7] Hybrid computing using a neural network with dynamic external memory
    Graves, Alex
    Wayne, Greg
    Eynolds, Malcolm R.
    Harley, Tim
    Danihelka, Ivo
    Grabska-Barwinska, Agnieszka
    Colmenarejo, Sergio Gomez
    Grefenstette, Edward
    Amalho, Tiago R.
    Agapiou, John
    Badia, Adria Puigdomenech
    Hermann, Karl Moritz
    Zwols, Yori
    Strovski, Georg O.
    Ain, Adam C.
    King, Helen
    Summerfield, Christopher
    Lunsom, Phil B.
    Kavukcuoglu, Koray
    Hassabis, Demis
    [J]. NATURE, 2016, 538 (7626) : 471 - +
  • [8] Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
  • [9] A Novel Connectionist System for Unconstrained Handwriting Recognition
    Graves, Alex
    Liwicki, Marcus
    Fernandez, Santiago
    Bertolami, Roman
    Bunke, Horst
    Schmidhuber, Juergen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) : 855 - 868
  • [10] King DB, 2015, ACS SYM SER, V1214, P1