Online learning of long-range dependencies

被引:0
|
作者
Zucchet, Nicolas [1 ]
Meier, Robert [1 ]
Schug, Simon [1 ]
Mujika, Asier [1 ]
Sacramento, Joao [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
基金
瑞士国家科学基金会;
关键词
BACKPROPAGATION; TIME;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online learning holds the promise of enabling efficient long-term credit assignment in recurrent neural networks. However, current algorithms fall short of offline backpropagation by either not being scalable or failing to learn long-range dependencies. Here we present a high-performance online learning algorithm that merely doubles the memory and computational requirements of a single inference pass. We achieve this by leveraging independent recurrent modules in multi-layer networks, an architectural motif that has recently been shown to be particularly powerful. Experiments on synthetic memory problems and on the challenging long-range arena benchmark suite reveal that our algorithm performs competitively, establishing a new standard for what can be achieved through online learning. This ability to learn long-range dependencies offers a new perspective on learning in the brain and opens a promising avenue in neuromorphic computing.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Learning and generation of long-range correlated sequences
    Priel, A
    Kanter, I
    PHYSICAL REVIEW E, 2000, 62 (02): : 1617 - 1621
  • [32] Small-Scale Martian Crater Detection by Deep Learning With Enhanced Capture of Features Information and Long-Range Dependencies
    Yu, Zhichao
    Fong, Simon
    Millham, Richard Charles
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [33] ABSENCE OF LONG-RANGE ORDER WITH LONG-RANGE POTENTIALS
    BAUS, M
    JOURNAL OF STATISTICAL PHYSICS, 1980, 22 (01) : 111 - 119
  • [34] Long-range correlations in an online betting exchange for a football tournament
    Hardiman, Stephen J.
    Richmond, Peter
    Hutzler, Stefan
    NEW JOURNAL OF PHYSICS, 2010, 12
  • [35] Shaken, and Stirred: Long-Range Dependencies Enable Robust Outlier Detection with PixelCNN plus
    Umapathi, Barath Mohan
    Chauhan, Kushal
    Shenoy, Pradeep
    Sridharan, Devarajan
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1440 - 1450
  • [36] BlockEcho: Retaining Long-Range Dependencies for Imputing Block-Wise Missing Data
    Han, Qiao
    Li, Mingqian
    Yang, Yao
    Zhai, Yiteng
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4098 - 4106
  • [37] Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulation
    Birnbaum, Sawyer
    Kuleshov, Volodymyr
    Enam, S. Zayd
    Koh, Pang Wei
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [38] Modeling long-range dependencies in speech data for text-independent speaker recognition
    Ming, Ji
    Lin, Jie
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4825 - +
  • [39] Abandoning Emotion Classes - Towards Continuous Emotion Recognition with Modelling of Long-Range Dependencies
    Woellmer, Martin
    Eyben, Florian
    Reiter, Stephan
    Schuller, Bjoern
    Cox, Cate
    Douglas-Cowie, Ellen
    Cowie, Roddy
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 597 - +
  • [40] What Limits Our Capacity to Process Nested Long-Range Dependencies in Sentence Comprehension?
    Lakretz, Yair
    Dehaene, Stanislas
    King, Jean-Remi
    ENTROPY, 2020, 22 (04)