Temporal sequence learning, prediction, and control: A review of different models and their relation to biological mechanisms

Cited by: 129

Authors:

Wörgötter, F [1]

Porr, B [1]

Affiliations:

[1] Univ Stirling, Dept Psychol, Stirling FK9 4LA, Scotland

Source:

NEURAL COMPUTATION | 2005 / Volume 17 / Issue 02

DOI:

10.1162/0899766053011555

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.

Pages: 245-319

Page count: 75
