Effective reinforcement learning following cerebellar damage requires a balance between exploration and motor noise

Cited by: 130

Authors:

Therrien, Amanda S. [1,2]

Wolpert, Daniel M. [3]

Bastian, Amy J. [1,2]

Affiliations:

[1] Kennedy Krieger Inst, Ctr Movement Studies, 707 N Broadway, Baltimore, MD USA

[2] Johns Hopkins Univ, Sch Med, Dept Neurosci, 725 N Wolfe St, Baltimore, MD 21205 USA

[3] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England

Source:

BRAIN | 2016 / Volume 139

Funding:

Wellcome Trust (UK);

Keywords:

reinforcement learning; adaptation; visuomotor rotation; ataxia; cerebellum; ADAPTATION; DYNAMICS; REWARD; CONSEQUENCES; DEGENERATION; VARIABILITY; BEHAVIOR; ABILITY; LESIONS; SYSTEM;

DOI:

10.1093/brain/awv329

Chinese Library Classification number:

R74 [Neurology and Psychiatry];

Abstract:

Reinforcement and error-based processes are essential for motor learning, with the cerebellum thought to be required only for the error-based mechanism. Here we examined learning and retention of a reaching skill under both processes. Control subjects learned similarly from reinforcement and error-based feedback, but showed much better retention under reinforcement. To apply reinforcement to cerebellar patients, we developed a closed-loop reinforcement schedule in which task difficulty was controlled based on recent performance. This schedule produced substantial learning in cerebellar patients and controls. Cerebellar patients varied in their learning under reinforcement but fully retained what was learned. In contrast, they showed a complete lack of retention in error-based learning. We developed a mechanistic model of the reinforcement task and found that learning depended on a balance between exploration variability and motor noise. While the cerebellar and control groups had similar exploration variability, the patients had greater motor noise and hence learned less. Our results suggest that cerebellar damage indirectly impairs reinforcement learning by increasing motor noise, but does not interfere with the reinforcement mechanism itself. Therefore, reinforcement can be used to learn and retain novel skills, but optimal reinforcement learning requires a balance between exploration variability and motor noise.
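The balance described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' model: the function name `simulate`, the Gaussian noise terms, the 10-trial moving-average reward threshold, the 15-degree target, and the update rule (after a rewarded trial, shift the internal estimate by the exploration component only) are all hypothetical choices made for illustration. The idea it tries to capture is that the learner can credit its own exploration but cannot see its motor noise, so larger motor noise makes reward a less reliable signal.

```python
import random


def simulate(sigma_explore, sigma_motor, target=15.0, tol=2.0,
             n_trials=400, window=10, seed=0):
    """Toy closed-loop reinforcement task (all parameters are assumptions).

    Returns (final internal estimate, mean of the last 50 reach angles).
    """
    rng = random.Random(seed)
    estimate = 0.0   # learner's internal estimate of the reach angle (degrees)
    reaches = []

    for _ in range(n_trials):
        explore = rng.gauss(0.0, sigma_explore)  # variability the learner can credit
        noise = rng.gauss(0.0, sigma_motor)      # variability hidden from the learner
        reach = estimate + explore + noise

        # Closed-loop difficulty: the lower reward bound follows the mean of
        # recent reaches, but never exceeds the near edge of the target zone.
        recent = reaches[-window:]
        recent_mean = sum(recent) / len(recent) if recent else 0.0
        lower = min(recent_mean, target - tol)
        upper = target + tol

        # Binary reinforcement only; no error-magnitude feedback.
        if lower <= reach <= upper:
            estimate += explore  # credit only the known exploration component

        reaches.append(reach)

    tail = reaches[-50:]
    return estimate, sum(tail) / len(tail)


if __name__ == "__main__":
    for motor in (0.5, 2.0, 6.0):
        est, late_mean = simulate(sigma_explore=2.0, sigma_motor=motor)
        print(f"motor noise {motor}: estimate {est:.2f}, late reach mean {late_mean:.2f}")
```

Running the script with different `sigma_motor` values at a fixed `sigma_explore` gives a rough feel for how the ratio of the two noise sources affects how far the estimate moves toward the target; the specific numbers depend entirely on the assumed parameters and seed.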

Pages: 101-114

Page count: 14
