Interactions between motor exploration and reinforcement learning

被引：38

作者：

Uehara, Shintaro ^{[1
,2
]}

Mawase, Firas ^{[1
]}

Therrien, Amanda S. ^{[3
,4
]}

Cherry-Allen, Kendra M. ^{[1
]}

Celnik, Pablo ^{[1
,3
]}

机构：

[1] Johns Hopkins Med Inst, Dept Phys Med & Rehabil, 600 N Wolfe St, Baltimore, MD 21287 USA

[2] Japan Soc Promot Sci, Tokyo, Japan

[3] Johns Hopkins Med Inst, Dept Neurosci, Baltimore, MD 21287 USA

[4] Kennedy Krieger Inst, Ctr Movement Studies, Baltimore, MD USA

来源：

JOURNAL OF NEUROPHYSIOLOGY | 2019年 / 122卷 / 02期

基金：

日本学术振兴会; 美国国家卫生研究院;

关键词：

meta-learning; motor exploration; reinforcement learning; savings; trial and error; ADAPTATION; SAVINGS; VARIABILITY; MODULATION; NOISE; MODEL; PERTURBATION; PROBABILITY; PLASTICITY; MEMORIES;

D O I：

10.1152/jn.00390.2018

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

Motor exploration, a trial-and-error process in search for better motor outcomes, is known to serve a critical role in motor learning. This is particularly relevant during reinforcement learning, where actions leading to a successful outcome are reinforced while unsuccessful actions arc avoided. Although early on motor exploration is beneficial to finding the correct solution. maintaining high levels of exploration later in the learning process might be deleterious. Whether and how the level of exploration changes over the course of reinforcement learning, however, remains poorly understood. Here we evaluated temporal changes in motor exploration while healthy participants learned a reinforcement-based motor task. We defined exploration as the magnitude of trial-to-trial change in movements as a function of whether the preceding trial resulted in success or failure. Participants were required to find the optimal finger-pointing direction using binary feedback of success or failure. We found that the magnitude of exploration gradually increased over time when participants were learning the task. Conversely, exploration remained low in participants who were unable to correctly adjust their pointing direction. Interestingly, exploration remained elevated when participants underwent a second training session, which was associated with faster relearning. These results indicate that the motor system may flexibly upregulate the extent of exploration during reinforcement learning as if acquiring a specific strategy to facilitate subsequent learning. Also, our findings showed that exploration affects reinforcement learning and vice versa, indicating an interactive relationship between them. Reinforcement-based tasks could be used as primers to increase exploratory behavior leading to more efficient subsequent learning. NEW & NOTEWORTHY Motor exploration, the ability to search for the correct actions, is critical to learning motor skills. Despite this, whether and how the level of exploration changes over the course of training remains poorly understood. We showed that exploration increased and remained high throughout training of a reinforcement-based motor task. Interestingly, elevated exploration persisted and facilitated subsequent learning. These results suggest that the motor system upregulates exploration as if learning a strategy to facilitate subsequent learning.

引用

页码：797 / 808

页数：12

共 50 条

[31] Balancing exploration and exploitation in episodic reinforcement learning [J].

Chen, Qihang ;

Zhang, Qiwei ;

Liu, Yunlong .

EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231

[32] Enriching behavioral ecology with reinforcement learning methods [J].

Frankenhuis, Willem E. ;

Panchanathan, Karthik ;

Barto, Andrew G. .

BEHAVIOURAL PROCESSES, 2019, 161 :94-100

[33] A reinforcement learning approach to model interactions between landmarks and geometric cues during spatial learning [J].

Sheynikhovich, Denis ;

Arleo, Angelo .

BRAIN RESEARCH, 2010, 1365 :35-47

[34] Learning Task Decomposition and Exploration Shaping for Reinforcement Learning Agents [J].

Djurdjevic, Predrag ;

Huber, Manfred .

2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, :365-372

[35] Learning to soar: Resource-constrained exploration in reinforcement learning [J].

Chung, Jen Jen ;

Lawrance, Nicholas R. J. ;

Sukkarieh, Salah .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2015, 34 (02) :158-172

[36] Learning Transferable Domain Priors for Safe Exploration in Reinforcement Learning [J].

Karimpanal, Thommen George ;

Rana, Santu ;

Gupta, Sunil ;

Truyen Tran ;

Venkatesh, Svetha .

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

[37] Recognition memory for human motor learning [J].

Kumar, Neeraj ;

van Vugt, Floris T. ;

Ostry, David J. .

CURRENT BIOLOGY, 2021, 31 (08) :1678-+

[38] Motor competence is related to acquisition of error-based but not reinforcement learning in children ages 6 to 12 [J].

Konrad, Jeffrey D. ;

Marrus, Natasha ;

Lohse, Keith R. ;

Thuet, Kayla M. ;

Lang, Catherine E. .

HELIYON, 2024, 10 (12)

[39] Reinforcement learning of motor skills with policy gradients [J].

Peters, Jan ;

Schaal, Stefan .

NEURAL NETWORKS, 2008, 21 (04) :682-697

[40] The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior [J].

Katahira, Kentaro .

JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2015, 66 :59-69

← 1 2 3 4 5 →