Multi-task reinforcement learning in humans

被引：43

作者：

Tomov, Momchil S. ^{[1
,2
]}

Schulz, Eric ^{[3
,4
]}

Gershman, Samuel J. ^{[2
,4
,5
]}

机构：

[1] Harvard Med Sch, Program Neurosci, Boston, MA 02115 USA

[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA

[3] Max Planck Inst Biol Cybernet, Tubingen, Germany

[4] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA

[5] Ctr Brains Minds & Machines, Cambridge, MA USA

来源：

NATURE HUMAN BEHAVIOUR | 2021年 / 5卷 / 06期

关键词：

ORBITOFRONTAL CORTEX; COGNITIVE MAP; ATTENTION;

D O I：

10.1038/s41562-020-01035-y

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

The ability to transfer knowledge across tasks and generalize to novel ones is an important hallmark of human intelligence. Yet not much is known about human multitask reinforcement learning. We study participants' behaviour in a two-step decision-making task with multiple features and changing reward functions. We compare their behaviour with two algorithms for multitask reinforcement learning, one that maps previous policies and encountered features to new reward functions and one that approximates value functions across tasks, as well as to standard model-based and model-free algorithms. Across three exploratory experiments and a large preregistered confirmatory experiment, our results provide evidence that participants who are able to learn the task use a strategy that maps previously learned policies to novel scenarios. These results enrich our understanding of human reinforcement learning in complex environments with changing task demands. Studying behaviour in a decision-making task with multiple features and changing reward functions, Tomov et al. find that a strategy that combines successor features with generalized policy iteration predicts behaviour best.

引用

页码：764 / +

页数：12

共 50 条

[41] DeepCenterline: A Multi-task Fully Convolutional Network for Centerline Extraction [J].

Guo, Zhihui ;

Bai, Junjie ;

Lu, Yi ;

Wang, Xin ;

Cao, Kunlin ;

Song, Qi ;

Sonka, Milan ;

Yin, Youbing .

INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2019, 2019, 11492 :441-453

[42] Using multi-task experiments to test principles of hippocampal function [J].

Han, Claire Z. ;

Donoghue, Thomas ;

Cao, Runnan ;

Kunz, Lukas ;

Wang, Shuo ;

Jacobs, Joshua .

HIPPOCAMPUS, 2023, 33 (05) :646-657

[43] MCAN: An Efficient Multi-Task Network for Facial Expression Analysis [J].

Wang, Haitao ;

Wang, Rui ;

Zou, Peng ;

Ni, Qingjian ;

Sun, Xiao .

PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, :1037-1042

[44] MTPR: A Multi-Task Learning Based POI Recommendation Considering Temporal Check-Ins and Geographical Locations [J].

Xia, Bin ;

Bai, Yuxuan ;

Yin, Junjie ;

Li, Qi ;

Xu, Lijie .

APPLIED SCIENCES-BASEL, 2020, 10 (19)

[45] Enhancing Inference on Physiological and Kinematic Periodic Signals via Phase-Based Interpretability and Multi-Task Learning [J].

Soleimani, Reza ;

Lobaton, Edgar .

INFORMATION, 2022, 13 (07)

[46] JOINT CTC-ATTENTION BASED END-TO-END SPEECH RECOGNITION USING MULTI-TASK LEARNING [J].

Kim, Suyoun ;

Hori, Takaaki ;

Watanabe, Shinji .

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, :4835-4839

[47] Computational evidence for hierarchically structured reinforcement learning in humans [J].

Eckstein, Maria K. ;

Collins, Anne G. E. .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (47) :29381-29389

[48] HF-UNet: Learning Hierarchically Inter-Task Relevance in Multi-Task U-Net for Accurate Prostate Segmentation in CT Images [J].

He, Kelei ;

Lian, Chunfeng ;

Zhang, Bing ;

Zhang, Xin ;

Cao, Xiaohuan ;

Nie, Dong ;

Gao, Yang ;

Zhang, Junfeng ;

Shen, Dinggang .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (08) :2118-2128

[49] Impact of multi-task on symptomatic patient affected by chronical vestibular disorders [J].

Regrain, Edwin ;

Regnault, Philippe ;

Kirtley, Christopher ;

Shamshirband, Shahaboddin ;

Chays, Andre ;

Boyer, Francois-Constant ;

Taiar, Redha .

ACTA OF BIOENGINEERING AND BIOMECHANICS, 2016, 18 (03) :123-129

[50] Aspect-based sentiment analysis of drug reviews using multi-task learning based dual BiLSTM model [J].

Rani, Somiya ;

Jain, Amita .

MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) :22473-22501

← 1 2 3 4 5 →