Multi-task reinforcement learning in humans

Cited by: 42

Authors:

Tomov, Momchil S. [1,2]

Schulz, Eric [3,4]

Gershman, Samuel J. [2,4,5]

Affiliations:

[1] Harvard Med Sch, Program Neurosci, Boston, MA 02115 USA

[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA

[3] Max Planck Inst Biol Cybernet, Tübingen, Germany

[4] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA

[5] Ctr Brains Minds & Machines, Cambridge, MA USA

Source:

NATURE HUMAN BEHAVIOUR | 2021 / Vol. 5 / Issue 06

Keywords:

ORBITOFRONTAL CORTEX; COGNITIVE MAP; ATTENTION;

DOI:

10.1038/s41562-020-01035-y

Chinese Library Classification (CLC) number:

B84 [Psychology];

Subject classification code:

04 ; 0402 ;

Abstract:

The ability to transfer knowledge across tasks and generalize to novel ones is an important hallmark of human intelligence. Yet not much is known about human multitask reinforcement learning. We study participants' behaviour in a two-step decision-making task with multiple features and changing reward functions. We compare their behaviour with two algorithms for multitask reinforcement learning, one that maps previous policies and encountered features to new reward functions and one that approximates value functions across tasks, as well as with standard model-based and model-free algorithms. Across three exploratory experiments and a large preregistered confirmatory experiment, our results provide evidence that participants who are able to learn the task use a strategy that maps previously learned policies to novel scenarios. These results enrich our understanding of human reinforcement learning in complex environments with changing task demands.

Studying behaviour in a decision-making task with multiple features and changing reward functions, Tomov et al. find that a strategy that combines successor features with generalized policy iteration predicts behaviour best.
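The strategy named in the abstract, successor features combined with generalized policy iteration, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `gpi_action`, the array layout, and all numbers are hypothetical and chosen only for illustration. The sketch assumes that reward is linear in the task features, and that for each previously learned policy the agent has stored successor features, that is, the expected discounted feature totals obtained by taking an action and then following that policy.

```python
import numpy as np


def gpi_action(psi, w_new):
    """Pick an action for a new task by reusing old policies.

    psi   : array (n_policies, n_actions, n_features); psi[i, a] holds the
            successor features of taking action a, then following policy i.
            (Hypothetical layout, assumed for this sketch.)
    w_new : array (n_features,); reward weights of the new task, assuming
            reward = features . w_new.
    """
    # Value of each (policy, action) pair under the new reward weights.
    q = psi @ w_new                      # shape (n_policies, n_actions)
    # Generalized policy iteration step: for every action, keep the best
    # value any old policy can guarantee, then act greedily on that.
    q_best = q.max(axis=0)               # shape (n_actions,)
    return int(np.argmax(q_best)), q


# Toy numbers (made up): two old policies, two actions, two features.
psi = np.array([
    [[1.0, 0.0], [0.2, 0.1]],   # policy 0, learned when feature 0 was rewarded
    [[0.1, 0.3], [0.0, 1.0]],   # policy 1, learned when feature 1 was rewarded
])

# New task: feature 0 is rewarded and feature 1 is penalized.
w_new = np.array([1.0, -1.0])
action, q = gpi_action(psi, w_new)
```

The design point is that nothing is relearned for the new task: the stored successor features are re-weighted by the new reward weights, and the agent follows whichever old policy looks best for each action.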


Pages: 764 / +

Page count: 12
