Multi-task reinforcement learning in humans

被引:39
作者
Tomov, Momchil S. [1 ,2 ]
Schulz, Eric [3 ,4 ]
Gershman, Samuel J. [2 ,4 ,5 ]
机构
[1] Harvard Med Sch, Program Neurosci, Boston, MA 02115 USA
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[3] Max Planck Inst Biol Cybernet, Tubingen, Germany
[4] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA
[5] Ctr Brains Minds & Machines, Cambridge, MA USA
关键词
ORBITOFRONTAL CORTEX; COGNITIVE MAP; ATTENTION;
D O I
10.1038/s41562-020-01035-y
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The ability to transfer knowledge across tasks and generalize to novel ones is an important hallmark of human intelligence. Yet not much is known about human multitask reinforcement learning. We study participants' behaviour in a two-step decision-making task with multiple features and changing reward functions. We compare their behaviour with two algorithms for multitask reinforcement learning, one that maps previous policies and encountered features to new reward functions and one that approximates value functions across tasks, as well as to standard model-based and model-free algorithms. Across three exploratory experiments and a large preregistered confirmatory experiment, our results provide evidence that participants who are able to learn the task use a strategy that maps previously learned policies to novel scenarios. These results enrich our understanding of human reinforcement learning in complex environments with changing task demands. Studying behaviour in a decision-making task with multiple features and changing reward functions, Tomov et al. find that a strategy that combines successor features with generalized policy iteration predicts behaviour best.
引用
收藏
页码:764 / +
页数:12
相关论文
共 50 条
[21]   Cardiac Complication Risk Profiling for Cancer Survivors via Multi-View Multi-Task Learning [J].
Pham, Thai-Hoang ;
Yin, Changchang ;
Mehta, Laxmi ;
Zhang, Xueru ;
Zhang, Ping .
2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, :499-508
[22]   SodStereo: An Effective Multi-task Learning Network for Stereo Matching and Salient Object Detection [J].
Han, Lei ;
Gu, Yunchang ;
Lu, Shengfang ;
Shi, Zhan ;
Shang, Yiming .
BIG DATA AND SECURITY, ICBDS 2023, PT II, 2024, 2100 :3-14
[23]   AdMISC: Advanced Multi-Task Learning and Feature-Fusion for Emotional Support Conversation [J].
Jia, Xuhui ;
He, Jia ;
Zhang, Qian ;
Jin, Jin .
ELECTRONICS, 2024, 13 (08)
[24]   A fair and interpretable network for clinical risk prediction: a regularized multi-view multi-task learning approach [J].
Pham, Thai-Hoang ;
Yin, Changchang ;
Mehta, Laxmi ;
Zhang, Xueru ;
Zhang, Ping .
KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (04) :1487-1521
[25]   A fair and interpretable network for clinical risk prediction: a regularized multi-view multi-task learning approach [J].
Thai-Hoang Pham ;
Changchang Yin ;
Laxmi Mehta ;
Xueru Zhang ;
Ping Zhang .
Knowledge and Information Systems, 2023, 65 :1487-1521
[26]   Generalization of value in reinforcement learning by humans [J].
Wimmer, G. Elliott ;
Daw, Nathaniel D. ;
Shohamy, Daphna .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) :1092-1104
[27]   BusWTE: Realtime Bus Waiting Time Estimation of GPS Missing via Multi-task Learning [J].
Rong, Yuecheng ;
Liu, Jun ;
Xu, Zhilin ;
Ding, Jian ;
Zhang, Chuangming ;
Gao, Jiaxiang .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT VI, 2023, 13718 :554-570
[28]   DGE-GSIM: A multi-task dual graph embedding learning for graph similarity computation [J].
Tan, Wenhui ;
Cao, Peng ;
Jin, Zhiyong ;
Luo, Futao ;
Wen, Guangqi ;
Li, Weiping .
PROCEEDINGS OF 2022 THE 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, ICMLSC 20222, 2022, :39-47
[29]   Multi-task Attention-Based Semi-supervised Learning for Medical Image Segmentation [J].
Chen, Shuai ;
Bortsova, Gerda ;
Juarez, Antonio Garcia-Uceda ;
van Tulder, Gijs ;
de Bruijne, Marleen .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT III, 2019, 11766 :457-465
[30]   A multi-task graph deep learning model to predict drugs combination of synergy and sensitivity scores [J].
Monem, Samar ;
Hassanien, Aboul Ella ;
Abdel-Hamid, Alaa H. .
BMC BIOINFORMATICS, 2024, 25 (01)