共 158 条
Beyond dichotomies in reinforcement learning
被引:73
作者:
Collins, Anne G. E.
[1
,2
]
Cockburn, Jeffrey
[3
]
机构:
[1] Univ Calif Berkeley, Dept Psychol, 3210 Tolman Hall, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[3] CALTECH, Div Humanities & Social Sci, Pasadena, CA 91125 USA
关键词:
DOPAMINE NEURONS ENCODE;
MODEL-BASED CONTROL;
PREFRONTAL CORTEX;
WORKING-MEMORY;
INDIVIDUAL-DIFFERENCES;
PREDICTION ERRORS;
DECISION-MAKING;
BASAL GANGLIA;
SYSTEMS;
REWARD;
D O I:
10.1038/s41583-020-0355-6
中图分类号:
Q189 [神经科学];
学科分类号:
071006 ;
摘要:
Reinforcement learning (RL) is a framework of particular importance to psychology, neuroscience and machine learning. Interactions between these fields, as promoted through the common hub of RL, has facilitated paradigm shifts that relate multiple levels of analysis in a singular framework (for example, relating dopamine function to a computationally defined RL signal). Recently, more sophisticated RL algorithms have been proposed to better account for human learning, and in particular its oft-documented reliance on two separable systems: a model-based (MB) system and a model-free (MF) system. However, along with many benefits, this dichotomous lens can distort questions, and may contribute to an unnecessarily narrow perspective on learning and decision-making. Here, we outline some of the consequences that come from overconfidently mapping algorithms, such as MB versus MF RL, with putative cognitive processes. We argue that the field is well positioned to move beyond simplistic dichotomies, and we propose a means of refocusing research questions towards the rich and complex components that comprise learning and decision-making. Reinforcement learning has been suggested to come in two flavours: model-free and model-based. In this Perspective, Collins and Cockburn explain why viewing reinforcement learning through this dichotomous lens is not always accurate or helpful, and suggest paths forward.
引用
收藏
页码:576 / 586
页数:11
相关论文