Reinforcement learning and its connections with neuroscience and psychology

Cited by: 26

Authors:

Subramanian, Ajay [1,3]

Chitlangia, Sharad [2,3]

Baths, Veeky [3,4]

Affiliations:

[1] NYU, Dept Psychol, 6 Washington Pl, New York, NY 10003 USA

[2] Amazon, Mumbai, Maharashtra, India

[3] BITS Pilani, Cognit Neurosci Lab, KK Birla Goa Campus, NH-17B, Zuarinagar 403726, Goa, India

[4] BITS Pilani, Dept Biol Sci, KK Birla Goa Campus, NH-17B, Zuarinagar 403726, Goa, India

Source:

NEURAL NETWORKS | 2022 / Vol. 145

Keywords:

Reinforcement learning; Neuroscience; Psychology; TEMPORALLY DISCOUNTED VALUES; PREFRONTAL CORTEX; ORBITOFRONTAL CORTEX; REWARD SIGNALS; PREDICTION ERRORS; COGNITIVE CONTROL; HUMAN STRIATUM; CAUSAL POWER; PLACE CELLS; DOPAMINE;

DOI:

10.1016/j.neunet.2021.10.003

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Reinforcement learning methods have recently been very successful at performing complex sequential tasks like playing Atari games, Go and Poker. These algorithms have outperformed humans in several tasks by learning from scratch, using only scalar rewards obtained through interaction with their environment. While there certainly has been considerable independent innovation to produce such results, many core ideas in reinforcement learning are inspired by phenomena in animal learning, psychology and neuroscience. In this paper, we comprehensively review a large number of findings in both neuroscience and psychology that evidence reinforcement learning as a promising candidate for modeling learning and decision making in the brain. In doing so, we construct a mapping between various classes of modern RL algorithms and specific findings in both neurophysiological and behavioral literature. We then discuss the implications of this observed relationship between RL, neuroscience and psychology and its role in advancing research in both AI and brain science. (C) 2021 Elsevier Ltd. All rights reserved.

Pages: 271-287

Number of pages: 17
