Reinforcement learning and its connections with neuroscience and psychology

被引:26
|
作者
Subramanian, Ajay [1 ,3 ]
Chitlangia, Sharad [2 ,3 ]
Baths, Veeky [3 ,4 ]
机构
[1] NYU, Dept Psychol, 6 Washington Pl, New York, NY 10003 USA
[2] Amazon, Mumbai, Maharashtra, India
[3] BITS Pilani, Cognit Neurosci Lab, KK Birla Goa Campus,NH-17B, Zuarinagar 403726, Goa, India
[4] BITS Pilani, Dept Biol Sci, KK Birla Goa Campus,NH-17B, Zuarinagar 403726, Goa, India
关键词
Reinforcement learning; Neuroscience; Psychology; TEMPORALLY DISCOUNTED VALUES; PREFRONTAL CORTEX; ORBITOFRONTAL CORTEX; REWARD SIGNALS; PREDICTION ERRORS; COGNITIVE CONTROL; HUMAN STRIATUM; CAUSAL POWER; PLACE CELLS; DOPAMINE;
D O I
10.1016/j.neunet.2021.10.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning methods have recently been very successful at performing complex sequential tasks like playing Atari games, Go and Poker. These algorithms have outperformed humans in several tasks by learning from scratch, using only scalar rewards obtained through interaction with their environment. While there certainly has been considerable independent innovation to produce such results, many core ideas in reinforcement learning are inspired by phenomena in animal learning, psychology and neuroscience. In this paper, we comprehensively review a large number of findings in both neuroscience and psychology that evidence reinforcement learning as a promising candidate for modeling learning and decision making in the brain. In doing so, we construct a mapping between various classes of modern RL algorithms and specific findings in both neurophysiological and behavioral literature. We then discuss the implications of this observed relationship between RL, neuroscience and psychology and its role in advancing research in both AI and brain science. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:271 / 287
页数:17
相关论文
共 50 条
  • [41] The Misbehavior of Reinforcement Learning
    Mongillo, Gianluigi
    Shteingart, Hanan
    Loewenstein, Yonatan
    PROCEEDINGS OF THE IEEE, 2014, 102 (04) : 528 - 541
  • [42] Meta-learning in Reinforcement Learning
    Schweighofer, N
    Doya, K
    NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
  • [43] Integrating Psychology and Neuroscience: Comment on Schwartz et al. (2016)
    Tryon, Warren W.
    AMERICAN PSYCHOLOGIST, 2016, 71 (09) : 896 - 897
  • [44] A plausible neural circuit for decision making and its formation based on reinforcement learning
    Wei, Hui
    Dai, Dawei
    Bu, Yijie
    COGNITIVE NEURODYNAMICS, 2017, 11 (03) : 259 - 281
  • [45] Beyond simple reinforcement learning: the computational neurobiology of reward-learning and valuation
    O'Doherty, John P.
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) : 987 - 990
  • [46] Relevance of working memory for reinforcement learning in older adults varies with timescale of learning
    van de Vijver, Irene
    Ligneul, Romain
    AGING NEUROPSYCHOLOGY AND COGNITION, 2020, 27 (05) : 654 - 676
  • [47] A robotic model of hippocampal reverse replay for reinforcement learning
    Whelan, Matthew T.
    Jimenez-Rodriguez, Alejandro
    Prescott, Tony J.
    Vasilaki, Eleni
    BIOINSPIRATION & BIOMIMETICS, 2023, 18 (01)
  • [48] Hippocampal replays under the scrutiny of reinforcement learning models
    Caze, Romain
    Khamassi, Mehdi
    Aubin, Lise
    Girard, Benoit
    JOURNAL OF NEUROPHYSIOLOGY, 2018, 120 (06) : 2877 - 2896
  • [49] Reinforcement Learning in Multidimensional Environments Relies on Attention Mechanisms
    Niv, Yael
    Daniel, Reka
    Geana, Andra
    Gershman, Samuel J.
    Leong, Yuan Chang
    Radulescu, Angela
    Wilson, Robert C.
    JOURNAL OF NEUROSCIENCE, 2015, 35 (21) : 8145 - 8157
  • [50] Vicarious Reinforcement Learning Signals When Instructing Others
    Apps, Matthew A. J.
    Lesage, Elise
    Ramnani, Narender
    JOURNAL OF NEUROSCIENCE, 2015, 35 (07) : 2904 - 2913