The neural architecture of theory-based reinforcement learning

被引:7
|
作者
Tomov, Momchil S. [1 ,2 ,4 ,5 ]
Tsividis, Pedro A. [3 ,4 ]
Pouncy, Thomas [1 ,2 ]
Tenenbaum, Joshua B. [3 ,4 ]
Gershman, Samuel J. [1 ,2 ,4 ]
机构
[1] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
[4] MIT, Ctr Brains Minds & Machines, Cambridge, MA 02139 USA
[5] Mot AD Inc, Boston, MA 02210 USA
关键词
ORBITOFRONTAL CORTEX; CAUSAL INFERENCE; COGNITIVE MAPS; MODEL; PREDICTION; BRAIN; GO; REPRESENTATIONS; KNOWLEDGE; HUMANS;
D O I
10.1016/j.neuron.2023.01.023
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Humans learn internal models of the world that support planning and generalization in complex environ-ments. Yet it remains unclear how such internal models are represented and learned in the brain. We approach this question using theory-based reinforcement learning, a strong form of model-based reinforce-ment learning in which the model is a kind of intuitive theory. We analyzed fMRI data from human participants learning to play Atari-style games. We found evidence of theory representations in prefrontal cortex and of theory updating in prefrontal cortex, occipital cortex, and fusiform gyrus. Theory updates coincided with transient strengthening of theory representations. Effective connectivity during theory updating suggests that information flows from prefrontal theory-coding regions to posterior theory-updating regions. Together, our results are consistent with a neural architecture in which top-down theory representations originating in prefrontal regions shape sensory predictions in visual areas, where factored theory prediction errors are computed and trigger bottom-up updates of the theory.
引用
收藏
页码:1331 / +
页数:23
相关论文
共 50 条
  • [41] Cooperative Learning in Engineering Education: a Game Theory-Based Approach
    Huang, Huei-Chun
    Shih, Shen-Guan
    Lai, Wei-Cheng
    INTERNATIONAL JOURNAL OF ENGINEERING EDUCATION, 2011, 27 (04) : 875 - 884
  • [42] Using choice theory-based instruction to foster vocabulary learning
    Marashi, Hamid
    Erami, Neda
    LANGUAGE LEARNING JOURNAL, 2021, 49 (01): : 66 - 76
  • [43] Theory-Based Practice
    Marshak, Robert J.
    JOURNAL OF APPLIED BEHAVIORAL SCIENCE, 2020, 56 (02): : 140 - 142
  • [44] Theory-based induction
    Kemp, C
    Tenenbaum, JB
    PROCEEDINGS OF THE TWENTY-FIFTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, PTS 1 AND 2, 2003, : 658 - 663
  • [45] THEORY-BASED NEUROREHABILITATION
    BACHYRITA, P
    ARCHIVES OF PHYSICAL MEDICINE AND REHABILITATION, 1989, 70 (02): : 162 - 162
  • [46] Game Theory-Based Control System Algorithms with Real-Time Reinforcement Learning HOW TO SOLVE MULTIPLAYER GAMES ONLINE
    Vamvoudakis, Kyriakos G.
    Modares, Hamidreza
    Kiumarsi, Bahare
    Lewis, Frank L.
    IEEE CONTROL SYSTEMS MAGAZINE, 2017, 37 (01): : 33 - 52
  • [47] Prognostic value of graph theory-based tissue architecture analysis in carcinomas of the tongue
    Sudbo, J
    Bankfalvi, A
    Bryne, M
    Marcelpoil, R
    Boysen, M
    Piffko, J
    Hemmer, J
    Kraft, K
    Reith, A
    LABORATORY INVESTIGATION, 2000, 80 (12) : 1881 - 1889
  • [48] Photonic architecture for reinforcement learning
    Flamini, Fulvio
    Hamann, Arne
    Jerbi, Sofiene
    Trenkwalder, Lea M.
    Nautrup, Hendrik Poulsen
    Briegel, Hans J.
    2020 PHOTONICS NORTH (PN), 2020,
  • [49] Concepts and facilities of a neural reinforcement learning control architecture for technical process control
    Riedmiller, M
    NEURAL COMPUTING & APPLICATIONS, 1999, 8 (04): : 323 - 338
  • [50] RADARS: Memory Efficient Reinforcement Learning Aided Differentiable Neural Architecture Search
    Yan, Zheyu
    Jiang, Weiwen
    Hu, Xiaobo Sharon
    Shi, Yiyu
    27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 128 - 133