The neural architecture of theory-based reinforcement learning

被引:7
|
作者
Tomov, Momchil S. [1 ,2 ,4 ,5 ]
Tsividis, Pedro A. [3 ,4 ]
Pouncy, Thomas [1 ,2 ]
Tenenbaum, Joshua B. [3 ,4 ]
Gershman, Samuel J. [1 ,2 ,4 ]
机构
[1] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
[4] MIT, Ctr Brains Minds & Machines, Cambridge, MA 02139 USA
[5] Mot AD Inc, Boston, MA 02210 USA
关键词
ORBITOFRONTAL CORTEX; CAUSAL INFERENCE; COGNITIVE MAPS; MODEL; PREDICTION; BRAIN; GO; REPRESENTATIONS; KNOWLEDGE; HUMANS;
D O I
10.1016/j.neuron.2023.01.023
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Humans learn internal models of the world that support planning and generalization in complex environ-ments. Yet it remains unclear how such internal models are represented and learned in the brain. We approach this question using theory-based reinforcement learning, a strong form of model-based reinforce-ment learning in which the model is a kind of intuitive theory. We analyzed fMRI data from human participants learning to play Atari-style games. We found evidence of theory representations in prefrontal cortex and of theory updating in prefrontal cortex, occipital cortex, and fusiform gyrus. Theory updates coincided with transient strengthening of theory representations. Effective connectivity during theory updating suggests that information flows from prefrontal theory-coding regions to posterior theory-updating regions. Together, our results are consistent with a neural architecture in which top-down theory representations originating in prefrontal regions shape sensory predictions in visual areas, where factored theory prediction errors are computed and trigger bottom-up updates of the theory.
引用
收藏
页码:1331 / +
页数:23
相关论文
共 50 条
  • [31] Community theory-based learning framework for Higher education
    Li, Qian
    Yan, Fei
    LEARNING AND MOTIVATION, 2023, 84
  • [32] A novel neural network based reinforcement learning
    Fan, Jian
    Song, Yang
    Fei, MinRui
    Zhao, Qijie
    BIO-INSPIRED COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2007, 4688 : 46 - +
  • [33] Optimization Theory-Based Deep Reinforcement Learning for Resource Allocation in Ultra-Reliable Wireless Networked Control Systems
    Ali, Hamida Qumber
    Darabi, Amirhassan Babazadeh
    Coleri, Sinem
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (08) : 4774 - 4787
  • [34] A generic architecture for adaptive agents based on reinforcement learning
    Preux, P
    Delepoulle, S
    Darcheville, JC
    INFORMATION SCIENCES, 2004, 161 (1-2) : 37 - 55
  • [35] Need for theory-based methods to test theory-based questions - Reply
    Mathias, JL
    Nettelbeck, T
    Willson, RJ
    RESEARCH IN DEVELOPMENTAL DISABILITIES, 1996, 17 (02) : 153 - 160
  • [36] Reinforcement Learning Based Neural Architecture Search for Flaw Detection in Intelligent Ultrasonic Imaging NDE System
    Zhang, Xin
    Saniie, Jafar
    2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
  • [37] A reinforcement learning based neural network architecture for obstacle avoidance in multi-fingered grasp synthesis
    Rezzoug, Nasser
    Gorce, Philippe
    NEUROCOMPUTING, 2009, 72 (4-6) : 1229 - 1241
  • [38] Reinforcement learning-based architecture search for quantum machine learning
    Rapp, Frederic
    Kreplin, David A.
    Huber, Marco F.
    Roth, Marco
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2025, 6 (01):
  • [39] NINTER - Networked Interaction: Theory-Based Cases in Teaching and Learning
    Saarenkunnas M.
    Järvelä S.
    Kuure L.
    Kunelius E.
    Häkkinen P.
    Taalas P.
    Learning Environments Research, 2000, 3 (1) : 35 - 50
  • [40] Learning and motivation in chemistry education: A theory-based integrative model
    Zimmerman, James A.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2005, 230 : U853 - U854