The neural architecture of theory-based reinforcement learning

Cited by: 7

Authors:

Tomov, Momchil S. [1,2,4,5]

Tsividis, Pedro A. [3,4]

Pouncy, Thomas [1,2]

Tenenbaum, Joshua B. [3,4]

Gershman, Samuel J. [1,2,4]

Affiliations:

[1] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA

[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA

[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA

[4] MIT, Ctr Brains Minds & Machines, Cambridge, MA 02139 USA

[5] Mot AD Inc, Boston, MA 02210 USA

Source:

NEURON | 2023 / Vol. 111 / Issue 08

Keywords:

ORBITOFRONTAL CORTEX; CAUSAL INFERENCE; COGNITIVE MAPS; MODEL; PREDICTION; BRAIN; GO; REPRESENTATIONS; KNOWLEDGE; HUMANS;

DOI:

10.1016/j.neuron.2023.01.023

Chinese Library Classification (CLC) number:

Q189 [Neuroscience];

Discipline classification code:

071006;

Abstract:

Humans learn internal models of the world that support planning and generalization in complex environments. Yet it remains unclear how such internal models are represented and learned in the brain. We approach this question using theory-based reinforcement learning, a strong form of model-based reinforcement learning in which the model is a kind of intuitive theory. We analyzed fMRI data from human participants learning to play Atari-style games. We found evidence of theory representations in prefrontal cortex and of theory updating in prefrontal cortex, occipital cortex, and fusiform gyrus. Theory updates coincided with transient strengthening of theory representations. Effective connectivity during theory updating suggests that information flows from prefrontal theory-coding regions to posterior theory-updating regions. Together, our results are consistent with a neural architecture in which top-down theory representations originating in prefrontal regions shape sensory predictions in visual areas, where factored theory prediction errors are computed and trigger bottom-up updates of the theory.

Pages: 1331 / +

Page count: 23
