Reinforcement Learning in Multidimensional Environments Relies on Attention Mechanisms

被引：234

作者：

Niv, Yael ^{[1
,2
]}

Daniel, Reka ^{[1
,2
]}

Geana, Andra ^{[1
,2
]}

Gershman, Samuel J. ^{[3
]}

Leong, Yuan Chang ^{[4
]}

Radulescu, Angela ^{[1
,2
]}

Wilson, Robert C. ^{[5
,6
]}

机构：

[1] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA

[2] Princeton Univ, Inst Neurosci, Princeton, NJ 08540 USA

[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA

[4] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA

[5] Univ Arizona, Dept Psychol, Tucson, AZ 85721 USA

[6] Univ Arizona, Cognit Sci Program, Tucson, AZ 85721 USA

来源：

JOURNAL OF NEUROSCIENCE | 2015年 / 35卷 / 21期

关键词：

attention; fMRI; frontoparietal network; model comparison; reinforcement learning; representation learning; PREFRONTAL CORTEX; PREDICTION ERRORS; SELECTIVE ATTENTION; COGNITIVE FUNCTIONS; PARKINSONS-DISEASE; NEURAL MECHANISMS; FRONTAL-CORTEX; MODELS; TASK; CATEGORIZATION;

D O I：

10.1523/JNEUROSCI.2978-14.2015

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

In recent years, ideas from the computational field of reinforcement learning have revolutionized the study of learning in the brain, famously providing new, precise theories of how dopamine affects learning in the basal ganglia. However, reinforcement learning algorithms are notorious for not scaling well to multidimensional environments, as is required for real-world learning. We hypothesized that the brain naturally reduces the dimensionality of real-world problems to only those dimensions that are relevant to predicting reward, and conducted an experiment to assess by what algorithms and with what neural mechanisms this "representation learning" process is realized in humans. Our results suggest that a bilateral attentional control network comprising the intraparietal sulcus, precuneus, and dorsolateral prefrontal cortex is involved in selecting what dimensions are relevant to the task at hand, effectively updating the task representation through trial and error. In this way, cortical attention mechanisms interact with learning in the basal ganglia to solve the "curse of dimensionality" in reinforcement learning.

引用

页码：8145 / 8157

页数：13

共 50 条

[21] Dynamic Attention Network for Multi-UAV Reinforcement Learning
Xu, Dongsheng
Wu, Shang
[J]. INTERNATIONAL CONFERENCE ON ALGORITHMS, HIGH PERFORMANCE COMPUTING, AND ARTIFICIAL INTELLIGENCE (AHPCAI 2021), 2021, 12156
[22] Reinforcement Learning with Attention that Works: A Self-Supervised Approach
Manchin, Anthony
Abbasnejad, Ehsan
van den Hengel, Anton
[J]. NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 223 - 230
[23] Role-based attention in deep reinforcement learning for games
Yang, Dong
Yang, Wenjing
Li, Minglong
Yang, Qiong
[J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (02)
[24] Centralized reinforcement learning for multi-agent cooperative environments
Chengxuan Lu
Qihao Bao
Shaojie Xia
Chongxiao Qu
[J]. Evolutionary Intelligence, 2024, 17 : 267 - 273
[25] Centralized reinforcement learning for multi-agent cooperative environments
Lu, Chengxuan
Bao, Qihao
Xia, Shaojie
Qu, Chongxiao
[J]. EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 267 - 273
[26] Neural mechanisms of reinforcement learning under mortality threat
Gao, Tianyu
Zhou, Yuqing
Li, Wenxin
Pfabigan, Daniela M.
Han, Shihui
[J]. SOCIAL NEUROSCIENCE, 2020, 15 (02) : 170 - 185
[27] Enhancing reinforcement learning for de novo molecular design applying self-attention mechanisms
Pereira, Tiago O.
Abbasi, Maryam
Arrais, Joel P.
[J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
[28] Reinforcement Learning Applied to PSO for Multidimensional Knapsack Problem
Olivares, Rodrigo
Rios, Victor
Olivares, Pablo
Serrano, Benjamin
[J]. MACHINE LEARNING METHODS IN SYSTEMS, VOL 4, CSOC 2024, 2024, 1126 : 375 - 382
[29] Mechanisms of value-learning in the guidance of spatial attention
Anderson, Brian A.
Kim, Haena
[J]. COGNITION, 2018, 178 : 26 - 36
[30] Preparatory Attention Relies on Dynamic Interactions between Prelimbic Cortex and Anterior Cingulate Cortex
Totah, Nelson K. B.
Jackson, Mark E.
Moghaddam, Bita
[J]. CEREBRAL CORTEX, 2013, 23 (03) : 729 - 738

← 1 2 3 4 5 →