Reinforcement Learning in Multidimensional Environments Relies on Attention Mechanisms

Cited by: 234
Authors
Niv, Yael [1, 2]
Daniel, Reka [1, 2]
Geana, Andra [1, 2]
Gershman, Samuel J. [3]
Leong, Yuan Chang [4]
Radulescu, Angela [1, 2]
Wilson, Robert C. [5, 6]
Affiliations
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[2] Princeton Univ, Inst Neurosci, Princeton, NJ 08540 USA
[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
[4] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[5] Univ Arizona, Dept Psychol, Tucson, AZ 85721 USA
[6] Univ Arizona, Cognit Sci Program, Tucson, AZ 85721 USA
Keywords
attention; fMRI; frontoparietal network; model comparison; reinforcement learning; representation learning; PREFRONTAL CORTEX; PREDICTION ERRORS; SELECTIVE ATTENTION; COGNITIVE FUNCTIONS; PARKINSONS-DISEASE; NEURAL MECHANISMS; FRONTAL-CORTEX; MODELS; TASK; CATEGORIZATION
DOI
10.1523/JNEUROSCI.2978-14.2015
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
In recent years, ideas from the computational field of reinforcement learning have revolutionized the study of learning in the brain, famously providing new, precise theories of how dopamine affects learning in the basal ganglia. However, reinforcement learning algorithms are notorious for not scaling well to multidimensional environments, as is required for real-world learning. We hypothesized that the brain naturally reduces the dimensionality of real-world problems to only those dimensions that are relevant to predicting reward, and conducted an experiment to assess by what algorithms and with what neural mechanisms this "representation learning" process is realized in humans. Our results suggest that a bilateral attentional control network comprising the intraparietal sulcus, precuneus, and dorsolateral prefrontal cortex is involved in selecting what dimensions are relevant to the task at hand, effectively updating the task representation through trial and error. In this way, cortical attention mechanisms interact with learning in the basal ganglia to solve the "curse of dimensionality" in reinforcement learning.
Pages: 8145-8157
Number of pages: 13