Advanced Reinforcement Learning and Its Connections with Brain Neuroscience

被引:11
作者
Fan, Chaoqiong [1 ]
Yao, Li [1 ]
Zhang, Jiacai [1 ]
Zhen, Zonglei [2 ]
Wu, Xia [1 ]
机构
[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Beijing Normal Univ, Fac Psychol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
PREFRONTAL CORTEX; MEMORY; MODEL; PREDICTION; STIGMERGY; ATTENTION; DORSAL; CHOICE; GO;
D O I
10.34133/research.0064
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, brain science and neuroscience have greatly propelled the innovation of computer science. In particular, knowledge from the neurobiology and neuropsychology of the brain revolutionized the development of reinforcement learning (RL) by providing novel interpretable mechanisms of how the brain achieves intelligent and efficient decision making. Triggered by this, there has been a boom in research about advanced RL algorithms that are built upon the inspirations of brain neuroscience. In this work, to further strengthen the bidirectional link between the 2 communities and especially promote the research on modern RL technology, we provide a comprehensive survey of recent advances in the area of brain-inspired/related RL algorithms. We start with basis theories of RL, and present a concise introduction to brain neuroscience related to RL. Then, we classify these advanced RL methodologies into 3 categories according to different connections of the brain, i.e., micro-neural activity, macro-brain structure, and cognitive function. Each category is further surveyed by presenting several modern RL algorithms along with their mathematical models, correlations with the brain, and open issues. Finally, we introduce several important applications of RL algorithms, followed by the discussions of challenges and opportunities for future research.
引用
收藏
页数:17
相关论文
共 112 条
  • [1] [Anonymous], 2011, Computational Neuroscience for Advancing Artificial Intelligence, DOI DOI 10.4018/978-1-60960-021-1.CH006
  • [2] [Anonymous], 2002, ICML
  • [3] Gliotransmitters Travel in Time and Space
    Araque, Alfonso
    Carmignoto, Giorgio
    Haydon, Philip G.
    Oliet, Stephane H. R.
    Robitaille, Richard
    Volterra, Andrea
    [J]. NEURON, 2014, 81 (04) : 728 - 739
  • [4] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [5] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
  • [6] Bahdanau D, 2017, P 5 INT C LEARNING R, P24
  • [7] Entorhinal and ventromedial prefrontal cortices abstract and generalize the structure of reinforcement learning problems
    Baram, Alon Boaz
    Muller, Timothy Howard
    Nili, Hamed
    Garvert, Mona Maria
    Behrens, Timothy Edward John
    [J]. NEURON, 2021, 109 (04) : 713 - +
  • [8] Barreto A., 2018, PR MACH LEARN RES, P501
  • [9] Barto AG, 2003, DISCRETE EVENT DYN S, V13, P41, DOI [10.1023/A:1022140919877, 10.1023/A:1025696116075]
  • [10] Bellemare MG, INT C MACH LEARN 201