Advanced Reinforcement Learning and Its Connections with Brain Neuroscience

被引：11

作者：

Fan, Chaoqiong ^{[1
]}

Yao, Li ^{[1
]}

Zhang, Jiacai ^{[1
]}

Zhen, Zonglei ^{[2
]}

Wu, Xia ^{[1
]}

机构：

[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing, Peoples R China

[2] Beijing Normal Univ, Fac Psychol, Beijing, Peoples R China

来源：

RESEARCH | 2023年 / 6卷

基金：

中国国家自然科学基金;

关键词：

PREFRONTAL CORTEX; MEMORY; MODEL; PREDICTION; STIGMERGY; ATTENTION; DORSAL; CHOICE; GO;

D O I：

10.34133/research.0064

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

In recent years, brain science and neuroscience have greatly propelled the innovation of computer science. In particular, knowledge from the neurobiology and neuropsychology of the brain revolutionized the development of reinforcement learning (RL) by providing novel interpretable mechanisms of how the brain achieves intelligent and efficient decision making. Triggered by this, there has been a boom in research about advanced RL algorithms that are built upon the inspirations of brain neuroscience. In this work, to further strengthen the bidirectional link between the 2 communities and especially promote the research on modern RL technology, we provide a comprehensive survey of recent advances in the area of brain-inspired/related RL algorithms. We start with basis theories of RL, and present a concise introduction to brain neuroscience related to RL. Then, we classify these advanced RL methodologies into 3 categories according to different connections of the brain, i.e., micro-neural activity, macro-brain structure, and cognitive function. Each category is further surveyed by presenting several modern RL algorithms along with their mathematical models, correlations with the brain, and open issues. Finally, we introduce several important applications of RL algorithms, followed by the discussions of challenges and opportunities for future research.

引用

页数：17

共 112 条

[1] [Anonymous], 2011, Computational Neuroscience for Advancing Artificial Intelligence, DOI DOI 10.4018/978-1-60960-021-1.CH006
[2] [Anonymous], 2002, ICML
[3] Gliotransmitters Travel in Time and Space
Araque, Alfonso
Carmignoto, Giorgio
Haydon, Philip G.
Oliet, Stephane H. R.
Robitaille, Richard
Volterra, Andrea
[J]. NEURON, 2014, 81 (04) : 728 - 739
[4] Deep Reinforcement Learning A brief survey
Arulkumaran, Kai
Deisenroth, Marc Peter
Brundage, Miles
Bharath, Anil Anthony
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
[5] Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
[6] Bahdanau D, 2017, P 5 INT C LEARNING R, P24
[7] Entorhinal and ventromedial prefrontal cortices abstract and generalize the structure of reinforcement learning problems
Baram, Alon Boaz
Muller, Timothy Howard
Nili, Hamed
Garvert, Mona Maria
Behrens, Timothy Edward John
[J]. NEURON, 2021, 109 (04) : 713 - +
[8] Barreto A., 2018, PR MACH LEARN RES, P501
[9] Barto AG, 2003, DISCRETE EVENT DYN S, V13, P41, DOI [10.1023/A:1022140919877, 10.1023/A:1025696116075]
[10] Bellemare MG, INT C MACH LEARN 201

← 1 2 3 4 5 6 7 8 9 10 →