Reinforcement learning algorithms: A brief survey

Cited by: 171

Authors:

Shakya, Ashish Kumar [1]

Pillai, Gopinatha [1]

Chakrabarty, Sohom [1]

Affiliations:

[1] Indian Inst Technol, Dept Elect Engn, Roorkee 247667, Uttaranchal, India

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2023 / Volume 231

Keywords:

Reinforcement learning; Stochastic optimal control; Function approximation; Deep Reinforcement Learning (DRL); PARTICLE SWARM OPTIMIZATION; GRAPH NEURAL-NETWORK; DIALOGUE MANAGEMENT; ROBOT NAVIGATION; LEVEL; GAME; GO; ENVIRONMENT; MODEL; FACILITIES;

DOI:

10.1016/j.eswa.2023.120495

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Reinforcement Learning (RL) is a machine learning (ML) technique to learn sequential decision-making in complex problems. RL is inspired by trial-and-error based human/animal learning. It can learn an optimal policy autonomously with knowledge obtained by continuous interaction with a stochastic dynamical environment. Problems considered virtually impossible to solve, such as learning to play video games just from pixel information, are now successfully solved using deep reinforcement learning. Without human intervention, RL agents can surpass human performance in challenging tasks. This review gives a broad overview of RL, covering its fundamental principles, essential methods, and illustrative applications. The authors aim to develop an initial reference point for researchers commencing their research work in RL. In this review, the authors cover some fundamental model-free RL algorithms and pathbreaking function approximation-based deep RL (DRL) algorithms for complex uncertain tasks with continuous action and state spaces, making RL useful in various interdisciplinary fields. This article also provides a brief review of model-based and multi-agent RL approaches. Finally, some promising research directions for RL are briefly presented.

Pages: 32

References: 365 in total

