A Survey on Deep Reinforcement Learning Algorithms for Robotic Manipulation

被引：82

作者：

Han, Dong ^{[1
]}

Mulyana, Beni ^{[1
]}

Stankovic, Vladimir ^{[2
]}

Cheng, Samuel ^{[1
]}

机构：

[1] Univ Oklahoma, Sch Elect & Comp Engn, Norman, OK 73019 USA

[2] Univ Straclyde, Dept Elect & Elect Engn, Glasglow G1 1XW, Scotland

来源：

SENSORS | 2023年 / 23卷 / 07期

关键词：

reinforcement learning; robotic manipulation; graph neural network; NEURAL-NETWORKS;

D O I：

10.3390/s23073762

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Robotic manipulation challenges, such as grasping and object manipulation, have been tackled successfully with the help of deep reinforcement learning systems. We give an overview of the recent advances in deep reinforcement learning algorithms for robotic manipulation tasks in this review. We begin by outlining the fundamental ideas of reinforcement learning and the parts of a reinforcement learning system. The many deep reinforcement learning algorithms, such as value-based methods, policy-based methods, and actor-critic approaches, that have been suggested for robotic manipulation tasks are then covered. We also examine the numerous issues that have arisen when applying these algorithms to robotics tasks, as well as the various solutions that have been put forth to deal with these issues. Finally, we highlight several unsolved research issues and talk about possible future directions for the subject.

引用

页数：35

共 157 条

[1]

Rusu AA, 2016, Arxiv, DOI [arXiv:1606.04671, DOI 10.43550/ARXIV:1606.04671, DOI 10.48550/ARXIV.1606.04671]

[2]

Almasan J., 2020, Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case

[3] Evaluation of Variable Impedance- and Hybrid Force/Motion-Controllers for Learning Force Tracking Skills [J].

Anand, Akhil S. ;

Myrestrand, Martin Hagen ;

Gravdahl, Jan Tommy .

2022 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII 2022), 2022, :83-89

[4]

Andrychowicz M., 2017, arXiv

[5] Learning dexterous in-hand manipulation [J].

Andrychowicz, Marcin ;

Baker, Bowen ;

Chociej, Maciek ;

Jozefowicz, Rafal ;

McGrew, Bob ;

Pachocki, Jakub ;

Petron, Arthur ;

Plappert, Matthias ;

Powell, Glenn ;

Ray, Alex ;

Schneider, Jonas ;

Sidor, Szymon ;

Tobin, Josh ;

Welinder, Peter ;

Weng, Lilian ;

Zaremba, Wojciech .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) :3-20

[6]

Baram N, 2017, 34 INT C MACHINE LEA, V70

[7] Robotic architectural assembly with tactile skills: Simulation and optimization [J].

Belousov, Boris ;

Wibranek, Bastian ;

Schneider, Jan ;

Schneider, Tim ;

Chalvatzaki, Georgia ;

Peters, Jan ;

Tessmann, Oliver .

AUTOMATION IN CONSTRUCTION, 2022, 133

[8] Context meta-reinforcement learning via neuromodulation [J].

Ben-Iwhiwhu, Eseoghene ;

Dick, Jeffery ;

Ketz, Nicholas A. ;

Pilly, Praveen K. ;

Soltoggio, Andrea .

NEURAL NETWORKS, 2022, 152 :70-79

[9]

Bengio Y., 2009, P 26 ANN INT C MACH, P41, DOI DOI 10.1145/1553374.1553380

[10] Prioritized Hindsight with Dual Buffer for Meta-Reinforcement Learning [J].

Beyene, Sofanit Wubeshet ;

Han, Ji-Hyeong .

ELECTRONICS, 2022, 11 (24)

← 1 2 3 4 5 6 7 8 9 10 →