RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control

被引：0

作者：

Xiang, Yanfei ^{[1
]}

Wang, Xin ^{[2
,3
]}

Hu, Shu ^{[4
]}

Zhu, Bin ^{[5
]}

Huang, Xiaomeng ^{[1
]}

Wu, Xi ^{[6
]}

Lyu, Siwei ^{[3
]}

机构：

[1] Tsinghua Univ, Inst Global Change Studies, Dept Earth Syst Sci, Minist Educ Key Lab Earth Syst Modeling, Beijing 100084, Peoples R China

[2] SUNY Albany, Albany, NY 12222 USA

[3] SUNY Buffalo, Buffalo, NY 14222 USA

[4] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[5] Microsoft Res Asia, Beijing, Peoples R China

[6] Chengdu Univ Informat Technol, Chengdu, Peoples R China

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS | 2023年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/IROS55552.2023.10342479

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning is used to tackle complex tasks with high-dimensional sensory inputs. Over the past decade, a wide range of reinforcement learning algorithms have been developed, with recent progress benefiting from deep learning for raw sensory signal representation. This raises a natural question: how well do these algorithms perform across different robotic manipulation tasks? To objectively compare algorithms, benchmarks use performance metrics. Benchmarks use objective performance metrics to offer a scientific way to compare algorithms. In this paper, we introduce RMBench, the first benchmark for robotic manipulations with high-dimensional continuous action and state spaces. We implement and evaluate reinforcement learning algorithms that take observed pixels as inputs and report their average performance and learning curves to demonstrate their performance and training stability. Our study concludes that none of the evaluated algorithms can handle all tasks well, with soft Actor-Critic outperforming most algorithms in terms of average reward and stability, and an algorithm combined with data augmentation potentially facilitating learning policies. Our code is publicly available at https://github.com/xiangyanfei212/RMBench- 2022. git, including all benchmark tasks and studied algorithms.

引用

页码：1207 / 1214

页数：8

共 50 条

[1] Decentralized reinforcement learning control of a robotic manipulator
Busoniu, Lucian
De Schutter, Bart
Babuska, Robert
2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1121 - +
[2] Reinforcement Learning Control for a Robotic Manipulator with Unknown Deadzone
Li, Yanan
Xiao, Shengtao
Ge, Shuzhi Sam
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 593 - 598
[3] A Reinforcement Learning Neural Network for Robotic Manipulator Control
Hu, Yazhou
Si, Bailu
NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
[4] Trajectory Tracking Control Based on Deep Reinforcement Learning for a Robotic Manipulator with an Input Deadzone
Wang, Fujie
Hu, Jintao
Qin, Yi
Guo, Fang
Jiang, Ming
SYMMETRY-BASEL, 2025, 17 (02):
[5] Benchmarking Deep Reinforcement Learning for Continuous Control
Duan, Yan
Chen, Xi
Houthooft, Rein
Schulman, John
Abbeel, Pieter
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[6] Manipulator Control Method Based on Deep Reinforcement Learning
Zeng, Rui
Liu, Manlu
Zhang, Junjun
Li, Xinmao
Zhou, Qijie
Jiang, Yuanchen
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 415 - 420
[7] Intelligent Control of Manipulator Based on Deep Reinforcement Learning
Zhou, Jiangtao
Zheng, Hua
Zhao, Dongzhu
Chen, Yingxue
2021 12TH INTERNATIONAL CONFERENCE ON MECHANICAL AND AEROSPACE ENGINEERING (ICMAE), 2021, : 275 - 279
[8] Manipulator Control using Federated Deep Reinforcement Learning
Shivkumar, S.
Kumaar, A. A. Nippun
10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
[9] Predictive Control of a Robot Manipulator with Deep Reinforcement Learning
Bejar, Eduardo
Moran, Antonio
2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021, : 127 - 130
[10] A proximal policy optimization based deep reinforcement learning framework for tracking control of a flexible robotic manipulator
Kumar, V. Joshi
Elumalai, Vinodh Kumar
RESULTS IN ENGINEERING, 2025, 25

← 1 2 3 4 5 →