RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control

被引:0
|
作者
Xiang, Yanfei [1 ]
Wang, Xin [2 ,3 ]
Hu, Shu [4 ]
Zhu, Bin [5 ]
Huang, Xiaomeng [1 ]
Wu, Xi [6 ]
Lyu, Siwei [3 ]
机构
[1] Tsinghua Univ, Inst Global Change Studies, Dept Earth Syst Sci, Minist Educ Key Lab Earth Syst Modeling, Beijing 100084, Peoples R China
[2] SUNY Albany, Albany, NY 12222 USA
[3] SUNY Buffalo, Buffalo, NY 14222 USA
[4] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[5] Microsoft Res Asia, Beijing, Peoples R China
[6] Chengdu Univ Informat Technol, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/IROS55552.2023.10342479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning is used to tackle complex tasks with high-dimensional sensory inputs. Over the past decade, a wide range of reinforcement learning algorithms have been developed, with recent progress benefiting from deep learning for raw sensory signal representation. This raises a natural question: how well do these algorithms perform across different robotic manipulation tasks? To objectively compare algorithms, benchmarks use performance metrics. Benchmarks use objective performance metrics to offer a scientific way to compare algorithms. In this paper, we introduce RMBench, the first benchmark for robotic manipulations with high-dimensional continuous action and state spaces. We implement and evaluate reinforcement learning algorithms that take observed pixels as inputs and report their average performance and learning curves to demonstrate their performance and training stability. Our study concludes that none of the evaluated algorithms can handle all tasks well, with soft Actor-Critic outperforming most algorithms in terms of average reward and stability, and an algorithm combined with data augmentation potentially facilitating learning policies. Our code is publicly available at https://github.com/xiangyanfei212/RMBench- 2022. git, including all benchmark tasks and studied algorithms.
引用
收藏
页码:1207 / 1214
页数:8
相关论文
共 50 条
  • [1] Decentralized reinforcement learning control of a robotic manipulator
    Busoniu, Lucian
    De Schutter, Bart
    Babuska, Robert
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1121 - +
  • [2] Reinforcement Learning Control for a Robotic Manipulator with Unknown Deadzone
    Li, Yanan
    Xiao, Shengtao
    Ge, Shuzhi Sam
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 593 - 598
  • [3] A Reinforcement Learning Neural Network for Robotic Manipulator Control
    Hu, Yazhou
    Si, Bailu
    NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
  • [4] Trajectory Tracking Control Based on Deep Reinforcement Learning for a Robotic Manipulator with an Input Deadzone
    Wang, Fujie
    Hu, Jintao
    Qin, Yi
    Guo, Fang
    Jiang, Ming
    SYMMETRY-BASEL, 2025, 17 (02):
  • [5] Benchmarking Deep Reinforcement Learning for Continuous Control
    Duan, Yan
    Chen, Xi
    Houthooft, Rein
    Schulman, John
    Abbeel, Pieter
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [6] Manipulator Control Method Based on Deep Reinforcement Learning
    Zeng, Rui
    Liu, Manlu
    Zhang, Junjun
    Li, Xinmao
    Zhou, Qijie
    Jiang, Yuanchen
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 415 - 420
  • [7] Intelligent Control of Manipulator Based on Deep Reinforcement Learning
    Zhou, Jiangtao
    Zheng, Hua
    Zhao, Dongzhu
    Chen, Yingxue
    2021 12TH INTERNATIONAL CONFERENCE ON MECHANICAL AND AEROSPACE ENGINEERING (ICMAE), 2021, : 275 - 279
  • [8] Manipulator Control using Federated Deep Reinforcement Learning
    Shivkumar, S.
    Kumaar, A. A. Nippun
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [9] Predictive Control of a Robot Manipulator with Deep Reinforcement Learning
    Bejar, Eduardo
    Moran, Antonio
    2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021, : 127 - 130
  • [10] A proximal policy optimization based deep reinforcement learning framework for tracking control of a flexible robotic manipulator
    Kumar, V. Joshi
    Elumalai, Vinodh Kumar
    RESULTS IN ENGINEERING, 2025, 25