Comparison of Model-Based and Model-Free Reinforcement Learning for Real-World Dexterous Robotic Manipulation Tasks

Cited by: 3
Authors
Valencia, David [1]
Jia, John [1]
Li, Raymond [1]
Hayashi, Alex [2]
Lecchi, Megan [2]
Terezakis, Reuel [2]
Gee, Trevor [1]
Liarokapis, Minas [2]
MacDonald, Bruce A. [1]
Williams, Henry [1]
Affiliations
[1] Univ Auckland, Ctr Automat & Robot Engn Sci, Auckland, New Zealand
[2] Univ Auckland, New Dexter Res Grp, Auckland, New Zealand
DOI
10.1109/ICRA48891.2023.10160983
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Model-Free Reinforcement Learning (MFRL) has shown significant promise for learning dexterous robotic manipulation tasks, at least in simulation. However, the high number of samples required and the long training times prevent MFRL from scaling to complex real-world tasks. Model-Based Reinforcement Learning (MBRL) has emerged as a potential solution that, in theory, can improve on the data efficiency of MFRL approaches. This could drastically reduce training time and broaden the application of RL to real-world robotic tasks. This article presents a study on the feasibility of using state-of-the-art MBRL to reduce the training time for two real-world dexterous manipulation tasks. The evaluation is conducted on a real low-cost robot gripper, where the predictive model and the control policy are learned from scratch. The results indicate that MBRL is capable of learning accurate models of the world, but does not show the clear improvements in learning the control policy in the real world that prior literature suggests should be expected.
Pages: 871-878
Number of pages: 8