Transfer learning with Partially Constrained Models: Application to reinforcement learning of linked multicomponent robot system control

被引:17
|
作者
Fernandez-Gauna, Borja [1 ]
Manuel Lopez-Guede, Jose [1 ]
Grana, Manuel [1 ]
机构
[1] Univ Basque Country, Computat Intelligence Grp, UPV EHU, Bilbao, Spain
关键词
Reinforcement learning; Linked multicomponent robotic systems; Transfer learning; Hose transportation; MDPS;
D O I
10.1016/j.robot.2012.07.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transfer learning is a hierarchical approach to reinforcement learning of complex tasks modeled as Markov Decision Processes. The learning results on the source task are used as the starting point for the learning on the target task. In this paper we deal with a hierarchy of constrained systems, where the source task is an under-constrained system, hence called the Partially Constrained Model (PCM). Constraints in the framework of reinforcement learning are dealt with by state-action veto policies. We propose a theoretical background for the hierarchy of training refinements, showing that the effective action repertoires learnt on the PCM are maximal, and that the PCM-optimal policy gives maximal state value functions. We apply the approach to learn the control of Linked Multicomponent Robotic Systems using Reinforcement Learning. The paradigmatic example is the transportation of a hose. The system has strong physical constraints and a large state space. Learning experiments in the target task are realized over an accurate but computationally expensive simulation of the hose dynamics. The PCM is obtained simplifying the hose model. Learning results of the PCM Transfer Learning show an spectacular improvement over conventional Q-learning on the target task. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:694 / 703
页数:10
相关论文
共 50 条
  • [1] Application of reinforcement learning to dexterous robot control
    Bucak, IO
    Zohdy, MA
    PROCEEDINGS OF THE 1998 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1998, : 1405 - 1409
  • [2] Reinforcement learning for robot control
    Smart, WD
    Kaelbling, LP
    MOBILE ROBOTS XVI, 2002, 4573 : 92 - 103
  • [3] Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning
    Hua, Jiang
    Zeng, Liangcai
    Li, Gongfa
    Ju, Zhaojie
    SENSORS, 2021, 21 (04) : 1 - 21
  • [4] Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning
    Hua, Jiang
    Zeng, Liangcai
    Li, Gongfa
    Ju, Zhaojie
    Sensors (Switzerland), 2021, 21 (04): : 1 - 21
  • [5] Application of reinforcement learning in control system development
    Vichugov, VN
    Tsapko, GP
    Tsapko, SG
    Korus 2005, Proceedings, 2005, : 732 - 733
  • [6] POPCORN: Partially Observed Prediction Constrained Reinforcement Learning
    Futoma, Joseph
    Hughes, Michael C.
    Doshi-Velez, Finale
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 3578 - 3587
  • [7] Reinforcement learning and its application to force control of an industrial robot
    Song, KT
    Chu, TS
    CONTROL ENGINEERING PRACTICE, 1998, 6 (01) : 37 - 44
  • [8] APPLICATION OF REINFORCEMENT LEARNING TO A TWO DOF ROBOT ARM CONTROL
    Albers, Albert
    Yan Wenjie
    Frietsch, Markus
    ANNALS OF DAAAM FOR 2009 & PROCEEDINGS OF THE 20TH INTERNATIONAL DAAAM SYMPOSIUM, 2009, 20 : 415 - 416
  • [9] Hybrid reinforcement learning and its application to biped robot control
    Yamada, S
    Watanabe, A
    Nakashima, M
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1071 - 1077
  • [10] Application of reinforcement learning in robot soccer
    Duan, Yong
    Liu, Qiang
    Xu, Xinhe
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (07) : 936 - 950