Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems

被引:0
作者
Lutter, Michael [1 ]
Listmann, Kim [2 ]
Peters, Jan [1 ,3 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Petersenstr 30, D-64289 Darmstadt, Germany
[2] ABB Corp Res Ctr Germany, Wallstadter Str 59, D-68526 Ladenburg, Germany
[3] Max Planck Inst Intelligent Syst, Spemannstr 41, D-72076 Tubingen, Germany
来源
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2019年
关键词
MODEL; FRICTION;
D O I
10.1109/iros40897.2019.8968268
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying Deep Learning to control has a lot of potential for enabling the intelligent design of robot control laws. Unfortunately common deep learning approaches to control, such as deep reinforcement learning, require an unrealistic amount of interaction with the real system, do not yield any performance guarantees, and do not make good use of extensive insights from control theory. In particular, common black-box approaches - that abandon all insight from control - are not suitable for complex robot systems. We propose a deep control approach as a bridge between the solid theoretical foundations of energy-based control and the flexibility of deep learning. To accomplish this goal, we extend Deep Lagrangian Networks (DeLaN) to not only adhere to Lagrangian Mechanics but also ensure conservation of energy and passivity of the learned representation. This novel extension is embedded within a energy control law to control under-actuated systems. The resulting DeLaN for energy control (DeLaN 4EC) is the first model learning approach using generic function approximation that is capable of learning energy control because existing approaches cannot learn the system energies directly. DeLaN 4EC exhibits excellent real-time control on the physical Furuta pendulum and learns to swing-up the pendulum while the control law using system identification does not.
引用
收藏
页码:7718 / 7725
页数:8
相关论文
共 40 条
  • [1] Albu-Schaffer A., 2002, THESIS
  • [2] ESTIMATION OF INERTIAL PARAMETERS OF MANIPULATOR LOADS AND LINKS
    ATKESON, CG
    AN, CH
    HOLLERBACH, JM
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1986, 5 (03) : 101 - 119
  • [3] Bona B, 2005, IEEE DECIS CONTR P, P4360
  • [4] Learning and Reproduction of Gestures by Imitation An Approach Based on Hidden Markov Model and Gaussian Mixture Regression
    Calinon, Sylvain
    D'Halluin, Florent
    Sauser, Eric L.
    Caldwell, Darwin G.
    Billard, Aude G.
    [J]. IEEE ROBOTICS & AUTOMATION MAGAZINE, 2010, 17 (02) : 44 - 54
  • [5] NONLINEAR CONTROL OF A SWINGING PENDULUM
    CHUNG, CC
    HAUSER, J
    [J]. AUTOMATICA, 1995, 31 (06) : 851 - 862
  • [6] Coumans E., 2016, Pybullet, a python module for physics simulation for games, robotics and machine learning
  • [7] Deisenroth Marc Peter, 2011, INT C MACH LEARN ICM, P465
  • [8] Duan Yan, 2016, INT C MACH LEARN, P1329, DOI [DOI 10.48550/ARXIV.1604.06778, DOI 10.1109/CVPR.2014.180]
  • [9] Fantoni Isabelle, 2002, Non-linear Control for Underactuated Mechanical Systems
  • [10] MOSAIC model for sensorimotor learning and control
    Haruno, M
    Wolpert, DM
    Kawato, M
    [J]. NEURAL COMPUTATION, 2001, 13 (10) : 2201 - 2220