Online Kernel-Based Learning for Task-Space Tracking Robot Control

Cited by: 20
Authors
Nguyen-Tuong, Duy [1]
Peters, Jan [1]
Affiliations
[1] Max Planck Inst Biol Cybernet, Dept Empir Inference, D-72076 Tubingen, Germany
Keywords
Kernel methods; online learning; real-time learning; robot control; task-space tracking; MODEL;
DOI
10.1109/TNNLS.2012.2201261
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Task-space control of redundant robot systems based on analytical models is known to be susceptible to modeling errors. Data-driven model learning methods may present an interesting alternative approach. However, learning models for task-space tracking control from sampled data is an ill-posed problem. In particular, the same input data point can yield many different output values, which can form a nonconvex solution space. Because the problem is ill-posed, models cannot be learned from such data using common regression methods. While learning of task-space control mappings is globally ill-posed, it has been shown in recent work that it is locally a well-defined problem. In this paper, we use this insight to formulate a local kernel-based learning approach for online model learning for task-space tracking control. We propose a parametrization for the local model that makes it applicable to task-space tracking control of redundant robots. The model parametrization further allows us to apply the kernel trick and, therefore, enables a formulation within the kernel learning framework. In our evaluations, we demonstrate that the method can learn models online for task-space tracking control of redundant robots.
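The nonconvexity argument in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the planar three-link arm, the unit link lengths, the position-level task, and the function names `forward_kinematics` and `distance` are all assumptions chosen for illustration. Two different joint configurations reach the same end-effector position, yet their average (what a global least-squares regressor would tend to predict for that input) does not.

```python
import math


def forward_kinematics(q, link_lengths=(1.0, 1.0, 1.0)):
    """End-effector (x, y) of a planar serial arm with joint angles q in radians.

    The three unit-length links are an illustrative assumption.
    """
    x = y = 0.0
    angle = 0.0
    for q_i, l_i in zip(q, link_lengths):
        angle += q_i  # joint angles accumulate along the chain
        x += l_i * math.cos(angle)
        y += l_i * math.sin(angle)
    return x, y


def distance(p, r):
    """Euclidean distance between two 2-D points."""
    return math.hypot(p[0] - r[0], p[1] - r[1])


theta = math.pi / 3

# Two mirrored configurations of the redundant arm (3 joints, 2-D task).
q_a = (theta, -2 * theta, theta)
q_b = (-theta, 2 * theta, -theta)

# Averaging the two valid outputs, as a global regressor effectively would.
q_mean = tuple((a + b) / 2 for a, b in zip(q_a, q_b))

target = forward_kinematics(q_a)
error_b = distance(target, forward_kinematics(q_b))
error_mean = distance(target, forward_kinematics(q_mean))

print(f"target position:        ({target[0]:.3f}, {target[1]:.3f})")
print(f"error of second config: {error_b:.3f}")
print(f"error of averaged config: {error_mean:.3f}")
```

Both configurations are valid solutions for the same target, but their mean is not, because the set of solutions is not convex. Restricting the data to a neighborhood of one configuration removes this ambiguity, which is the sense in which the abstract calls the problem locally well-defined.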
Pages: 1417-1425
Number of pages: 9
Related papers
30 in total
  • [1] [Anonymous], 2006, ROBOT DYNAMICS CONTR
  • [2] [Anonymous], 1996, MATRIX COMPUTATION
  • [3] [Anonymous], 2010, IEEE PES T
  • [4] Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014
  • [5] Bhushan N, 1999, ADV NEUR IN, V11, P3
  • [6] Adaptive Learning in Complex Reproducing Kernel Hilbert Spaces Employing Wirtinger's Subgradients
    Bouboulis, Pantelis
    Slavakis, Konstantinos
    Theodoridis, Sergios
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (03) : 425 - 438
  • [7] Quantized Kernel Least Mean Square Algorithm
    Chen, Badong
    Zhao, Songlin
    Zhu, Pingping
    Principe, Jose C.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (01) : 22 - 32
  • [8] D'Souza A, 2001, IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, P298, DOI 10.1109/IROS.2001.973374
  • [9] MOSAIC model for sensorimotor learning and control
    Haruno, M
    Wolpert, DM
    Kawato, M
    [J]. NEURAL COMPUTATION, 2001, 13 (10) : 2201 - 2220
  • [10] JORDAN MI, 1992, COGNITIVE SCI, V16, P307, DOI 10.1207/s15516709cog1603_1