Online Kernel-Based Learning for Task-Space Tracking Robot Control

被引:20
作者
Duy Nguyen-Tuong [1 ]
Peters, Jan [1 ]
机构
[1] Max Planck Inst Biol Cybernet, Dept Empir Inference, D-72076 Tubingen, Germany
关键词
Kernel methods; online learning; real-time learning; robot control; task-space tracking; MODEL;
D O I
10.1109/TNNLS.2012.2201261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Task-space control of redundant robot systems based on analytical models is known to be susceptive to modeling errors. Data-driven model learning methods may present an interesting alternative approach. However, learning models for task-space tracking control from sampled data is an ill-posed problem. In particular, the same input data point can yield many different output values, which can form a nonconvex solution space. Because the problem is ill-posed, models cannot be learned from such data using common regression methods. While learning of task-space control mappings is globally ill-posed, it has been shown in recent work that it is locally a well-defined problem. In this paper, we use this insight to formulate a local kernel-based learning approach for online model learning for task-space tracking control. We propose a parametrization for the local model, which makes an application in task-space tracking control of redundant robots possible. The model parametrization further allows us to apply the kernel-trick and, therefore, enables a formulation within the kernel learning framework. In our evaluations, we show the ability of the method for online model learning for task-space tracking control of redundant robots.
引用
收藏
页码:1417 / 1425
页数:9
相关论文
共 30 条
[1]  
[Anonymous], 2006, ROBOT DYNAMICS CONTR
[2]  
[Anonymous], 1996, MATRIX COMPUTATION
[3]  
[Anonymous], 2010, IEEE PES T
[4]  
Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014
[5]  
Bhushan N, 1999, ADV NEUR IN, V11, P3
[6]   Adaptive Learning in Complex Reproducing Kernel Hilbert Spaces Employing Wirtinger's Subgradients [J].
Bouboulis, Pantelis ;
Slavakis, Konstantinos ;
Theodoridis, Sergios .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (03) :425-438
[7]   Quantized Kernel Least Mean Square Algorithm [J].
Chen, Badong ;
Zhao, Songlin ;
Zhu, Pingping ;
Principe, Jose C. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (01) :22-32
[8]  
D'Souza A, 2001, IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, P298, DOI 10.1109/IROS.2001.973374
[9]   MOSAIC model for sensorimotor learning and control [J].
Haruno, M ;
Wolpert, DM ;
Kawato, M .
NEURAL COMPUTATION, 2001, 13 (10) :2201-2220
[10]  
JORDAN MI, 1992, COGNITIVE SCI, V16, P307, DOI 10.1207/s15516709cog1603_1