Online Kernel-Based Learning for Task-Space Tracking Robot Control

Cited by: 20

Authors:

Nguyen-Tuong, Duy [1]

Peters, Jan [1]

Affiliations:

[1] Max Planck Inst Biol Cybernet, Dept Empir Inference, D-72076 Tubingen, Germany

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2012 / Vol. 23 / No. 9

Keywords:

Kernel methods; online learning; real-time learning; robot control; task-space tracking; MODEL;

DOI:

10.1109/TNNLS.2012.2201261

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Task-space control of redundant robot systems based on analytical models is known to be susceptible to modeling errors. Data-driven model learning methods may present an interesting alternative approach. However, learning models for task-space tracking control from sampled data is an ill-posed problem. In particular, the same input data point can yield many different output values, which can form a nonconvex solution space. Because the problem is ill-posed, models cannot be learned from such data using common regression methods. While learning of task-space control mappings is globally ill-posed, it has been shown in recent work that it is locally a well-defined problem. In this paper, we use this insight to formulate a local kernel-based learning approach for online model learning for task-space tracking control. We propose a parametrization for the local model, which makes an application in task-space tracking control of redundant robots possible. The model parametrization further allows us to apply the kernel trick and, therefore, enables a formulation within the kernel learning framework. In our evaluations, we show the ability of the method for online model learning for task-space tracking control of redundant robots.
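The nonconvexity mentioned in the abstract can be illustrated with a small sketch. It is not taken from the paper: the three-link planar arm, the unit link lengths, the chosen postures, and the helper name `forward_kinematics` are all assumptions made for illustration. Two different joint configurations reach the same end-effector position, but their average (which is what a standard least-squares regressor would tend toward) does not.

```python
import numpy as np


def forward_kinematics(q, link_lengths=(1.0, 1.0, 1.0)):
    """End-effector position (x, y) of a planar serial arm.

    Hypothetical helper: q holds relative joint angles in radians,
    link_lengths are assumed unit lengths.
    """
    angles = np.cumsum(q)  # absolute orientation of each link
    lengths = np.asarray(link_lengths)
    return np.array([np.sum(lengths * np.cos(angles)),
                     np.sum(lengths * np.sin(angles))])


# Two postures of a redundant arm (3 joints, 2D position task).
q_a = np.array([np.pi / 3, -np.pi / 3, -np.pi / 3])
q_b = -q_a  # mirrored posture; reaches the same point on the x-axis

# Averaging the two valid solutions, as a global regressor would.
q_mean = 0.5 * (q_a + q_b)

target = forward_kinematics(q_a)
error_b = np.linalg.norm(forward_kinematics(q_b) - target)
error_mean = np.linalg.norm(forward_kinematics(q_mean) - target)

print("target position:", target)
print("error of second posture:", error_b)
print("error of averaged posture:", error_mean)
```

In this toy setting both postures hit the target while the averaged posture misses it, which is why the set of valid outputs for one input is not convex and why a local rather than global model is attractive.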

Pages: 1417-1425

Number of pages: 9
