Online Action Learning using Kernel Density Estimation for Quick Discovery of Good Parameters for Peg-in-Hole Insertion

被引：4

作者：

Sorensen, Lars Caroe ^{[1
]}

Buch, Jacob Porksen ^{[1
]}

Petersen, Henrik Gordon ^{[1
]}

Kraft, Dirk ^{[1
]}

机构：

[1] Univ Southern Denmark, Maersk Mc Kinney Moller Inst, SDURobot, Campusvej 55, Odense, Denmark

来源：

ICINCO: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 2 | 2016年

关键词：

Learning and Adaptive Systems; Compliant Assembly; Intelligent and Flexible Manufacturing; MULTIARMED BANDIT;

D O I：

10.5220/0005958801660177

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning action parameters is becoming an ever more important topic in industrial assembly with tendencies towards smaller batch sizes, more required flexibility and process uncertainties. This paper presents a statistical online learning method capable of handling these issues. The method uses elimination of unpromising parameter sets to reduce the elements of the discretised sample space (inspired by Action Elimination) based on regression uncertainty. Kernel Density Estimation and Wilson Score are explored as internal representations. Based on a dynamic simulator setup for a real world Peg-in-Hole problem, it is shown that the presented method can drastically reduce the number of samples needed. Furthermore, it is also shown that the solution obtained in simulation by our learning method succeeds when executed on the corresponding real world setup.

引用

页码：166 / 177

页数：12

共 24 条

[1] Approximate is better than "exact" for interval estimation of binomial proportions [J].

Agresti, A ;

Coull, BA .

AMERICAN STATISTICIAN, 1998, 52 (02) :119-126

[2]

[Anonymous], 1986, MONOGR STAT APPL PRO

[3]

[Anonymous], 2011, Paladyn, DOI DOI 10.2478/S13230-011-0012-X

[4]

[Anonymous], 2011, FDN TRENDS ROBOT, DOI DOI 10.1561/2300000021

[5]

[Anonymous], CORR

[6]

[Anonymous], 1998, Reinforcement learning: An introduction

[7] Finite-time analysis of the multiarmed bandit problem [J].

Auer, P ;

Cesa-Bianchi, N ;

Fischer, P .

MACHINE LEARNING, 2002, 47 (2-3) :235-256

[8] An Adaptable Robot Vision System Performing Manipulation Actions With Flexible Objects [J].

Bodenhagen, Leon ;

Fugl, Andreas R. ;

Jordt, Andreas ;

Willatzen, Morten ;

Andersen, Knud A. ;

Olsen, Martin M. ;

Koch, Reinhard ;

Petersen, Henrik G. ;

Kruger, Norbert .

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (03) :749-765

[9] Applying simulation and a domain-specific language for an adaptive action library [J].

Buch, Jacob Pørksen ;

Laursen, Johan Sund ;

Sørensen, Lars Carøe ;

Ellekilde, Lars-Peter ;

Kraft, Dirk ;

Schultz, Ulrik Pagh ;

Petersen, Henrik Gordon .

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8810 :86-97

[10]

EU Robotics AISBL, 2014, ROB 2020 MULT ROADM

← 1 2 3 →