On Learning Navigation Behaviors for Small Mobile Robots With Reservoir Computing Architectures

被引：54

作者：

Antonelo, Eric Aislan ^{[1
]}

Schrauwen, Benjamin ^{[2
]}

机构：

[1] Univ Fed Santa Catarina, Dept Automat & Syst, Florianopolis, SC, Brazil

[2] Univ Ghent, Dept Elect & Informat Syst, Fac Engn, B-9000 Ghent, Belgium

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2015年 / 26卷 / 04期

关键词：

Echo state network (ESN); goal-directed navigation; recurrent neural networks (RNNs); reinforcement learning (RL); reservoir computing (RC); robot navigation; sensory-motor coupling; COMPUTATIONAL MODEL; SYSTEMS; EVOLUTION; NETWORKS; PLACE; LOCALIZATION;

D O I：

10.1109/TNNLS.2014.2323247

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a general reservoir computing (RC) learning framework that can be used to learn navigation behaviors for mobile robots in simple and complex unknown partially observable environments. RC provides an efficient way to train recurrent neural networks by letting the recurrent part of the network (called reservoir) be fixed while only a linear readout output layer is trained. The proposed RC framework builds upon the notion of navigation attractor or behavior that can be embedded in the high-dimensional space of the reservoir after learning. The learning of multiple behaviors is possible because the dynamic robot behavior, consisting of a sensory-motor sequence, can be linearly discriminated in the high-dimensional nonlinear space of the dynamic reservoir. Three learning approaches for navigation behaviors are shown in this paper. The first approach learns multiple behaviors based on the examples of navigation behaviors generated by a supervisor, while the second approach learns goal-directed navigation behaviors based only on rewards. The third approach learns complex goal-directed behaviors, in a supervised way, using a hierarchical architecture whose internal predictions of contextual switches guide the sequence of basic navigation behaviors toward the goal.

引用

页码：763 / 780

页数：18

共 56 条

[1]

[Anonymous], 2003, J. Mach. Learn. Res.

[2] Event detection and localization for small mobile robots using reservoir computing [J].

Antonelo, E. A. ;

Schrauwen, B. ;

Stroobandt, D. .

NEURAL NETWORKS, 2008, 21 (06) :862-871

[3]

Antonelo E. A., 2011, P 10 BRAZ C COMP INT

[4] Learning slow features with reservoir computing for biologically-inspired robot localization [J].

Antonelo, Eric ;

Schrauwen, Benjamin .

NEURAL NETWORKS, 2012, 25 :178-190

[5]

Antonelo E, 2008, IEEE SYS MAN CYBERN, P1842

[6] Generative modeling of autonomous robots and their environments using reservoir computing [J].

Antonelo, Eric A. ;

Schrauwen, Benjamin ;

Van Campenhout, Jan .

NEURAL PROCESSING LETTERS, 2007, 26 (03) :233-249

[7] Supervised Learning of Internal Models for Autonomous Goal-Oriented Robot Navigation using Reservoir Computing [J].

Antonelo, Eric A. ;

Schrauwen, Benjamin .

2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, :2959-2964

[8]

Antonelo EA, 2006, IEEE IJCNN, P498

[9]

Arkin R.C., 1998, Behavior-based robotics

[10] Cognitive navigation based on nonuniform gabor space sampling, unsupervised growing networks, and reinforcement learning [J].

Arleo, A ;

Smeraldi, F ;

Gerstner, W .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (03) :639-652

← 1 2 3 4 5 6 →