Learning Adaptive Escape Behavior for Wheel-Legged Robot by Inner Torque Information

Cited by: 0

Authors:

Nishimura, Yuki [1]

Mikami, Sadayoshi [1]

Affiliations:

[1] Future Univ Hakodate, Dept Complex & Intelligent Syst, Hakodate, Hokkaido, Japan

Source:

2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS) | 2016

Keywords:

Reinforcement Learning; Wheel-Legged Robot; Escaping Actions; Rough Terrain; Autonomous System;

DOI:

10.1109/SCIS&ISIS.2016.149

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Autonomous robots used for rescue or exploration need to work in unknown environments. Such robots should select actions appropriate to their environments. In this research, we develop a wheel-legged robot that uses reinforcement learning to acquire better actions in an unknown environment. We use the values of the external force measured on the robot's legs to define both states and rewards. For quick convergence, the numbers of states and actions are reduced by exploiting the characteristics of the robot's structure. To evaluate the performance of our learning system, we carried out experiments in a simulator built on a physics engine. The results of the experiments show the effectiveness of our system.
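The abstract does not name the learning algorithm or give the state encoding, so the following is only a minimal sketch of how force readings on the legs could serve as both state and reward in a tabular Q-learning loop. The choice of Q-learning, the four-leg layout, the threshold, the action names, and the reward definition are all assumptions made for illustration and are not taken from the paper.

```python
import random
from collections import defaultdict

# All constants below are hypothetical and chosen only for illustration.
FORCE_THRESHOLD = 5.0  # assumed cut-off separating "loaded" from "unloaded" legs
ACTIONS = ("lift_front", "lift_rear", "push_forward")  # assumed action set


def discretize(forces, threshold=FORCE_THRESHOLD):
    """Reduce per-leg force readings to one bit per leg (a coarse state)."""
    return tuple(int(f > threshold) for f in forces)


def reward_from_forces(forces):
    """Assumed reward: smaller total force magnitude is treated as better."""
    return -sum(abs(f) for f in forces)


def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy selection over the tabular action values."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One-step tabular Q-learning update."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error


# Usage: one update from a single (hypothetical) force measurement.
q_table = defaultdict(float)
forces = [6.0, 1.0, 0.0, 7.5]
state = discretize(forces)
q_update(q_table, state, "lift_front", reward_from_forces(forces), state)
```

Thresholding each leg to a single bit is one way to keep the state count small, in the spirit of the reduction the abstract mentions; the paper's actual reduction relies on the robot's structure and may differ.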

Pages: 10 / 15

Page count: 6
