Solution to reinforcement learning problems with artificial potential field

被引：0

作者：

Li-juan Xie

Guang-rong Xie

Huan-wen Chen

Xiao-li Li

机构：

[1] Central South University,Institute of Mental Health, Xiangya School of Medicine

[2] Changsha University of Science and Technology,School of Computer and Communication

[3] Hunan College of Information,Department of Computer Engineering

[4] University of Birmingham,School of Computer Science

来源：

Journal of Central South University of Technology | 2008年 / 15卷

关键词：

reinforcement learning; path planning; mobile robot navigation; artificial potential field; virtual water-flow;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A novel method was designed to solve reinforcement learning problems with artificial potential field. Firstly a reinforcement learning problem was transferred to a path planning problem by using artificial potential field(APF), which was a very appropriate method to model a reinforcement learning problem. Secondly, a new APF algorithm was proposed to overcome the local minimum problem in the potential field methods with a virtual water-flow concept. The performance of this new method was tested by a gridworld problem named as key and door maze. The experimental results show that within 45 trials, good and deterministic policies are found in almost all simulations. In comparison with WIERING’s HQ-learning system which needs 20 000 trials for stable solution, the proposed new method can obtain optimal and stable policy far more quickly than HQ-learning. Therefore, the new method is simple and effective to give an optimal solution to the reinforcement learning problem.

引用

页码：552 / 557

页数：5

共 23 条

[1] Kaelbling L. P.(1996)Reinforcement learning: A survey [J] Journal of Artificial Intelligence Research 4 273-285
[2] Littman M. L.(2005)A biologically inspired hierarchical reinforcement learning system [J] Cybernetics and Systems 36 1-44
[3] Moore A. W.(2003)Recent advances in hierarchical reinforcement learning [J] Discrete Event Dynamic Systems: Theory and Applications 13 41-77
[4] Zhou W.(2006)Global path planning approach based on ant colony optimization algorithm [J] Journal of Central South University of Technology 13 707-712
[5] Coggins R.(2007)Robust simultaneous tracking and stabilization of wheeled mobile robots not satisfying nonholonomic constraint [J] Journal of Central South University of Technology 14 537-545
[6] Barto A.(2003)Non-smooth environment modeling and global path planning for mobile robots [J] Journal of Central South University of Technology 10 248-254
[7] Mahadevan S.(1986)Real-time obstacle avoidance for manipulators and mobile robots [J] International Journal of Robotics Research 5 90-98
[8] Wen Z.-q.(2006)Visual navigation and obstacle avoidance using a steering potential function [J] Journal of Robotics and Autonomous Systems 54 288-299
[9] Cai Z.-xing.(1992)Exact robot navigation using artificial potential functions [J] IEEE Transactions on Robotics and Automation 8 501-518
[10] Zhu X.-c.(1998)HQ-learning [J] Adaptive Behavior 6 219-246

← 1 2 3 →