Path Planning in Unknown Environment with Kernel Smoothing and Reinforcement Learning for Multi-Agent Systems

被引:0
作者
Luviano Cruz, David [1 ]
Yu, Wen [1 ]
机构
[1] IPN, CINVESTAV, Dept Control Automat, Mexico City, DF, Mexico
来源
2015 12TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2015) | 2015年
关键词
CONSENSUS; ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In unknown environment, path planning of multi-agent systems is difficult. The popular methods for the path planning, such as reinforcement learning (RL), do not work for these two cases: unknown environment and multi-agent. In this paper, we use a special intelligent method, kernel smoothing, to estimate the unknown environment, and combine it with the reinforcement learning technique. The advantage of the combination of the reinforcement learning and the kernel smoothing technique is we do not need to repeat RL for the unvisited state. The path planning process has three stages: 1) the reinforcement learning is applied to generate the training samples; 2) the model is trained by the kernel smoothing method; 3) the trained model gives an approximate action to agents. Experiment results show the proposed algorithm can generate desired paths in the unknown environment for multiple agents.
引用
收藏
页数:6
相关论文
共 33 条
[1]   Emotional temporal difference Q-learning signals in multi-agent system cooperation: real case studies [J].
Abdi, Javad ;
Moshiri, Behzad ;
Abdulhai, Baher .
IET INTELLIGENT TRANSPORT SYSTEMS, 2013, 7 (03) :315-326
[2]  
[Anonymous], 2002, An Introduction to MultiAgent Systems
[3]  
[Anonymous], 1998, Learning from data-concepts, theory and methods
[4]  
[Anonymous], 2006, Planning algorithms
[5]   Reinforcement learning-based multi-agent system for network traffic signal control [J].
Arel, I. ;
Liu, C. ;
Urbanik, T. ;
Kohls, A. G. .
IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) :128-135
[6]  
Bell AJ, 1999, COMPUT NEUR MIT, P145
[7]   Multi-agent Path Planning with Multiple Tasks and Distance Constraints [J].
Bhattacharya, Subhrajit ;
Likhachev, Maxim ;
Kumar, Vijay .
2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, :953-959
[8]   Multiagent learning using a variable learning rate [J].
Bowling, M ;
Veloso, M .
ARTIFICIAL INTELLIGENCE, 2002, 136 (02) :215-250
[9]   Artificial Intelligence in the path planning optimization of mobile agent navigation [J].
Brassai, Sandor T. ;
Iantovics, Barna ;
Enachescu, Calin .
INTERNATIONAL CONFERENCE EMERGING MARKETS QUERIES IN FINANCE AND BUSINESS, 2012, 3 :243-250
[10]  
Broomhead D. S., 1988, Complex Systems, V2, P321