H-MAS Architecture and Reinforcement Learning method for autonomous robot path planning

被引:0
作者
Lamini, Chaymaa [1 ]
Fathi, Youssef [1 ]
Benhlima, Said [1 ]
机构
[1] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, MACS Lab, Meknes, Morocco
来源
2017 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV) | 2017年
关键词
path planning; machine learning; reinforcement learning; Q-learning; multi agent system; MOBILE ROBOT; NAVIGATION; OPTIMIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work provides a novel collaborative Q-Learning method using a Holonic Multi Agent System (H-MAS), with the use of two types of Q-table; Q-Master Table (QMT) and Q-Embedded Table (QET) instead of the classical use. The QMT is updated by a collaborative policy between head holons. Thus the QET is updated based on the individual decision of each agent, so that each and every robot in the environment can use and profit not only from its own experiences but also from the experiences of other agents. This method is supported by a H-MAS architecture based on holonic and internal agents, and a Fuzzy Logic System, in order to solve the problem of path planning for a mobile robot in a partially or completely unknown environment.
引用
收藏
页数:7
相关论文
共 32 条
[1]  
A. FIPA, 2002, FIP ACL MESS STRUCT
[2]  
Al-Sagban M, 2012, I C CONT AUTOMAT ROB, P353, DOI 10.1109/ICARCV.2012.6485184
[3]  
Al-Taharwa Ismail, 2008, Journal of Computer Sciences, V4, P341, DOI 10.3844/jcssp.2008.341.344
[4]  
[Anonymous], 1998, P FOIS
[5]  
[Anonymous], 2012, ROBOT MOTION PLANNIN
[6]  
[Anonymous], 1969, SPEECH ACTS ESSAY PH
[7]  
[Anonymous], 2007, Supervised machine learning: A review of classification techniques
[8]  
Bellifemine F., 2001, Proceedings of the Fifth International Conference on Autonomous Agents, P216, DOI 10.1145/375735.376120
[9]   Autonomous robot navigation using adaptive potential fields [J].
Cosío, FA ;
Castañeda, MAP .
MATHEMATICAL AND COMPUTER MODELLING, 2004, 40 (9-10) :1141-1156
[10]   Intelligent-based multi-robot path planning inspired by improved classical Q-learning and improved particle swarm optimization with perturbed velocity [J].
Das, P. K. ;
Behera, H. S. ;
Panigrahi, B. K. .
ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2016, 19 (01) :651-669