Multi-agent Polygon Formation using Reinforcement Learning

Cited by: 5

Authors:

Prasad, B. K. Swathi [1]

Manjunath, Aditya G. [2]

Ramasangu, Hariharan [3]

Affiliations:

[1] MS Ramaiah Univ Appl Sci, Dept Elect Engn, Bangalore, Karnataka, India

[2] MS Ramaiah Univ Appl Sci, Dept Comp Sci & Engn, Bangalore, Karnataka, India

[3] MS Ramaiah Univ Appl Sci, Dept Elect & Commun Engn, Bangalore, Karnataka, India

Source:

ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1 | 2017

Keywords:

Formation; Pattern; Q-learning; Algorithm; Episode;

DOI:

10.5220/0006187001590165

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This work details a simulation experiment and analysis of Q-learning applied to multi-agent systems. Six agents interact within the environment to form a hexagon, a square and a triangle by reaching their specific goal states. In the proposed approach, the agents form a hexagon, and the maximum dimension of this pattern is then reduced to form patterns with smaller dimensions. A decentralised approach to controlling the agents via Q-learning was adopted, which reduced complexity. The agents are able to move forward, backward or sideways based on the decision taken. Finally, the Q-learning action-reward system was designed so that the agents could exploit it: they earn high rewards for correct actions and negative rewards for the opposite.
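The decentralised scheme described in the abstract can be illustrated with a minimal tabular Q-learning sketch in which each agent learns its own table towards its own goal cell. This is not the authors' implementation: the grid size, the reward values (+10 at the goal, -1 otherwise), the learning parameters, and the names `step`, `train_agent` and `greedy_path` are all assumptions made for illustration, and collisions between agents are ignored.

```python
import random

# Hypothetical action set: the abstract only says forward, backward and sideways.
ACTIONS = {"forward": (0, 1), "backward": (0, -1), "left": (-1, 0), "right": (1, 0)}
GRID = 5  # assumed grid size, not taken from the paper


def step(state, action, goal):
    """Move one cell (clamped to the grid) and return (next_state, reward)."""
    dx, dy = ACTIONS[action]
    x = min(max(state[0] + dx, 0), GRID - 1)
    y = min(max(state[1] + dy, 0), GRID - 1)
    nxt = (x, y)
    # Assumed reward scheme: high reward at the goal, negative reward otherwise.
    reward = 10.0 if nxt == goal else -1.0
    return nxt, reward


def train_agent(start, goal, episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn one agent's Q-table independently of the other agents."""
    rng = random.Random(seed)
    q = {}  # maps (state, action) -> estimated value, default 0.0
    for _ in range(episodes):
        state = start
        for _ in range(100):  # cap on steps per episode
            if rng.random() < epsilon:
                action = rng.choice(list(ACTIONS))  # explore
            else:
                action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))  # exploit
            nxt, reward = step(state, action, goal)
            # The goal is terminal, so it contributes no future value.
            best_next = 0.0 if nxt == goal else max(q.get((nxt, a), 0.0) for a in ACTIONS)
            old = q.get((state, action), 0.0)
            # Standard Q-learning update.
            q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if state == goal:
                break
    return q


def greedy_path(q, start, goal, max_steps=50):
    """Follow the learned greedy policy from start until the goal or the step cap."""
    state, path = start, [start]
    while state != goal and len(path) <= max_steps:
        action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
        state, _ = step(state, action, goal)
        path.append(state)
    return path
```

Under these assumptions, a formation would be obtained by calling `train_agent` once per agent with that agent's goal vertex, then following `greedy_path` for each agent; shrinking the pattern would amount to assigning goal cells closer together.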

Pages: 159-165

Page count: 7
