Multi-agent Polygon Formation using Reinforcement Learning

Cited by: 5
Authors
Prasad, B. K. Swathi [1]
Manjunath, Aditya G. [2]
Ramasangu, Hariharan [3]
Affiliations
[1] MS Ramaiah Univ Appl Sci, Dept Elect Engn, Bangalore, Karnataka, India
[2] MS Ramaiah Univ Appl Sci, Dept Comp Sci & Engn, Bangalore, Karnataka, India
[3] MS Ramaiah Univ Appl Sci, Dept Elect & Commun Engn, Bangalore, Karnataka, India
Source
ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1 | 2017
Keywords
Formation; Pattern; Q-learning; Algorithm; Episode
DOI
10.5220/0006187001590165
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work details a simulation experiment and analysis of Q-learning applied to multi-agent systems. Six agents interact within the environment to form a hexagon, a square and a triangle by reaching their specific goal states. In the proposed approach, the agents first form a hexagon, and the maximum dimension of this pattern is then reduced to form patterns with smaller dimensions. A decentralised approach to controlling the agents via Q-learning was adopted, which reduced complexity. The agents can move forward, backward or sideways based on the decision taken. Finally, the Q-learning action-reward system was designed so that the agents could exploit it: they earn high rewards for correct actions and negative rewards for incorrect ones.
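The decentralised scheme summarised in the abstract can be illustrated with a minimal tabular Q-learning sketch, in which each agent learns its own table for its own goal state. This is not the authors' implementation: the grid size, the reward values (+10 at the goal, -1 per other move), the hyperparameters, and the names `step`, `train_agent` and `greedy_path` are all assumptions made for illustration.

```python
import random

# Assumed action set: forward, backward, and two sideways moves on a grid.
ACTIONS = {"forward": (0, 1), "backward": (0, -1), "left": (-1, 0), "right": (1, 0)}
GRID = 5  # assumed grid size, not taken from the paper


def step(state, action, goal):
    """Apply one move, clamped to the grid; return (next_state, reward, done)."""
    dx, dy = ACTIONS[action]
    x = min(max(state[0] + dx, 0), GRID - 1)
    y = min(max(state[1] + dy, 0), GRID - 1)
    nxt = (x, y)
    if nxt == goal:
        return nxt, 10.0, True   # assumed high reward for reaching the goal
    return nxt, -1.0, False      # assumed negative reward for any other move


def train_agent(start, goal, episodes=1000, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table for one agent, independently of the other agents."""
    rng = random.Random(seed)
    names = list(ACTIONS)
    q = {}  # maps (state, action) to an estimated value; missing entries count as 0
    for _episode in range(episodes):
        state = start
        for _t in range(100):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.choice(names)  # explore
            else:
                action = max(names, key=lambda a: q.get((state, a), 0.0))  # exploit
            nxt, reward, done = step(state, action, goal)
            best_next = 0.0 if done else max(q.get((nxt, a), 0.0) for a in names)
            old = q.get((state, action), 0.0)
            # Standard Q-learning update rule.
            q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if done:
                break
    return q


def greedy_path(q, start, goal, max_steps=50):
    """Follow the highest-valued action from start until the goal or the step cap."""
    path, state = [start], start
    for _t in range(max_steps):
        if state == goal:
            break
        action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
        state, _reward, _done = step(state, action, goal)
        path.append(state)
    return path


if __name__ == "__main__":
    # Hypothetical goal states for three agents forming a triangle.
    goals = {"agent_a": (2, 4), "agent_b": (0, 1), "agent_c": (4, 1)}
    for name, goal in goals.items():
        table = train_agent((2, 2), goal)
        print(name, greedy_path(table, (2, 2), goal))
```

Because each agent trains against only its own goal, no joint state-action table over all agents is needed, which is one way to read the reduced complexity the abstract attributes to the decentralised approach.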
Pages: 159-165
Page count: 7