COOPERATIVE MULTI-ROBOT SYSTEM FOR INFRASTRUCTURE SECURITY TASKS

被引:0
作者
Hernandez, Erik [1 ]
Barrientos, Antonio [1 ]
Rossi, Claudio [1 ]
del Cerro, Jaime [1 ]
机构
[1] Tech Univ Madrid, Ctr Robot & Automat, Madrid, Spain
来源
ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL. 2 | 2012年
关键词
Reinforcement learning; Robotics security; Infrastructure security tasks; Multi-robot systems;
D O I
10.5220/0003721403130316
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a result of terrorist attacks in the last years; new efforts have raised trying to solve challenges related to security task automation using robotic platforms. In this paper we present the results of a cooperative multi-robot approach for infrastructure security applications at critical facilities. We formulate our problem using a M. Pac-Mac like environment. In this implementation, multiple robotic agents define policies with the objective to increase the number of explored states in a grid world. This is through the application of the off-policy learning algorithm from reinforcement learning area, known as Q-learning. We validate experimentally our approach with a group of agents learning a patrol task and we present results obtained in simulated environments.
引用
收藏
页码:313 / 316
页数:4
相关论文
共 11 条
  • [1] Theoretical analysis of the multi-agent patrolling problem
    Chevaleyre, Y
    [J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 302 - 308
  • [2] Elmaliach Y., 2007, ANN MATH ARTIFICIAL
  • [3] Everett H., 2003, INSTRUMENTATION MEAS
  • [4] Kaelbling L., 1996, CS9605103 ARXIV
  • [5] Khamis A., 2006, SYST MAN CYB 2006 SM, V2, P1499
  • [6] Luo R., 2009, ROBOTICS AUTONOMOUS
  • [7] Oates R., 2009, 3909 ICRA
  • [8] Puterman M.L., 2008, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley Series in Probability and Statistics
  • [9] Roman-Ballesteros I., 2006, ROBOTICS ELECT
  • [10] Sutton R., 1998, Reinforcement learning: An introduction, V116