COOPERATIVE MULTI-ROBOT SYSTEM FOR INFRASTRUCTURE SECURITY TASKS

被引：0

作者：

Hernandez, Erik ^{[1
]}

Barrientos, Antonio ^{[1
]}

Rossi, Claudio ^{[1
]}

del Cerro, Jaime ^{[1
]}

机构：

[1] Tech Univ Madrid, Ctr Robot & Automat, Madrid, Spain

来源：

ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL. 2 | 2012年

关键词：

Reinforcement learning; Robotics security; Infrastructure security tasks; Multi-robot systems;

D O I：

10.5220/0003721403130316

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As a result of terrorist attacks in the last years; new efforts have raised trying to solve challenges related to security task automation using robotic platforms. In this paper we present the results of a cooperative multi-robot approach for infrastructure security applications at critical facilities. We formulate our problem using a M. Pac-Mac like environment. In this implementation, multiple robotic agents define policies with the objective to increase the number of explored states in a grid world. This is through the application of the off-policy learning algorithm from reinforcement learning area, known as Q-learning. We validate experimentally our approach with a group of agents learning a patrol task and we present results obtained in simulated environments.

引用

页码：313 / 316

页数：4

共 11 条

[1] Theoretical analysis of the multi-agent patrolling problem
Chevaleyre, Y
[J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 302 - 308
[2] Elmaliach Y., 2007, ANN MATH ARTIFICIAL
[3] Everett H., 2003, INSTRUMENTATION MEAS
[4] Kaelbling L., 1996, CS9605103 ARXIV
[5] Khamis A., 2006, SYST MAN CYB 2006 SM, V2, P1499
[6] Luo R., 2009, ROBOTICS AUTONOMOUS
[7] Oates R., 2009, 3909 ICRA
[8] Puterman M.L., 2008, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley Series in Probability and Statistics
[9] Roman-Ballesteros I., 2006, ROBOTICS ELECT
[10] Sutton R., 1998, Reinforcement learning: An introduction, V116

← 1 2 →