Scalable, MDP-based Planning for Multiple, Cooperating Agents

被引：0

作者：

Redding, Joshua D. ^{[1
]}

Ure, N. Kemal ^{[1
]}

How, Jonathan P. ^{[1
]}

Vavrina, Matthew A. ^{[2
]}

Vian, John ^{[2
]}

机构：

[1] MIT, Aerosp Controls Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] Boeing Res & Technol, Seattle, WA USA

来源：

2012 AMERICAN CONTROL CONFERENCE (ACC) | 2012年

关键词：

DECENTRALIZED CONTROL; COMPLEXITY;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.

引用

页码：6011 / 6016

页数：6

共 50 条

[1] MDP-Based Outpatient Scheduling for Multiple Examinations
Liu, Yang
Geng, Na
Zhu, Yanhong
2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2015, : 1312 - 1317
[2] MDP-based Motion Planning for Grasping in Dynamic Scenarios
Mueller, Steffen
Stephan, Benedict
Gross, Horst-Michael
10TH EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR 2021), 2021,
[3] A Scalable MDP-based Sensing and Processing Framework for Vehicular Networks
Chattopadhyay, Rajarshi
Tham, Chen-Khong
2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2019, : 687 - 692
[4] An MDP-based recommender system
Shani, G
Heckerman, D
Brafman, RI
JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 1265 - 1295
[5] MDP-Based Mission Planning for Multi-UAV Persistent Surveillance
Jeong, Byeong-Min
Ha, Jung-Su
Choi, Han-Lim
2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 831 - 834
[6] MDP-based Network Friendly Recommendations
Giannakas, Theodoros
Giovanidis, Anastasios
Spyropoulos, Thrasyvoulos
ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS, 2021, 6 (04)
[7] An MDP-based approach to online mechanism design
Parkes, DC
Singh, S
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 791 - 798
[8] Deep reinforcement learning for layout planning - An MDP-based approach for the facility layout problem
Heinbach, Benjamin
Burggraef, Peter
Wagner, Johannes
MANUFACTURING LETTERS, 2023, 38 : 40 - 43
[9] Empirical Evaluation of MDP-based DASH Player
Bokani, Ayub
Hoseini, S. Amir
Hassan, Mahbub
Kanhere, Salil S.
25TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC 2015), 2015, : 332 - 337
[10] THE USE OF MDP-BASED MATERIALS FOR BONDING TO ZIRCONIA
de Souza, Grace
Hennig, Diana
Aggarwal, Anuj
Tam, Laura E.
JOURNAL OF PROSTHETIC DENTISTRY, 2014, 112 (04): : 895 - 902

← 1 2 3 4 5 →