Scalable, MDP-based Planning for Multiple, Cooperating Agents

被引：0

作者：

Redding, Joshua D. ^{[1
]}

Ure, N. Kemal ^{[1
]}

How, Jonathan P. ^{[1
]}

Vavrina, Matthew A. ^{[2
]}

Vian, John ^{[2
]}

机构：

[1] MIT, Aerosp Controls Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] Boeing Res & Technol, Seattle, WA USA

来源：

2012 AMERICAN CONTROL CONFERENCE (ACC) | 2012年

关键词：

DECENTRALIZED CONTROL; COMPLEXITY;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.

引用

页码：6011 / 6016

页数：6

共 50 条

[21] A MDP-based Energy Efficient and Delay Aware Handover Algorithm
Islam, Nahina
Kandeepan, Sithamparanathan
Chavez, Karina Gomez
Scott, James
2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
[22] MDP-Based Reliability Analysis of an Ambient Assisted Living System
Liu, Yan
Gui, Lin
Liu, Yang
FM 2014: FORMAL METHODS, 2014, 8442 : 688 - 702
[23] MDP-Based Network Selection with Reward Optimization in HetNets
CHEN Xin
LI Zhuo
WANG Kai
XING Lei
Chinese Journal of Electronics, 2018, 27 (01) : 183 - 190
[24] Consensus and compromise: Planning in cooperating agents
Clark, R
Grossner, C
Radhakrishnan, T
INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 1996, 5 (01): : 27 - 72
[25] Bond strength between MDP-based cement and translucent zirconia
Minh Le
Larsson, Christel
Papia, Evaggelia
DENTAL MATERIALS JOURNAL, 2019, 38 (03) : 480 - 489
[26] An MDP-based Approximation Method for Goal Constrained Multi-MAV Planning under Action Uncertainty
Liu, Lantao
Michael, Nathan
2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 56 - 62
[27] An MDP-based peer-to-peer search server network
Shen, YP
Lee, DL
WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, 2002, : 269 - 278
[28] An MDP-Based Dynamic Optimization Methodology for Wireless Sensor Networks
Munir, Arslan
Gordon-Ross, Ann
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (04) : 616 - 625
[29] An MDP-Based Handover Decision Algorithm in Hierarchical LTE Networks
Pan, Jun
Zhang, Wenyi
2012 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2012,
[30] MDP-Based Cost Sensitive Classification Using Decision Trees
Maliah, Shlomi
Shani, Guy
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3746 - 3753

← 1 2 3 4 5 →