Scalable, MDP-based Planning for Multiple, Cooperating Agents

被引:0
|
作者
Redding, Joshua D. [1 ]
Ure, N. Kemal [1 ]
How, Jonathan P. [1 ]
Vavrina, Matthew A. [2 ]
Vian, John [2 ]
机构
[1] MIT, Aerosp Controls Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Boeing Res & Technol, Seattle, WA USA
关键词
DECENTRALIZED CONTROL; COMPLEXITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
引用
收藏
页码:6011 / 6016
页数:6
相关论文
共 50 条
  • [21] A MDP-based Energy Efficient and Delay Aware Handover Algorithm
    Islam, Nahina
    Kandeepan, Sithamparanathan
    Chavez, Karina Gomez
    Scott, James
    2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
  • [22] MDP-Based Reliability Analysis of an Ambient Assisted Living System
    Liu, Yan
    Gui, Lin
    Liu, Yang
    FM 2014: FORMAL METHODS, 2014, 8442 : 688 - 702
  • [23] MDP-Based Network Selection with Reward Optimization in HetNets
    CHEN Xin
    LI Zhuo
    WANG Kai
    XING Lei
    Chinese Journal of Electronics, 2018, 27 (01) : 183 - 190
  • [24] Consensus and compromise: Planning in cooperating agents
    Clark, R
    Grossner, C
    Radhakrishnan, T
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 1996, 5 (01): : 27 - 72
  • [25] Bond strength between MDP-based cement and translucent zirconia
    Minh Le
    Larsson, Christel
    Papia, Evaggelia
    DENTAL MATERIALS JOURNAL, 2019, 38 (03) : 480 - 489
  • [26] An MDP-based Approximation Method for Goal Constrained Multi-MAV Planning under Action Uncertainty
    Liu, Lantao
    Michael, Nathan
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 56 - 62
  • [27] An MDP-based peer-to-peer search server network
    Shen, YP
    Lee, DL
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, 2002, : 269 - 278
  • [28] An MDP-Based Dynamic Optimization Methodology for Wireless Sensor Networks
    Munir, Arslan
    Gordon-Ross, Ann
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (04) : 616 - 625
  • [29] An MDP-Based Handover Decision Algorithm in Hierarchical LTE Networks
    Pan, Jun
    Zhang, Wenyi
    2012 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2012,
  • [30] MDP-Based Cost Sensitive Classification Using Decision Trees
    Maliah, Shlomi
    Shani, Guy
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3746 - 3753