Scalable, MDP-based Planning for Multiple, Cooperating Agents

Cited by: 0
Authors
Redding, Joshua D. [1]
Ure, N. Kemal [1]
How, Jonathan P. [1]
Vavrina, Matthew A. [2]
Vian, John [2]
Affiliations
[1] MIT, Aerosp Controls Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Boeing Res & Technol, Seattle, WA USA
Keywords
DECENTRALIZED CONTROL; COMPLEXITY;
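DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract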
This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which greatly reduced the planning space but still scaled exponentially (in n - 1 rather than in n, where n is the number of agents). This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that although the resulting performance drops by nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
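The scaling argument in the abstract can be illustrated with a rough count of planner states. The sketch below is not the authors' implementation. It assumes, purely for illustration, that every agent has the same number of local states, that each approximated teammate contributes a small fixed number of states, and that the aggregated teammate entity has a fixed number of states that does not grow with n. The function names and the figures 100 and 4 are hypothetical.

```python
def centralized_size(n, local_states):
    """Joint state count when a single planner fully enumerates all n agents."""
    return local_states ** n


def per_teammate_size(n, local_states, approx_states):
    """State count of one agent's planner when each of its n - 1 teammates
    is approximated individually (exponential in n - 1)."""
    return local_states * approx_states ** (n - 1)


def aggregated_size(n, local_states, aggregate_states):
    """Total state count over all n planners when each planner pairs its
    full local model with one abstracted teammate entity (linear in n,
    under the assumption that aggregate_states is independent of n)."""
    return n * local_states * aggregate_states


if __name__ == "__main__":
    # Hypothetical sizes: 100 local states, 4 states per approximated or
    # aggregated teammate model.
    for n in (2, 3, 5, 10):
        print(
            n,
            centralized_size(n, 100),
            per_teammate_size(n, 100, 4),
            aggregated_size(n, 100, 4),
        )
```

Under these assumed sizes, doubling the team doubles the aggregated count, while the other two counts grow by a multiplicative factor for every added agent.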
Pages: 6011-6016
Page count: 6