Scalable, MDP-based Planning for Multiple, Cooperating Agents

被引:0
|
作者
Redding, Joshua D. [1 ]
Ure, N. Kemal [1 ]
How, Jonathan P. [1 ]
Vavrina, Matthew A. [2 ]
Vian, John [2 ]
机构
[1] MIT, Aerosp Controls Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Boeing Res & Technol, Seattle, WA USA
关键词
DECENTRALIZED CONTROL; COMPLEXITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
引用
收藏
页码:6011 / 6016
页数:6
相关论文
共 50 条
  • [31] Proactive MDP-based Collision Avoidance Algorithm for Autonomous Cars
    Osipychev, Denis
    Tran, Duy
    Sheng, Weihua
    Chowdhary, Girish
    Zeng, Ruili
    2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 983 - 988
  • [32] MDP-based dynamic update of aggregated information for hierarchical QoS routing
    Chang, BJ
    Hwang, RH
    JOINT 4TH IEEE INTERNATIONAL CONFERENCE ON ATM (ICATM'01) AND HIGH SPEED INTELLIGENT INTERNET SYMPOSIUM, 2001, : 113 - 117
  • [33] MDP-Based Budget Allocation for Efficient Cooperative Task Solving
    JIANG Huan
    ZUO Min
    MATSUBARA Shigeo
    Chinese Journal of Electronics, 2017, 26 (05) : 966 - 972
  • [34] MDP-based Vehicular Network Connectivity Model for VCC Management
    Saad, Abubakar
    De Grande, Robson E.
    PROCEEDINGS OF THE 2020 IEEE/ACM 24TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2020, : 214 - 221
  • [35] SOBA: Session optimal MDP-based network friendly recommendations
    Giannakas, Theodoros
    Giovanidis, Anastasios
    Spyropoulos, Thrasyvoulos
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
  • [36] MDP-Based Handover In Heterogeneous Ultra-Dense Networks
    Khodmi, Amel
    Ben Rejeb, Sonia
    Nasser, Nidal
    Choukair, Zied
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 349 - 352
  • [37] Method for designating the wind condition in MDP-based motion planning of under-actuated blimp type UAV
    Kawano, Hiroshi
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 1049 - 1055
  • [38] Paucity of Nanolayering in Resin-Dentin Interfaces of MDP-based Adhesives
    Tian, F.
    Zhou, L.
    Zhang, Z.
    Niu, L.
    Zhang, L.
    Chen, C.
    Zhou, J.
    Yang, H.
    Wang, X.
    Fu, B.
    Huang, C.
    Pashley, D. H.
    Tay, F. R.
    JOURNAL OF DENTAL RESEARCH, 2016, 95 (04) : 380 - 387
  • [39] MDP-Based Model for Interest Scheduling in IoT-NDN Environment
    Muralidharan, Shapna
    Roy, Abhishek
    Saxena, Navrati
    IEEE COMMUNICATIONS LETTERS, 2018, 22 (02) : 232 - 235
  • [40] MDP-Based MAC Design With Deterministic Backoffs in Virtualized 802.11 WLANs
    Shoaei, Atoosa Dalili
    Derakhshani, Mahsa
    Parsaeifard, Saeedeh
    Le-Ngoc, Tho
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (09) : 7754 - 7759