Scalable, MDP-based Planning for Multiple, Cooperating Agents

被引:0
|
作者
Redding, Joshua D. [1 ]
Ure, N. Kemal [1 ]
How, Jonathan P. [1 ]
Vavrina, Matthew A. [2 ]
Vian, John [2 ]
机构
[1] MIT, Aerosp Controls Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Boeing Res & Technol, Seattle, WA USA
关键词
DECENTRALIZED CONTROL; COMPLEXITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
引用
收藏
页码:6011 / 6016
页数:6
相关论文
共 50 条
  • [1] MDP-Based Outpatient Scheduling for Multiple Examinations
    Liu, Yang
    Geng, Na
    Zhu, Yanhong
    2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2015, : 1312 - 1317
  • [2] MDP-based Motion Planning for Grasping in Dynamic Scenarios
    Mueller, Steffen
    Stephan, Benedict
    Gross, Horst-Michael
    10TH EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR 2021), 2021,
  • [3] A Scalable MDP-based Sensing and Processing Framework for Vehicular Networks
    Chattopadhyay, Rajarshi
    Tham, Chen-Khong
    2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2019, : 687 - 692
  • [4] An MDP-based recommender system
    Shani, G
    Heckerman, D
    Brafman, RI
    JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 1265 - 1295
  • [5] MDP-Based Mission Planning for Multi-UAV Persistent Surveillance
    Jeong, Byeong-Min
    Ha, Jung-Su
    Choi, Han-Lim
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 831 - 834
  • [6] MDP-based Network Friendly Recommendations
    Giannakas, Theodoros
    Giovanidis, Anastasios
    Spyropoulos, Thrasyvoulos
    ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS, 2021, 6 (04)
  • [7] An MDP-based approach to online mechanism design
    Parkes, DC
    Singh, S
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 791 - 798
  • [8] Deep reinforcement learning for layout planning - An MDP-based approach for the facility layout problem
    Heinbach, Benjamin
    Burggraef, Peter
    Wagner, Johannes
    MANUFACTURING LETTERS, 2023, 38 : 40 - 43
  • [9] Empirical Evaluation of MDP-based DASH Player
    Bokani, Ayub
    Hoseini, S. Amir
    Hassan, Mahbub
    Kanhere, Salil S.
    25TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC 2015), 2015, : 332 - 337
  • [10] THE USE OF MDP-BASED MATERIALS FOR BONDING TO ZIRCONIA
    de Souza, Grace
    Hennig, Diana
    Aggarwal, Anuj
    Tam, Laura E.
    JOURNAL OF PROSTHETIC DENTISTRY, 2014, 112 (04): : 895 - 902