Data-Driven Cooperative Multi-Task Assignment in Heterogeneous Multi-Agent Pursuit-Evade Game

被引:1
作者
Lu, Wei [1 ]
Liu, Hao [2 ]
Gao, Qing [3 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Multi-agent system; task assignment; optimal control; reinforcement learning; unmanned aerial vehicle; TASK ASSIGNMENT; MULTIPLE QUADROTORS; UAV; ALLOCATION; NETWORKS; TRACKING;
D O I
10.1142/S2301385024420019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a task assignment method is proposed to deal with the multi-agent pursuit-evade game for heterogeneous unnamed aerial vehicles via reinforcement learning. The mathematical model based on the local position error dynamics is established to describe the interactions among the vehicles in the pursuit-evade game, subject to high nonlinearities and parameter uncertainties involved in the vehicle model. The execution costs and the corresponding optimal control policies of the agent pursuing each target are calculated, and the policy with minimum execution cost is determined as the objective of the multi-agent pursuit-evade game. Min-Max strategy is introduced to estimate and counteract the interaction effects in the mathematical model, and the reinforcement learning-based algorithm is proposed to obtain the optimal solution to the assignment problem based on the Hamilton-Jacobi-Bellman equation without the interaction effects. Simulation results are given to show the effectiveness of the proposed task assignment method.
引用
收藏
页码:523 / 533
页数:11
相关论文
共 27 条
  • [1] A Multi-Trip Task Assignment for Early Target Inspection in Squads of Aerial Drones
    Bartolini, Novella
    Coletta, Andrea
    Maselli, Gaia
    Khalifeh, Alar
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (11) : 3099 - 3116
  • [2] Multiagent Dynamic Task Assignment Based on Forest Fire Point Model
    Chen, Jie
    Guo, Yuqian
    Qiu, Zhifeng
    Xin, Bin
    Jia, Qing-Shan
    Gui, Weihua
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (02) : 833 - 849
  • [3] Multi-UAV Task Assignment With Parameter and Time-Sensitive Uncertainties Using Modified Two-Part Wolf Pack Search Algorithm
    Chen, Yongbo
    Yang, Di
    Yu, Jianqiao
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2018, 54 (06) : 2853 - 2872
  • [4] Consensus-Based Decentralized Auctions for Robust Task Allocation
    Choi, Han-Lim
    Brunet, Luc
    How, Jonathan P.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2009, 25 (04) : 912 - 926
  • [5] Nonlinear Control of Quadrotor for Point Tracking: Actual Implementation and Experimental Tests
    Choi, Young-Cheol
    Ahn, Hyo-Sung
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2015, 20 (03) : 1179 - 1192
  • [6] Dynamic Delayed-Decision Task Assignment Under Spatial-Temporal Constraints in Mobile Crowdsensing
    Ding, Yu
    Zhang, Lichen
    Guo, Longjiang
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (04): : 2418 - 2431
  • [7] Dynamic Discrete Pigeon-Inspired Optimization for Multi-UAV Cooperative Search-Attack Mission Planning
    Duan, Haibin
    Zhao, Jianxia
    Deng, Yimin
    Shi, Yuhui
    Ding, Xilun
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2021, 57 (01) : 706 - 720
  • [8] Learning Cooperative Multi-Agent Policies With Partial Reward Decoupling
    Freed, Benjamin
    Kapoor, Aditya
    Abraham, Ian
    Schneider, Jeff
    Choset, Howie
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 890 - 897
  • [9] Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading
    Gao, Ang
    Wang, Qi
    Liang, Wei
    Ding, Zhiguo
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 12888 - 12901
  • [10] UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning
    Gong, Zihao
    Xu, Yang
    Luo, Delin
    [J]. UNMANNED SYSTEMS, 2023, 11 (03) : 273 - 286