Nash Bargaining Solution based rendezvous guidance of unmanned aerial vehicles

Citations: 7
Authors
Bardhan, R. [1]
Ghose, D. [1]
Affiliations
[1] Indian Inst Sci, Guidance Control & Decis Syst Lab, Dept Aerosp Engn, Bangalore 560012, Karnataka, India
Source
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2018 / Vol. 355 / Issue 16
Keywords
DIFFERENTIAL GAME APPROACH; RECEDING HORIZON CONTROL; DECENTRALIZED OPTIMIZATION; MULTIAGENT SYSTEMS; OBSTACLE AVOIDANCE; FORMATION FLIGHT; CONTROL DESIGN; COORDINATION; CONSENSUS; NETWORKS;
DOI
10.1016/j.jfranklin.2018.08.005
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper addresses a finite-time rendezvous problem for a group of unmanned aerial vehicles (UAVs), in the absence of a leader or a reference trajectory. When the UAVs do not cooperate, they are assumed to use Nash equilibrium strategies (NES). However, when the UAVs can communicate among themselves, they can implement cooperative game theoretic strategies for mutual benefit. In a convex linear quadratic differential game (LQDG), a Pareto-optimal solution (POS) is obtained when the UAVs jointly minimize a team cost functional, which is constructed as a convex combination of the individual cost functionals. This paper proposes an algorithm to determine the weights of the convex combination corresponding to the Pareto-optimal Nash Bargaining Solution (NBS), which offers each UAV a lower cost than that incurred from the NES. Conditions on the cost functions that make the proposed algorithm converge to the NBS are presented. A UAV programmed to choose its strategies at a given time based upon cost-to-go estimates for the rest of the game duration may switch to the NES if it finds this more beneficial than continuing with a cooperative strategy it previously agreed upon with the other UAVs. For such scenarios, a renegotiation method is proposed that uses the proposed algorithm to obtain the NBS corresponding to the state of the game at an intermediate time. This renegotiation method helps to establish cooperation between UAVs and prevents non-cooperative behaviour. In this context, conditions for the time consistency of a cooperative solution are derived for the LQDG. The efficacy of the guidance law derived from the proposed algorithm is illustrated through simulations. © 2018 Published by Elsevier Ltd on behalf of The Franklin Institute.
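The idea in the abstract of picking the convex-combination weights that yield the Nash Bargaining Solution can be illustrated with a toy static problem. The sketch below is not the paper's algorithm and does not model the LQDG. It assumes two agents on a line at hypothetical positions `p1` and `p2`, quadratic costs `(x - p_i)^2` for a common meeting point `x`, and hypothetical disagreement costs `d1` and `d2` standing in for the non-cooperative (NES) costs. A simple grid search over the weight `alpha` then selects the Pareto point that maximizes the Nash product of cost reductions.

```python
# Toy static illustration only (assumed setup, not the paper's LQDG algorithm).
# All names (p1, p2, d1, d2, alpha) are hypothetical and chosen for this sketch.


def costs(alpha, p1, p2):
    """Return both agents' costs at the Pareto point for weight alpha."""
    # Minimizing alpha*(x - p1)^2 + (1 - alpha)*(x - p2)^2 over x
    # gives the weighted average of the two positions.
    x = alpha * p1 + (1.0 - alpha) * p2
    return (x - p1) ** 2, (x - p2) ** 2


def nash_bargaining_weight(p1, p2, d1, d2, steps=10001):
    """Grid-search the weight alpha in [0, 1] that maximizes the Nash product.

    Returns None when no weight gives both agents a cost strictly
    below their disagreement cost.
    """
    best_alpha, best_product = None, 0.0
    for k in range(steps):
        alpha = k / (steps - 1)
        j1, j2 = costs(alpha, p1, p2)
        gain1, gain2 = d1 - j1, d2 - j2
        # Both agents must do strictly better than when not cooperating.
        if gain1 <= 0 or gain2 <= 0:
            continue
        product = gain1 * gain2
        if product > best_product:
            best_alpha, best_product = alpha, product
    return best_alpha


if __name__ == "__main__":
    alpha = nash_bargaining_weight(p1=0.0, p2=4.0, d1=10.0, d2=10.0)
    print("bargaining weight:", alpha)
    print("cooperative costs:", costs(alpha, 0.0, 4.0))
```

With equal disagreement costs the search settles on equal weights, and in any feasible case both agents end up with a lower cost than their disagreement cost, which mirrors the property the abstract attributes to the NBS.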
Pages: 8106-8140
Page count: 35