Nash Bargaining Solution based rendezvous guidance of unmanned aerial vehicles

Cited by: 7
Authors
Bardhan, R. [1]
Ghose, D. [1]
Affiliations
[1] Indian Inst Sci, Guidance Control & Decis Syst Lab, Dept Aerosp Engn, Bangalore 560012, Karnataka, India
Source
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2018 / Vol. 355 / Issue 16
Keywords
DIFFERENTIAL GAME APPROACH; RECEDING HORIZON CONTROL; DECENTRALIZED OPTIMIZATION; MULTIAGENT SYSTEMS; OBSTACLE AVOIDANCE; FORMATION FLIGHT; CONTROL DESIGN; COORDINATION; CONSENSUS; NETWORKS;
DOI
10.1016/j.jfranklin.2018.08.005
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper addresses a finite-time rendezvous problem for a group of unmanned aerial vehicles (UAVs) in the absence of a leader or a reference trajectory. When the UAVs do not cooperate, they are assumed to use Nash equilibrium strategies (NES). However, when the UAVs can communicate among themselves, they can implement cooperative game-theoretic strategies for mutual benefit. In a convex linear quadratic differential game (LQDG), a Pareto-optimal solution (POS) is obtained when the UAVs jointly minimize a team cost functional, which is constructed as a convex combination of the individual cost functionals. This paper proposes an algorithm to determine the weights of the convex combination corresponding to the Pareto-optimal Nash Bargaining Solution (NBS), which offers each UAV a lower cost than that incurred from the NES. Conditions on the cost functions under which the proposed algorithm converges to the NBS are presented. A UAV programmed to choose its strategies at a given time based on cost-to-go estimates for the rest of the game duration may switch to the NES if it finds this more beneficial than continuing with a cooperative strategy it previously agreed upon with the other UAVs. For such scenarios, the paper proposes a renegotiation method that uses the proposed algorithm to obtain the NBS corresponding to the state of the game at an intermediate time. This renegotiation method helps to establish cooperation between UAVs and prevents non-cooperative behaviour. In this context, conditions for time consistency of a cooperative solution are derived for the LQDG. The efficacy of the guidance law derived from the proposed algorithm is illustrated through simulations. © 2018 Published by Elsevier Ltd on behalf of The Franklin Institute.
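The relationship between the NES, the weighted team cost, and the NBS described in the abstract can be illustrated on a much smaller problem. The sketch below is not the paper's algorithm and does not model the LQDG: it assumes a hypothetical static game in which two agents with initial separation `D` each pick a scalar displacement `u_i` and pay `J_i = (D + u1 - u2)^2 + r_i * u_i^2`. All names and numbers (`D`, `R`, `pareto_point`, `nash_bargaining_weight`) are illustrative assumptions, and a plain grid search over the weight stands in for the proposed weight-finding algorithm.

```python
import numpy as np

# Hypothetical toy data (not from the paper).
D = -1.0          # initial separation x1 - x2
R = (1.0, 2.0)    # control-effort weights r1, r2


def costs(u):
    """Return (J1, J2) with J_i = (D + u1 - u2)^2 + r_i * u_i^2."""
    e = D + u[0] - u[1]
    return np.array([e**2 + R[0] * u[0]**2, e**2 + R[1] * u[1]**2])


def nash_equilibrium():
    """Solve the two first-order conditions dJ_i/du_i = 0 simultaneously."""
    A = np.array([[1.0 + R[0], -1.0], [-1.0, 1.0 + R[1]]])
    return np.linalg.solve(A, np.array([-D, D]))


def pareto_point(a):
    """Minimise the team cost a*J1 + (1-a)*J2 for a weight 0 < a < 1."""
    A = np.array([[1.0 + a * R[0], -1.0], [-1.0, 1.0 + (1.0 - a) * R[1]]])
    return np.linalg.solve(A, np.array([-D, D]))


def nash_bargaining_weight(grid=np.linspace(0.01, 0.99, 99)):
    """Grid-search the weight that maximises the product of cost reductions
    relative to the Nash equilibrium (the disagreement point)."""
    j_nash = costs(nash_equilibrium())
    best_a, best_product = None, 0.0
    for a in grid:
        gains = j_nash - costs(pareto_point(a))
        if np.all(gains > 0) and gains.prod() > best_product:
            best_a, best_product = a, gains.prod()
    return best_a, j_nash


a_star, j_nash = nash_bargaining_weight()
j_nbs = costs(pareto_point(a_star))
print(f"weight={a_star:.2f}, Nash costs={j_nash}, NBS costs={j_nbs}")
```

In this toy setting the selected weight yields a strictly lower cost for both agents than the Nash equilibrium, which is the property the abstract attributes to the NBS. The grid resolution limits the accuracy of the weight.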
Pages: 8106-8140
Number of pages: 35