Differential dynamic programming for finite-horizon zero-sum differential games of nonlinear systems

被引:2
作者
Zhang, Bin [1 ]
Jia, Yingmin [2 ]
Zhang, Yuqi [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Beihang Univ, Res Div 7, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
differential dynamic programming; differential games; finite-horizon cost function; Hamilton-Jacobi-Isaacs equation; iterative algorithm; ADAPTIVE OPTIMAL-CONTROL; VALUE-ITERATION;
D O I
10.1002/rnc.6932
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we present an iterative algorithm based on differential dynamic programming (DDP) for finite-horizon two-person zero-sum differential games. The technique of DDP is used to expand the Hamilton-Jacobi-Isaacs (HJI) partial differential equation into higher-order differential equations. Using value function and saddle point approximations, the DDP expansion is transformed into algebraic matrix equation in integral form. Based on the algebraic matrix equation, a DDP iterative algorithm is developed to learn the solution to the differential games. Strict proof is proposed to guarantee the iterative convergences of the value function and saddle point. The new algorithm is fundamentally different from existing results, in the sense that it overcome the technical obstacle to address the time-varying behavior of HJI partial differential equation. Simulation examples are given to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:11062 / 11084
页数:23
相关论文
共 47 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]  
[Anonymous], 2012, Optimal control
[3]  
[Anonymous], 2015, Differential Games: A Concise Introduction
[4]  
Basar T., 1995, Dynamic Noncooperative Game Theory
[5]   Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design [J].
Bian, Tao ;
Jiang, Zhong-Ping .
AUTOMATICA, 2016, 71 :348-360
[6]   Adaptive dynamic programming and optimal control of nonlinear nonaffine systems [J].
Bian, Tao ;
Jiang, Yu ;
Jiang, Zhong-Ping .
AUTOMATICA, 2014, 50 (10) :2624-2632
[7]   Distributed Differential Games for Control of Multi-Agent Systems [J].
Cappello, Domenico ;
Mylvaganam, Thulasi .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (02) :635-646
[8]   Adaptive optimal target controlled infusion algorithm to prevent hypotension associated with labor epidural: An adaptive dynamic programming approach [J].
Davoud, Sherwin ;
Gao, Weinan ;
Riveros-Perez, Efrain .
ISA TRANSACTIONS, 2020, 100 :74-81
[9]  
Delfour Michel, 2008, 16th Mediterranean Conference on Control & Automation, MED 2008, P907, DOI 10.1109/MED.2008.4602013
[10]   Approximate Dynamic Programming: Combining Regional and Local State Following Approximations [J].
Deptula, Patryk ;
Rosenfeld, Joel A. ;
Kamalapurkar, Rushikesh ;
Dixon, Warren E. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) :2154-2166