共 7 条
[2]
Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems..[J].Luo Biao;Yang Yin;Liu Derong.IEEE transactions on cybernetics.2020,
[4]
Stability Analysis of Optimal Adaptive Control Under Value Iteration Using a Stabilizing Initial Policy..[J].Heydari Ali.IEEE transactions on neural networks and learning systems.2017, 9
[6]
Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm.[J].Derong Liu;Hongliang Li;Ding Wang.Neurocomputing.2013,