30 items in total
[21] Policy Improvement for Perfect Information Additive Reward and Additive Transition Stochastic Games with Discounted and Average Payoffs [J]. Journal of Dynamics and Games, 2014, 1(03): 347-361.
[24] Existence of the Limit Value of Two Person Zero-Sum Discounted Repeated Games via Comparison Theorems [J]. Journal of Optimization Theory and Applications, 2013, 157: 564-576.
[26] Perfect Aggregation of Information in Two-Person Multistage Games with Fixed Sequence of Moves and Aggregated Information on Partner’s Choice [J]. Automation and Remote Control, 2010, 71: 1240-1246.
[29] Online Solution of Nonlinear Two-Player Zero-Sum Games Using Synchronous Policy Iteration [C]. 49th IEEE Conference on Decision and Control (CDC), 2010: 3040-3047.
[30] Decentralized Learning in Two-Player Zero-Sum Games: A LR-I Lagging Anchor Algorithm [C]. 2011 American Control Conference, 2011: 107-112.