40 entries in total
- [1] Asiain E (2019) Controller exploitation-exploration: A reinforcement learning architecture. Soft Computing 23 3591-3604
- [2] Clempner JB (2019) Allassonnière S: Learning from both experts and data. Entropy 21 1208-25
- [3] Poznyak AS (2021) A Markovian Stackelberg game approach for computing an optimal dynamic mechanism. Computational and Applied Mathematics 40 1-862
- [4] Besson R (2021) A proximal/gradient approach for computing the Nash equilibrium in controllable Markov games. J Optim Theory Appl 188 847-286
- [5] Le Pennec E (2022) A dynamic mechanism design for controllable and ergodic Markov games. Computational Economics, to be published, 328 267-15
- [6] Clempner JB (2018) A Tikhonov regularized penalty function approach for solving polylinear programming problems. J. Comput. Appl. Math. 95 1-128
- [7] Clempner JB (2020) A nucleus for Bayesian partially observable Markov games: Joint observer and mechanism design. Engineering Applications of Artificial Intelligence 9 118-464
- [8] Clempner JB (2021) Analytical method for mechanism design in partially observable Markov games. Mathematics 147 457-492
- [9] Clempner JB (2000) Survey of adaptive dual control methods. IEEE Control Theory and Applications 19 359-30
- [10] Poznyak AS (2007) Bayesian policy gradient algorithms. Neural Information Processing Systems 8 1-1231