共 62 条
[52]
Online Learning and Online Convex Optimization
[J].
FOUNDATIONS AND TRENDS IN MACHINE LEARNING,
2012, 4 (02)
:107-194
[54]
Sutton R., 1998, Introduction to reinforcement learning
[58]
van Damme E., 1987, Stability and Perfection of Nash Equilibria
[59]
No-regret dynamics and fictitious play
[J].
JOURNAL OF ECONOMIC THEORY,
2013, 148 (02)
:825-842
[60]
von Neumann J, 1928, MATH ANN, V100, P295