共 50 条
[42]
Q-Learning with Side Information in Multi-Agent Finite Games
[J].
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC),
2019,
:5032-5037
[46]
Q-learning and policy iteration algorithms for stochastic shortest path problems
[J].
Annals of Operations Research,
2013, 208
:95-132
[50]
Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction
[J].
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020,
2020, 33