50 entries in total
[1]
Assran M., 2019, ADV NEURAL INFORM PR, P13320
[2]
Bai Y, 2019, ADV NEUR IN, V32
[3]
Error bounds for constant step-size Q-learning [J]. SYSTEMS & CONTROL LETTERS, 2012, 61 (12): 1203-1208
[4]
Bonawitz K., 2019, Proc. Mach. Learn. Res, V1, P374
[6]
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (02): 917-929
[7]
Chen Z., 2022, Transactions on Machine Learning Research
[8]
Chen ZW, 2021, Arxiv, DOI arXiv:2102.01567
[9]
Chen Zaiwei, 2020, Advances in Neural Information Processing Systems, V33, P8223
[10]
Chen Ziyi, 2022, P MACHINE LEARNIN