共 69 条
- [1] Abbasi-Yadkori Yasin, 2011, ADV NEURAL INFORM PR, P2312
- [2] Agrawal S., 2017, Advances in Neural Information Processing Systems, P1184
- [3] Akkaya Andrychowicz Chociej Litwin McGrew Petron Paino Plappert Powell Ribas Schneider Tezak Tworek Welinder Weng Yuan Zaremba Zhang I. M. M. M. B. A. A. M. G. R. J. N. J. P. L. Q. W. L., 2019, arXiv
- [4] [Anonymous], 2017, ARXIV170305449
- [5] [Anonymous], 2019, ADV NEUR IN
- [7] Ayoub A, 2020, PR MACH LEARN RES, V119
- [9] Bertsekas D. P., 1996, Neuro-Dynamic Programming
- [10] Cai Q., 2019, ARXIV191205830