55 references in total
- [1] Agarwal A., 2020, NEURIPS, V33, P20095
- [2] Agarwal A., 2019, Reinforcement learning: Theory and algorithms
- [3] Arora S., 2020, PMLR, P367
- [4] Ayoub A., 2020, PROC 37TH INT C MACHINE LEARNING, P463
- [5] Barreto A., 2017, ADV NEUR IN, V30
- [6] Bengio Y., 2009, INT C MACH LEARN
- [7] Blier L., 2021, ARXIV
- [8] Brunskill E., 2014, PR MACH LEARN RES, V32, P316
- [9] Brunskill E., 2013, ARXIV