共 55 条
[1]
Agarwal N., 2021, P INT C LEARN REPR
[2]
Baird L., 1995, MACHINE LEARNING P 1, P30
[3]
Bertsekas D. P., 1996, Neuro-dynamic Programming
[4]
Bhandari J., 2018, Conference On Learning Theory, P1691
[8]
Carvalho D., 2020, ADV NEUR IN, V33
[9]
Chen Z., 2022, arXiv
[10]
Chen Z., 2021, Advances in Neural Information Processing Systems, V34