共 365 条
[31]
Bellemare MG, 2017, PR MACH LEARN RES, V70
[34]
Bellman R., 1958, Information and Control, V3, P228, DOI DOI 10.1016/S0019-9958(58)80003-0
[35]
Bellman R., 1972, Dynamic Programming
[36]
Bellman R, 1956, SANKHYA, V16, P221
[37]
Berner C., 2019, arXiv
[38]
Bertsekas D., 2012, Dynamic Programming and Optimal Control, VI
[39]
Bertsekas Dimitri P, 1996, Neuro-Dynamic Programming
[40]
Bhatnagar S., 2009, Advances in Neural Information Processing Systems, P1204