29 entries in total
[2]
Deisenroth M., 2011, PROC INT C MACH LEAR, P465
[3]
Dosovitskiy A, 2017, PR MACH LEARN RES, V78
[5]
Feinberg V., 2018, Model-based value estimation for efficient model-free reinforcement learning
[9]
Grondman I., 2015, THESIS DELFT U TECHN, DOI 10.4233/UUID:415-14FD-0B1B-4E18-8974-5AD61F7FE280