共 24 条
[2]
Basher H., 2003, ORNLTM2003252
[5]
Dulac-Arnold G, 2019, Arxiv, DOI arXiv:1904.12901
[6]
Reinforcement Learning with Multiple Shared Rewards
[J].
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016),
2016, 80
:855-864
[7]
Haarnoja T, 2018, PR MACH LEARN RES, V80
[8]
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[10]
KAERI, 1990, Advanced compact nuclear simulator textbook.