共 41 条
[1]
Abraham W.C., NPJ SCI LEARNING, V4
[3]
[Anonymous], 2014, INT C LEARN REPR
[4]
[Anonymous], ARXIV160604671
[5]
Atkinson C., ARXIV180203875
[6]
Reinforcement learning in continuous time and space: Interference and not ill conditioning is the main problem when using distributed function approximators
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2008, 38 (04)
:950-956
[7]
Berseth Glen, 2018, INT C LEARN REPR
[8]
Caselles-Dupre H., ARXIV190209434
[9]
Caselles-Dupre H., ARXIV181003880