[11]
Lerousseau M.
[12]
Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, MIT Press, (2018)
[13]
Mnih V., Badia A.P., et al., Asynchronous methods for deep reinforcement learning, CoRR, abs/1602.01783, (2016)
[14]
Wang Z., de Freitas N., Lanctot M., Dueling network architectures for deep reinforcement learning, CoRR, abs/1511.06581, (2015)
[15]
Mnih V., et al., Human-level control through deep reinforcement learning, Nature, 518, 7540, pp. 529-533, (2015)
[16]
Rahwan I., et al., Machine behaviour, Nature, (2019)
[17]
Osband I., (2019)