共 68 条
- [32] The root of all value: a neural common currency for choice [J]. CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 1027 - 1038
- [38] Mnih V, 2016, PR MACH LEARN RES, V48
- [39] The Misbehavior of Reinforcement Learning [J]. PROCEEDINGS OF THE IEEE, 2014, 102 (04) : 528 - 541
- [40] Nachum O, 2017, ADV NEUR IN, V30