共 21 条
[11]
Orseau Laurent., 2016, C UNC ART INT, P557
[15]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[16]
Sutton RS, 1996, ADV NEUR IN, V8, P1038
[17]
van Hasselt H, 2016, AAAI CONF ARTIF INTE, P2094
[18]
van Seijen H, 2009, ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, P177