共 43 条
- [3] Basar T, 1998, Dynamic Noncooperative Game Theory, V2nd
- [5] Multiple model-based reinforcement learning [J]. NEURAL COMPUTATION, 2002, 14 (06) : 1347 - 1369
- [8] Ioannou P, 2006, ADV DES CONTROL, P1, DOI 10.1137/1.9780898718652
- [9] Janner M, 2019, ADV NEUR IN, V32