共 32 条
- [1] Barr RS(1995)Designing and reporting on computational experiments with heuristic methods Journal of Heuristics 1 9-32
- [2] Golden BL(2011)Approximate policy iteration: A survey and some new methods Journal of Control Theory and Applications 9 310-335
- [3] Kelly JP(1996)Linear least-squares algorithms for temporal difference learning Machine Learning 22 33-57
- [4] Resende MG(2012)Thirty years of inventory routing Transportation Science 48 1-19
- [5] Stewart J(2017)Approximate dynamic programming for missile defense interceptor fire control European Journal of Operational Research 259 873-886
- [6] William R(2002)The stochastic inventory routing problem with direct deliveries Transportation Science 36 94-70
- [7] Bertsekas DP(2004)Dynamic programming approximations for a stochastic inventory routing problem Transportation Science 38 42-1149
- [8] Bradtke SJ(2003)Least-squares policy iteration The Journal of Machine Learning Research 4 1107-749
- [9] Barto AG(2010)Disruption management of the vehicle routing problem with vehicle breakdown Journal of the Operational Research Society 62 742-38
- [10] Coelho LC(2012)Perspectives of approximate dynamic programming Annals of Operations Research 13 1-839