An approximate dynamic programming approach for the empty container allocation problem

被引:124
作者
Lam, Shao-Wei [1 ]
Lee, Loo-Hay [1 ]
Tang, Loon-Ching [1 ]
机构
[1] Natl Univ Singapore, Dept Ind & Syst Engn, Singapore 117548, Singapore
关键词
dynamic container allocation; temporal difference learning; average cost minimization; sea cargo;
D O I
10.1016/j.trc.2007.04.005
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The objective of this study is to demonstrate the successful application of an approximate dynamic programming approach in deriving effective operational strategies for the relocation of empty containers in the containerized sea-cargo industry. A dynamic stochastic model for a simple two-ports two-voyages (TPTV) system is proposed first to demonstrate the effectiveness of the approximate optimal solution obtained through a simulation based approach known as the temporal difference (TD) learning for average cost minimization. An exact optimal solution can be obtained for this simple TPTV model. Approximate optimal results from the TPTV model utilizing a linear approximation architecture under the TD framework can then be compared to this exact solution. The results were found comparable and showed promising improvements over an existing commonly used heuristics. The modeling and solution approach can be extended to a realistic multiple-ports multiple-voyages (MPMV) system. Some results for the MPMV case are shown. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:265 / 277
页数:13
相关论文
共 28 条
[1]  
ASSAD A, 1987, NETWORKS, V8, P37
[2]   A MODEL FOR FLEET SIZING AND VEHICLE ALLOCATION [J].
BEAUJON, GJ ;
TURNQUIST, MA .
TRANSPORTATION SCIENCE, 1991, 25 (01) :19-45
[3]  
BENJAMIN VR, 1996, NEURODYNAMIC APPROAC
[4]  
Bertsekas D., 1996, NEURO DYNAMIC PROGRA, V1st
[5]   Missile defense and interceptor allocation by neuro-dynamic programming [J].
Bertsekas, DP ;
Homer, ML ;
Logan, DA ;
Patek, SD ;
Sandell, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (01) :42-51
[6]   A new value iteration method for the average cost dynamic programming problem [J].
Bertsekas, DP .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1998, 36 (02) :742-759
[7]  
Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V2
[8]   Branch-acid-bound parallelization strategies applied to a depot location and container fleet management problem [J].
Bourbeau, B ;
Crainic, TG ;
Gendron, B .
PARALLEL COMPUTING, 2000, 26 (01) :27-46
[9]  
*BUS TIM, 2002, SHIPP TIM SING HOLD, P3
[10]   A two-stage stochastic network model and solution methods for the dynamic empty container allocation problem [J].
Cheung, RK ;
Chen, CY .
TRANSPORTATION SCIENCE, 1998, 32 (02) :142-162