A Monotone Approximate Dynamic Programming Approach for the Stochastic Scheduling, Allocation, and Inventory Replenishment Problem: Applications to Drone and Electric Vehicle Battery Swap Stations

Cited by: 11
Authors
Asadi, Amin [1, 2]
Pinkley, Sarah Nurre [2]
Affiliations
[1] Univ Twente, Dept Ind Engn & Business Informat Syst, NL-7522 NB Enschede, Netherlands
[2] Univ Arkansas, Dept Ind Engn, Fayetteville, AR 72701 USA
Funding
National Science Foundation (USA);
Keywords
electric vehicles and drones; battery swap station; Markov decision processes; battery degradation; monotone policy and value function; regression-based initialization; approximate dynamic programming; OPTIMIZATION; SYSTEM; MAINTENANCE; ALGORITHMS; MANAGEMENT; OPERATIONS; DISPATCH; DEMAND; MODELS; DRIVEN;
DOI
10.1287/trsc.2021.1108
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research];
Discipline classification codes
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
There is a growing interest in using electric vehicles (EVs) and drones for many applications. However, battery-oriented issues, including range anxiety and battery degradation, impede adoption. Battery swap stations, which allow depleted batteries to be swapped for full ones in minutes, are one alternative for reducing these concerns. We consider the problem of deriving actions at a battery swap station when explicitly considering the uncertain arrival of swap demand, battery degradation, and replacement. We model the operations at a battery swap station using a finite horizon Markov decision process model for the stochastic scheduling, allocation, and inventory replenishment problem (SAIRP), which determines when and how many batteries are charged, discharged, and replaced over time. We present theoretical proofs for the monotonicity of the value function and the monotone structure of an optimal policy for special SAIRP cases. Because of the curses of dimensionality, we develop a new monotone approximate dynamic programming (ADP) method, which intelligently initializes a value function approximation using regression. In computational tests, we demonstrate the superior performance of the new regression-based monotone ADP method compared with exact methods and other monotone ADP methods. Furthermore, with the tests, we deduce policy insights for drone swap stations.
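To make the idea of a monotone value function approximation concrete, the following is a minimal sketch of the generic monotonicity-preserving projection step used in monotone ADP methods. It is not taken from the paper: the one-dimensional state (for example, a count of full batteries), the assumption that the value is nondecreasing in that state, and the function name `monotone_projection` are all illustrative assumptions.

```python
def monotone_projection(values, idx, new_value):
    """Hypothetical helper: write new_value at position idx, then repair
    any violations so the list stays nondecreasing in the state index.

    Assumes a one-dimensional state and a value function that is
    nondecreasing in it; both are illustrative assumptions.
    """
    projected = list(values)  # copy, so the caller's list is untouched
    projected[idx] = new_value
    for j in range(len(projected)):
        if j > idx and projected[j] < new_value:
            # Larger states may not be worth less than the updated state.
            projected[j] = new_value
        elif j < idx and projected[j] > new_value:
            # Smaller states may not be worth more than the updated state.
            projected[j] = new_value
    return projected


# Raising the estimate at state 1 lifts the violating neighbor above it.
raised = monotone_projection([0.0, 1.0, 2.0, 3.0], 1, 2.5)

# Lowering the estimate at state 2 pulls down the violating neighbor below it.
lowered = monotone_projection([0.0, 1.0, 2.0, 3.0], 2, 0.5)
```

Each observed update is thereby propagated to neighboring states, which is one way a monotone structure can reduce the number of states that must be visited; how the paper's regression-based initialization sets the starting values is not shown here.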
Pages: 1085-1110
Number of pages: 26