A Markov decision process approach for managing medical drone deliveries

被引：22

作者：

Asadi, Amin ^{[1
,2
]}

Pinkley, Sarah Nurre ^{[2
]}

Mes, Martijn ^{[1
]}

机构：

[1] Univ Twente, Dept Ind Engn & Business Informat Syst, NL-7522 NB Enschede, Netherlands

[2] Univ Arkansas, Dept Ind Engn, 4207 Bell Engn, Fayetteville, AR 72701 USA

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2022年 / 204卷

基金：

美国国家科学基金会;

关键词：

Markov decision processes; Drones; Healthcare; Routing; Dynamic scheduling allocation; Reinforcement learning; DYNAMIC-PROGRAMMING ALGORITHMS; DEMAND; OPTIMIZATION; ALLOCATION; VEHICLES; MODEL;

D O I：

10.1016/j.eswa.2022.117490

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Drone delivery is a fast and innovative method for delivering parcels, food, and medical supplies. Furthermore, this low-contact delivery mode contributes to reducing the spread of pandemic and vaccine-preventable diseases. Focusing on the delivery of medical supplies, this paper studies optimizing the distribution operations at a drone hub that dispatches drones to hospitals located at different geographic locations. Each hospital generates stochastic demands for medical supplies to be covered. This paper classifies stochastic demands based on the distance between hospitals and the drone hub. Satisfying the demands requires flying over different ranges, which is directly related to the amount of charge of the drone batteries. We develop a stochastic scheduling and allocation problem with multiple classes of demand and model the problem using a finite Markov decision process approach. We provide exact solutions for the modest sizes instances using backward induction and discuss that the problem suffers from the curses of dimensionality. Hence, we provide a reinforcement learning method capable of giving near-optimal solutions. We perform a set of computational tests using realistic data representing a prominent drone delivery company. Finally, we analyze the results to provide insights for managing drone hub operations and show that the reinforcement learning method has high performance compared with the exact and heuristic solution methods.

引用

页数：17

共 92 条

[1]

Ackerman E, 2020, Zipline Launches Long-Distance Drone Delivery of COVID-19 Supplies in the U.S

[2]

Al-Sabban WH, 2013, IEEE INT CONF ROBOT, P784, DOI 10.1109/ICRA.2013.6630662

[3] The optimal timing of living-donor liver transplantation [J].

Alagoz, O ;

Maillart, LM ;

Schaefer, AJ ;

Roberts, MS .

MANAGEMENT SCIENCE, 2004, 50 (10) :1420-1430

[4]

[Anonymous], 2019, MINIMIZING TRAVEL TI

[5]

[Anonymous], 2015, At least 60 killed in the blast at Shikarpur imambargah'

[6]

[Anonymous], 2015, SEGOLENE ROYAL AGREE

[7]

ARMONY M, 2015, STOCHASTIC SYSTEMS, V5, P1, DOI DOI 10.1214/14-SSY153

[8] A Monotone Approximate Dynamic Programming Approach for the Stochastic Scheduling, Allocation, and Inventory Replenishment Problem: Applications to Drone and Electric Vehicle Battery Swap Stations [J].

Asadi, Amin ;

Pinkley, Sarah Nurre .

TRANSPORTATION SCIENCE, 2022, 56 (04) :1085-1110

[9] A stochastic scheduling, allocation, and inventory replenishment problem for battery swap stations [J].

Asadi, Amin ;

Pinkley, Sarah Nurre .

TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2021, 146

[10]

Baek SS, 2013, IEEE INT C INT ROBOT, P2955, DOI 10.1109/IROS.2013.6696775

← 1 2 3 4 5 6 7 8 9 10 →