Dynamic Courier Capacity Acquisition in Rapid Delivery Systems: A Deep Q-Learning Approach

Cited by: 8

Authors:

Auad, Ramon [1,2]

Erera, Alan [1]

Savelsbergh, Martin [1]

Affiliations:

[1] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA

[2] Univ Catolica Norte, Dept Ind Engn, Antofagasta 1240000, Chile

Source:

TRANSPORTATION SCIENCE | 2024, Vol. 58, No. 1

Keywords:

logistics; rapid delivery; capacity management; last-mile delivery; reinforcement learning; deep Q-learning; OPTIMIZATION

DOI:

10.1287/trsc.2022.0042

Chinese Library Classification (CLC):

C93 [Management Science]; O22 [Operations Research];

Subject classification codes:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

With the recent boom of the gig economy, urban delivery systems have experienced substantial demand growth. In such systems, orders are delivered to customers from local distribution points respecting a delivery time promise. An important example is a restaurant meal delivery system, where delivery times are expected to be minutes after an order is placed. The system serves orders by making use of couriers that continuously perform pickups and deliveries. Operating such a rapid delivery system is very challenging, primarily because of the high service expectations and the considerable uncertainty in both demand and delivery capacity. Delivery providers typically plan courier shifts for an operating period based on a demand forecast. However, because of the high demand volatility, it may at times during the operating period be necessary to adjust and dynamically add couriers. We study the problem of dynamically adding courier capacity in a rapid delivery system and propose a deep reinforcement-learning approach to obtain a policy that balances the cost of adding couriers and the cost of service quality degradation because of insufficient delivery capacity. Specifically, we seek to ensure that a high fraction of orders is delivered on time with a small number of courier hours. A computational study in the meal delivery space shows that a learned policy outperforms policies representing current practice and demonstrates the potential of deep learning for solving operational problems in highly stochastic logistic settings.

Pages: 67-93

Page count: 28

