Dynamic Courier Capacity Acquisition in Rapid Delivery Systems: A Deep Q-Learning Approach

Cited by: 3
Authors
Auad, Ramon [1, 2]
Erera, Alan [1]
Savelsbergh, Martin [1]
Affiliations
[1] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
[2] Univ Catolica Norte, Dept Ind Engn, Antofagasta 1240000, Chile
Keywords
logistics; rapid delivery; capacity management; last-mile delivery; reinforcement learning; deep Q-learning; optimization
DOI
10.1287/trsc.2022.0042
Chinese Library Classification
C93 [Management]; O22 [Operations Research]
Discipline classification codes
070105; 12; 1201; 1202; 120202
Abstract
With the recent boom of the gig economy, urban delivery systems have experienced substantial demand growth. In such systems, orders are delivered to customers from local distribution points subject to a delivery time promise. An important example is a restaurant meal delivery system, where deliveries are expected within minutes of an order being placed. The system serves orders by making use of couriers that continuously perform pickups and deliveries. Operating such a rapid delivery system is very challenging, primarily because of the high service expectations and the considerable uncertainty in both demand and delivery capacity. Delivery providers typically plan courier shifts for an operating period based on a demand forecast. However, because of the high demand volatility, it may at times be necessary to adjust the plan during the operating period by dynamically adding couriers. We study the problem of dynamically adding courier capacity in a rapid delivery system and propose a deep reinforcement-learning approach to obtain a policy that balances the cost of adding couriers against the cost of service quality degradation caused by insufficient delivery capacity. Specifically, we seek to ensure that a high fraction of orders is delivered on time with a small number of courier hours. A computational study in the meal delivery space shows that a learned policy outperforms policies representing current practice and demonstrates the potential of deep learning for solving operational problems in highly stochastic logistics settings.
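As a rough illustration of the trade-off described in the abstract, the sketch below pairs a cost-based reward with the standard one-step Q-learning target. It is not the authors' formulation: the function names, the weights `late_penalty` and `hour_cost`, and the discount factor are all hypothetical choices made for the example.

```python
def reward(late_orders: int, courier_hours_added: float,
           late_penalty: float = 1.0, hour_cost: float = 0.5) -> float:
    """Negative cost of one decision epoch.

    Penalises late orders (service degradation) and added courier hours
    (capacity cost). Both weights are hypothetical, not from the paper.
    """
    return -(late_penalty * late_orders + hour_cost * courier_hours_added)


def q_target(r: float, next_q_values: list[float],
             gamma: float = 0.99, terminal: bool = False) -> float:
    """Standard one-step Q-learning target: r + gamma * max_a' Q(s', a')."""
    if terminal or not next_q_values:
        return r
    return r + gamma * max(next_q_values)
```

In a deep Q-learning setup, a neural network would supply `next_q_values` for each candidate action (for example, how many couriers to add), and the target would be used as the regression label for the chosen action.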
Pages: 67-93
Number of pages: 28