A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems

被引:4
作者
Wei, Honghao [1 ]
Yang, Zixian [2 ]
Liu, Xin [3 ]
Qin, Zhiwei [4 ,5 ]
Tang, Xiaocheng [4 ,6 ]
Ying, Lei [2 ]
机构
[1] Washington State Univ, EECS, Pullman, WA 99164 USA
[2] Univ Michigan, EECS, Ann Arbor, MI 48109 USA
[3] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
[4] Lab DiDi, Mountain View, CA 94043 USA
[5] Lyft Rideshare Labs, San Francisco, CA 94107 USA
[6] Meta AI, San Francisco, CA 94102 USA
关键词
Ride-hailing; large-scale; reposition; idle car routing; reinforcement learning;
D O I
10.1109/TITS.2023.3312048
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Existing approaches for vehicle repositioning on large-scale ride-hailing platforms either ignore the spatial -temporal mismatch between supply and demand in real-time or overlook the long-term balance of the system. To account for both, we propose a lookahead repositioning policy in this paper, which is a novel approach to repositioning idle vehicles from both a dynamic system and a long-term performance perspective. Our method consists of two parts; the first part utilizes linear programming (LP) to formulate the nonstationary system as a time-varying, T-step lookahead optimization problem and explicitly models the fraction of drivers who follow repositioning recommendations (called the repositioning rate). The second step is to incorporate a reinforcement learning (RL) method to maximize long-term return based on learned value functions after the T time slots. Extensive studies utilizing a real-world dataset on both small-scale and large-scale simulators show that our method outperforms previous baseline methods and is robust to prediction errors.
引用
收藏
页码:1846 / 1856
页数:11
相关论文
共 20 条
  • [1] Pricing and Prioritizing Time-Sensitive Customers with Heterogeneous Demand Rates
    Afeche, Philipp
    Baron, Opher
    Milner, Joseph
    Roet-Green, Ricky
    [J]. OPERATIONS RESEARCH, 2019, 67 (04) : 1184 - 1208
  • [2] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [3] Empty-Car Routing in Ridesharing Systems
    Braverman, Anton
    Dai, J. G.
    Liu, Xin
    Ying, Lei
    [J]. OPERATIONS RESEARCH, 2019, 67 (05) : 1437 - 1452
  • [4] Cheng SF, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P577
  • [5] Traffic Forecasting via Dilated Temporal Convolution With Peak-Sensitive Loss
    Guo, Ge
    Yuan, Wei
    Liu, Jinyuan
    Lv, Yisheng
    Liu, Wei
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2023, 15 (01) : 48 - 57
  • [6] A residual spatio-temporal architecture for travel demand forecasting
    Guo, Ge
    Zhang, Tianqi
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 115
  • [7] Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem
    Holler, John
    Vuorio, Risto
    Qin, Zhiwei
    Tang, Xiaocheng
    Jiao, Yan
    Jin, Tiancheng
    Singh, Satinder
    Wang, Chenxi
    Ye, Jieping
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1090 - 1095
  • [8] Iglesias R, 2018, IEEE INT CONF ROBOT, P6019
  • [9] Jiao Y., 2020, 2020 IEEE HOT CHIPS, P1, DOI [DOI 10.1109/HCS49909.2020.9220619, 10.1109/HCS49909.2020.9220619]
  • [10] A Comparison of Detrending Models and Multi-Regime Models for Traffic Flow Prediction
    Li, Zhiheng
    Li, Yuebiao
    Li, Li
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2014, 6 (04) : 34 - 44