Supply-Demand-aware Deep Reinforcement Learning for Dynamic Fleet Management

被引:6
作者
Zheng, Bolong [1 ]
Ming, Lingfeng [1 ]
Hu, Qi [1 ]
Lu, Zhipeng [1 ]
Liu, Guanfeng [2 ]
Zhou, Xiaofang [3 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
[2] Macquarie Univ, Sydney, NSW, Australia
[3] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
关键词
Trajectory; deep reinforcement learning; fleet management;
D O I
10.1145/3467979
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online ride-hailing platforms have reduced significantly the amounts of the time that taxis are idle and that passengers spend on waiting. As a key component of these platforms, the fleet management problem can be naturally modeled as a Markov Decision Process, which enables us to use the deep reinforcement learning. However, existing studies are proposed based on simplified problem settings that fail to model the complicated supply-dynamics and restrict the performance in the real traffic environment. In this article, we propose a supply-demand-aware deep reinforcement learning algorithm for taxi dispatching, where we use a deep Q-network with action sampling policy, called AS-DQN, to learn an optimal dispatching policy. Furthermore, we utilize a dueling network architecture, called AS-DDQN, to improve the performance of AS-DQN. Extensive experiments on real-world datasets offer insight into the performance of our model and show that it is capable of outperforming the baseline approaches.
引用
收藏
页数:19
相关论文
共 46 条
  • [1] [Anonymous], 2021, COMMUNITY GUIDELINES
  • [2] [Anonymous], 2021, DIDI
  • [3] Bai L, 2020, ADV NEUR IN, V33
  • [4] Bai L, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1981
  • [5] Online Vehicle Routing: The Edge of Optimization in Large-Scale Applications
    Bertsimas, Dimitris
    Jaillet, Patrick
    Martin, Sebastien
    [J]. OPERATIONS RESEARCH, 2019, 67 (01) : 143 - 162
  • [6] Empty-Car Routing in Ridesharing Systems
    Braverman, Anton
    Dai, J. G.
    Liu, Xin
    Ying, Lei
    [J]. OPERATIONS RESEARCH, 2019, 67 (05) : 1437 - 1452
  • [7] Efficiently Solving the Practical Vehicle Routing Problem: A Novel Joint Learning Approach
    Duan, Lu
    Zhan, Yang
    Hu, Haoyuan
    Gong, Yu
    Wei, Jiangwen
    Zhang, Xiaodong
    Xu, Yinghui
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3054 - 3063
  • [8] Optimize taxi driving strategies based on reinforcement learning
    Gao, Yong
    Jiang, Dan
    Xu, Yan
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2018, 32 (08) : 1677 - 1696
  • [9] Spatio-Temporal Capsule-based Reinforcement Learning for Mobility-on-Demand Network Coordination
    He, Suining
    Shin, Kang G.
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2806 - 2813
  • [10] An Effective Partitioning Approach for Competitive Spatial-Temporal Searching (GIS Cup)
    Hu, Qi
    Ming, Lingfeng
    Tong, Chengdong
    Zheng, Bolong
    [J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 620 - 623