Supply-Demand-aware Deep Reinforcement Learning for Dynamic Fleet Management

Cited by: 6

Authors:

Zheng, Bolong [1]

Ming, Lingfeng [1]

Hu, Qi [1]

Lu, Zhipeng [1]

Liu, Guanfeng [2]

Zhou, Xiaofang [3]

Affiliations:

[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China

[2] Macquarie Univ, Sydney, NSW, Australia

[3] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China

Source:

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY | 2022 / Volume 13 / Issue 03

Keywords:

Trajectory; deep reinforcement learning; fleet management;

DOI:

10.1145/3467979

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Online ride-hailing platforms have significantly reduced the time that taxis sit idle and that passengers spend waiting. As a key component of these platforms, the fleet management problem can be naturally modeled as a Markov Decision Process, which enables the use of deep reinforcement learning. However, existing studies rely on simplified problem settings that fail to model the complicated supply-demand dynamics, which restricts their performance in real traffic environments. In this article, we propose a supply-demand-aware deep reinforcement learning algorithm for taxi dispatching, in which a deep Q-network with an action sampling policy, called AS-DQN, learns an optimal dispatching policy. Furthermore, we utilize a dueling network architecture, called AS-DDQN, to improve the performance of AS-DQN. Extensive experiments on real-world datasets offer insight into the performance of our model and show that it can outperform the baseline approaches.
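
The abstract does not spell out how AS-DDQN is built. As a minimal sketch, assuming the standard mean-subtracted dueling aggregation Q(s, a) = V(s) + A(s, a) - mean A(s, ·) rather than the authors' exact formulation, the step that combines a state value with per-action advantages could look as follows. The function name and the numbers are hypothetical, and the action sampling policy is not shown.

```python
def dueling_q_values(state_value, advantages):
    """Combine a scalar state value V(s) with per-action advantages A(s, a).

    Assumption: the common mean-subtracted dueling aggregation,
    Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a').
    This is not taken from the paper itself.
    """
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


# Hypothetical numbers: one taxi state with three candidate dispatch actions.
q_values = dueling_q_values(state_value=2.0, advantages=[1.0, 0.0, -1.0])

# Greedy choice: index of the action with the largest Q-value.
best_action = max(range(len(q_values)), key=q_values.__getitem__)
```

Subtracting the mean advantage keeps the value and advantage terms identifiable: adding a constant to every advantage leaves the resulting Q-values unchanged.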

Pages: 19
