Spatio-Temporal Capsule-Based Reinforcement Learning for Mobility-on-Demand Coordination

被引:17
|
作者
He, Suining [1 ]
Shin, Kang G. [2 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
关键词
Mobility-on-demand; ride-sharing platform; human and vehicle mobility; coordination; smart transportation; reinforcement learning; spatio-temporal capsule network; smart city; OPTIMIZATION; TAXI;
D O I
10.1109/TKDE.2020.2992565
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an alternative means of convenient and smart transportation, mobility-on-demand (MOD), typified by online ride-sharing and connected taxicabs, has been rapidly growing and spreading worldwide. The large volume of complex traffic and the uncertainty of market supplies/demands have made it essential for many MOD service providers to proactively dispatch vehicles towards ride-seekers. To meet this need effectively, we propose STRide, an MOD coordination learning mechanism reinforced spatio-temporally with capsules. We formalize the adaptive coordination of vehicles into a reinforcement learning framework. STRide incorporates spatial and temporal distributions of supplies (vehicles) and demands (ride requests), customers' preferences and other external factors. A novel spatio-temporal capsule neural network is designed to predict the provider's rewards based on MOD network states, vehicles and their dispatch actions. This way, the MOD platform adapts itself to the supply-demand dynamics with the best potential rewards. We have conducted extensive data analytics and experimental evaluation with five large-scale datasets (similar to 27 million rides from Uber, NYC/Chicago Taxis, Didi and Car2Go). STRide is shown to outperform state-of-the-arts, substantially reducing request-rejection rate and passenger waiting time, and also increasing the service provider's profits.
引用
收藏
页码:1446 / 1461
页数:16
相关论文
共 50 条
  • [1] Spatio-Temporal Capsule-based Reinforcement Learning for Mobility-on-Demand Network Coordination
    He, Suining
    Shin, Kang G.
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2806 - 2813
  • [2] Spatio-temporal Adaptive Pricing for Balancing Mobility-on-Demand Networks
    He, Suining
    Shin, Kang G.
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2019, 10 (04)
  • [3] Rebalancing Shared Mobility-on-Demand Systems: a Reinforcement Learning Approach
    Wen, Jian
    Zhao, Jinhua
    Jaillet, Patrick
    2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [4] A capsule-based reinforcement learning framework for supply-demand matching in mobile crowdsourcing
    Li, Yiping
    He, Wei
    Xu, Yonghui
    Guo, Wei
    Feng, Zhen
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [5] Joint Quantum Reinforcement Learning and Stabilized Control for Spatio-Temporal Coordination in Metaverse
    Park, Soohyun
    Chung, Jaehyun
    Park, Chanyoung
    Jung, Soyi
    Choi, Minseok
    Cho, Sungrae
    Kim, Joongheon
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 12410 - 12427
  • [6] Hierarchical Coordination Multi-Agent Reinforcement Learning With Spatio-Temporal Abstraction
    Ma, Tinghuai
    Peng, Kexing
    Rong, Huan
    Qian, Yurong
    Al-Nabhan, Najla
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 533 - 547
  • [7] Reinforcement learning-based estimation for spatio-temporal systems
    Mowlavi, Saviz
    Benosman, Mouhacine
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [8] Graph Meta-Reinforcement Learning for Transferable Autonomous Mobility-on-Demand
    Gammelli, Daniele
    Yang, Kaidi
    Harrison, James
    Rodrigues, Filipe
    Pereira, Francisco
    Pavone, Marco
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2913 - 2923
  • [9] Predicting Citywide Passenger Demand via Reinforcement Learning from Spatio-Temporal Dynamics
    Ning, Xiaodong
    Yao, Lina
    Wang, Xianzhi
    Benatallah, Boualem
    Salim, Flora
    Haghighi, Pari Delir
    PROCEEDINGS OF THE 15TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2018), 2018, : 19 - 28
  • [10] Parallel Computing of Spatio-Temporal Model Based on Deep Reinforcement Learning
    Lv, Zhiqiang
    Li, Jianbo
    Xu, Zhihao
    Wang, Yue
    Li, Haoran
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 391 - 403