Spatio-Temporal Capsule-Based Reinforcement Learning for Mobility-on-Demand Coordination

被引：18

作者：

He, Suining ^{[1
]}

Shin, Kang G. ^{[2
]}

机构：

[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA

[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2022年 / 34卷 / 03期

关键词：

Mobility-on-demand; ride-sharing platform; human and vehicle mobility; coordination; smart transportation; reinforcement learning; spatio-temporal capsule network; smart city; OPTIMIZATION; TAXI;

D O I：

10.1109/TKDE.2020.2992565

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As an alternative means of convenient and smart transportation, mobility-on-demand (MOD), typified by online ride-sharing and connected taxicabs, has been rapidly growing and spreading worldwide. The large volume of complex traffic and the uncertainty of market supplies/demands have made it essential for many MOD service providers to proactively dispatch vehicles towards ride-seekers. To meet this need effectively, we propose STRide, an MOD coordination learning mechanism reinforced spatio-temporally with capsules. We formalize the adaptive coordination of vehicles into a reinforcement learning framework. STRide incorporates spatial and temporal distributions of supplies (vehicles) and demands (ride requests), customers' preferences and other external factors. A novel spatio-temporal capsule neural network is designed to predict the provider's rewards based on MOD network states, vehicles and their dispatch actions. This way, the MOD platform adapts itself to the supply-demand dynamics with the best potential rewards. We have conducted extensive data analytics and experimental evaluation with five large-scale datasets (similar to 27 million rides from Uber, NYC/Chicago Taxis, Didi and Car2Go). STRide is shown to outperform state-of-the-arts, substantially reducing request-rejection rate and passenger waiting time, and also increasing the service provider's profits.

引用

页码：1446 / 1461

页数：16

共 59 条

[1] Optimization for dynamic ride-sharing: A review [J].

Agatz, Niels ;

Erera, Alan ;

Savelsbergh, Martin ;

Wang, Xing .

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 223 (02) :295-303

[2]

[Anonymous], 2009, PROC 18 WWW, DOI DOI 10.1145/1526709.1526816

[3]

[Anonymous], 2019, DRIVERS REPORT 80 PL

[4] Dynamic Intersections and Self-Driving Vehicles [J].

Aoki, Shunsuke ;

Rajkumar, Ragunathan .

2018 9TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2018), 2018, :320-330

[5] Spatial Pricing in Ride-Sharing Networks [J].

Bimpikis, Kostas ;

Candogan, Ozan ;

Saban, Daniela .

OPERATIONS RESEARCH, 2019, 67 (03) :744-769

[6] Reinforcement Mechanism Design for e-commerce [J].

Cai, Qingpeng ;

Filos-Ratsikas, Aris ;

Tang, Pingzhong ;

Zhang, Yiwei .

WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, :1339-1348

[7]

Campbell H., 2017, RSG 2017 SURVEY RESU

[8] Free floating electric car sharing design: Data driven optimisation [J].

Cocca, Michele ;

Giordano, Danilo ;

Mellia, Marco ;

Vassio, Luca .

PERVASIVE AND MOBILE COMPUTING, 2019, 55 :59-75

[9] The role of urban mobility in retail business survival [J].

D’Silva, Krittika ;

Jayarajah, Kasthuri ;

Noulas, Anastasios ;

Mascolo, Cecilia ;

Misra, Archan .

Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2018, 2 (03)

[10]

Dovey R., 2017, 5 FLORIDA CITIES TEA

← 1 2 3 4 5 6 →