Personalized route recommendation for ride-hailing with deep inverse reinforcement learning and real-time traffic conditions

被引:23
作者
Liu, Shan [1 ]
Jiang, Hai [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Ind Engn, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Ind Engn, South 530 Room,Shunde Bldg, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Personalized route recommendation; Inverse reinforcement learning; Dynamic environment; Ride-hailing; CHOICE MODEL; MULTICRITERIA; BEHAVIOR; NETWORK; PATH; PREDICTION;
D O I
10.1016/j.tre.2022.102780
中图分类号
F [经济];
学科分类号
02 ;
摘要
Personalized route recommendation aims to recommend routes based on users' route preference.The vast amount of GPS trajectories tracking driving behavior has made deep learning,especially inverse reinforcement learning (IRL), a popular choice for personalized route rec-ommendation. However, current IRL studies assume that the traffic condition is static andapproximate the expected state visitation frequencies to update the neural network. This studyimproves the IRL to recommend personalized routes considering real-time traffic conditions.We also improve the expected state visitation frequency calculation based on characteristics ofride-hailing and taxi trajectories to calculate the gradient of the neural network. In addition,the graph attention network is employed to capture the spatial dependencies between roadsegments. Numerical experiments using real ride-hailing trajectories in Chengdu, China validateour model. At last, a statistical test is conducted, and route preferences reflected by the samedriver's empty trajectories and occupied trajectories are found to have significant differences.
引用
收藏
页数:15
相关论文
共 73 条
[1]  
Aghasadeghi N, 2011, IEEE INT C INT ROBOT, P1561, DOI 10.1109/IROS.2011.6048804
[2]   Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning [J].
Alsaleh, Rushdi ;
Sayed, Tarek .
TRANSPORTATION RESEARCH PART F-TRAFFIC PSYCHOLOGY AND BEHAVIOUR, 2020, 70 :37-57
[3]  
[Anonymous], 2021, Global TimesJun 8
[4]  
[Anonymous], 2021, China Daily
[5]  
[Anonymous], 2011, Advances in Neural Information Processing Systems
[6]  
[Anonymous], 2021, PEOPLES DAILY ONLINE
[7]   A survey of inverse reinforcement learning: Challenges, methods and progress [J].
Arora, Saurabh ;
Doshi, Prashant .
ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
[8]  
Audiffren J, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P3315
[9]  
Blanco-Novoa O., 2018, PROCEEDINGS, V4, P26, DOI 10.3390/ecsa-5-05758
[10]   Where do cyclists ride? A route choice model developed with revealed preference GPS data [J].
Broach, Joseph ;
Dill, Jennifer ;
Gliebe, John .
TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2012, 46 (10) :1730-1740