A deep inverse reinforcement learning approach to route choice modeling with context-dependent rewards

被引:22
|
作者
Zhao, Zhan [1 ,2 ]
Liang, Yuebing [1 ]
机构
[1] Univ Hong Kong, Dept Urban Planning & Design, Hong Kong, Peoples R China
[2] Univ Hong Kong, Musketeers Fdn Inst Data Sci, Hong Kong, Peoples R China
关键词
Route choice modeling; Inverse reinforcement learning; Deep neural networks; Travel behavior; Trajectory data mining; RECURSIVE LOGIT MODEL;
D O I
10.1016/j.trc.2023.104079
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Route choice modeling is a fundamental task in transportation planning and demand forecasting. Classical methods generally adopt the discrete choice model (DCM) framework with linear utility functions and high-level route characteristics. While several recent studies have started to explore the applicability of deep learning for route choice modeling, they are all path-based with relatively simple model architectures and require choice set generation. Existing link-based models can capture the dynamic nature of link choices within the trip without the need for choice set generation, but still assume linear relationships and link-additive features. To address these issues, this study proposes a general deep inverse reinforcement learning (IRL) framework for link-based route choice modeling, which is capable of incorporating diverse features (of the state, action and trip context) and capturing complex relationships. Specifically, we adapt an adversarial IRL model to the route choice problem for efficient estimation of context-dependent reward functions without value iteration. Experiment results based on taxi GPS data from Shanghai, China validate the superior prediction performance of the proposed model over conventional DCMs and other imitation learning baselines, even for destinations unseen in the training data. Further analysis shows that the model exhibits competitive computational efficiency and reasonable interpretability. The proposed methodology provides a new direction for future development of route choice models. It is general and can be adaptable to other route choice problems across different modes and networks.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Context-dependent reproductive site choice in a Neotropical frog
    Murphy, PJ
    BEHAVIORAL ECOLOGY, 2003, 14 (05) : 626 - 633
  • [42] Context-dependent genetic benefits from mate choice
    Qvarnström, A
    TRENDS IN ECOLOGY & EVOLUTION, 2001, 16 (01) : 5 - 7
  • [43] User Equilibrium Analysis Considering Travelers' Context-Dependent Route Choice Behavior on the Risky Traffic Network
    Xu, Qinghui
    Ji, Xiangfeng
    SUSTAINABILITY, 2020, 12 (17)
  • [44] Deep Reinforcement Learning with Copy-oriented Context Awareness andWeighted Rewards for Abstractive Summarization
    Tan, Caidong
    2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 84 - 89
  • [45] Deep Reinforcement Learning For SPORADIC Rewards With HUMAN Experience
    Sinha, Harshit
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [46] CONTEXT-DEPENDENT PRONUNCIATION MODELING FOR IRAQI ASR
    Tsakalidis, Stavros
    Prasad, Rohit
    Natarajan, Prem
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4457 - 4460
  • [47] Preference Modeling with Context-Dependent Salient Features
    Bower, Amanda
    Balzano, Laura
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [48] A modelling of route choice behaviour in transportation networks: an approach from reinforcement learning
    Miyagi, T
    URBAN TRANSPORT X: URBAN TRANSPORT AND THE ENVIRONMENT IN THE 21ST CENTURY, 2004, 16 : 235 - 244
  • [49] Context-dependent influence of road attributes and pricing policies on route choice behavior of truck drivers: results of a conjoint choice experiment
    Arentze, Theo
    Feng, Tao
    Timmermans, Harry
    Robroeks, Jops
    TRANSPORTATION, 2012, 39 (06) : 1173 - 1188
  • [50] Preference Modeling with Context-Dependent Salient Features
    Bower, Amanda
    Balzano, Laura
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119