Online model-based reinforcement learning for decision-making in long distance routes

Cited by: 2
Authors
Alcaraz, Juan J. [1 ]
Losilla, Fernando [1 ]
Caballero-Arnaldos, Luis [1 ]
Affiliations
[1] Tech Univ Cartagena UPCT, Dept Informat & Commun Technol, Cartagena, Spain
Keywords
Route scheduling; Reinforcement learning; Model predictive control; Monte Carlo tree search; VEHICLE-ROUTING PROBLEM; TIME WINDOWS; STOCHASTIC TRAVEL; OPTIMIZATION; FRAMEWORK; SERVICE;
DOI
10.1016/j.tre.2022.102790
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
In road transportation, long-distance routes require scheduled driving times, breaks, and rest periods, in compliance with the regulations on working conditions for truck drivers, while ensuring goods are delivered within the time windows of each customer. However, routes are subject to uncertain travel and service times, and incidents may cause additional delays, making predefined schedules ineffective in many real-life situations. This paper presents a reinforcement learning (RL) algorithm capable of making en-route decisions regarding driving times, breaks, and rest periods, under uncertain conditions. Our proposal aims at maximizing the likelihood of on-time delivery while complying with drivers' work regulations. We use an online model-based RL strategy that needs no prior training and is more flexible than model-free RL approaches, where the agent must be trained offline before making online decisions. Our proposal combines model predictive control with a rollout strategy and Monte Carlo tree search. At each decision stage, our algorithm anticipates the consequences of all the possible decisions in a number of future stages (the lookahead horizon), and then uses a base policy to generate a sequence of decisions beyond the lookahead horizon. This base policy could be, for example, a set of decision rules based on the experience and expertise of the transportation company covering the routes. Our numerical results show that the policy obtained using our algorithm outperforms not only the base policy (up to 83%), but also a policy obtained offline using deep Q networks (DQN), a state-of-the-art, model-free RL algorithm.
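The lookahead-plus-base-policy idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy route model, every constant, and the names `step`, `base_policy`, `rollout`, and `decide` are assumptions made for illustration, and the exhaustive enumeration of action sequences stands in for the Monte Carlo tree search used in the paper.

```python
import itertools
import random

# Toy route model. All values are illustrative assumptions, not from the paper.
SEGMENTS = 6          # route segments to complete
DEADLINE = 11.0       # hours until the delivery window closes
MAX_CONTINUOUS = 4.5  # hours of driving allowed without a break
BREAK_TIME = 0.75     # duration of one break, in hours
ACTIONS = ("drive", "break")

# State: (segments done, elapsed hours, hours driven since last break, compliant)

def is_terminal(state):
    done, clock, _, ok = state
    return done >= SEGMENTS or clock > DEADLINE or not ok

def success(state):
    """1.0 for an on-time, regulation-compliant arrival, else 0.0."""
    done, clock, _, ok = state
    return 1.0 if (done >= SEGMENTS and clock <= DEADLINE and ok) else 0.0

def step(state, action, rng):
    """Simulate one decision stage with an uncertain travel time."""
    done, clock, driven, ok = state
    if action == "break":
        return (done, clock + BREAK_TIME, 0.0, ok)
    travel = 1.0 + rng.expovariate(4.0)  # one hour plus a random delay
    driven += travel
    return (done + 1, clock + travel, driven, ok and driven <= MAX_CONTINUOUS)

def base_policy(state):
    """Hypothetical company rule: rest before the driving limit gets close."""
    return "break" if state[2] + 1.5 > MAX_CONTINUOUS else "drive"

def rollout(state, plan, rng):
    """Apply the lookahead plan, then follow the base policy to the end."""
    for action in plan:
        if is_terminal(state):
            break
        state = step(state, action, rng)
    while not is_terminal(state):
        state = step(state, base_policy(state), rng)
    return success(state)

def decide(state, horizon=2, n_sims=200, seed=0):
    """Return the first action of the best plan and its estimated success rate."""
    rng = random.Random(seed)
    best_plan, best_value = None, -1.0
    for plan in itertools.product(ACTIONS, repeat=horizon):
        value = sum(rollout(state, plan, rng) for _ in range(n_sims)) / n_sims
        if value > best_value:
            best_plan, best_value = plan, value
    return best_plan[0], best_value

# A driver who has already driven 4.4 hours without a break.
action, value = decide((0, 0.0, 4.4, True))
print(action, round(value, 2))
```

Only the first action of the best plan is executed; the procedure is repeated at the next decision stage with the newly observed state, which is what makes the scheme online and removes the need for prior training in this sketch.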
Pages: 21