Multi-agent Transfer Learning in Reinforcement Learning-based Ride-sharing Systems

被引:2
|
作者
Castagna, Alberto [1 ]
Dusparic, Ivana [1 ]
机构
[1] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
Reinforcement Learning; Ride-sharing; Transfer Learning; Multi-agent;
D O I
10.5220/0010785200003116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has been used in a range of simulated real-world tasks, e.g., sensor coordination, traffic light control, and on-demand mobility services. However, real world deployments are rare, as RL struggles with dynamic nature of real world environments, requiring time for learning a task and adapting to changes in the environment. Transfer Learning (TL) can help lower these adaptation times. In particular, there is a significant potential of applying TL in multi-agent RL systems, where multiple agents can share knowledge with each other, as well as with new agents that join the system. To obtain the most from inter-agent transfer, transfer roles (i.e., determining which agents act as sources and which as targets), as well as relevant transfer content parameters (e.g., transfer size) should be selected dynamically in each particular situation. As a first step towards fully dynamic transfers, in this paper we investigate the impact of TL transfer parameters with fixed source and target roles. Specifically, we label every agent-environment interaction with agent's epistemic confidence, and we filter the shared examples using varying threshold levels and sample sizes. We investigate impact of these parameters in two scenarios, a standard predator-prey RL benchmark and a simulation of a ride-sharing system with 200 vehicle agents and 10,000 ride-requests.
引用
收藏
页码:120 / 130
页数:11
相关论文
共 50 条
  • [1] Multi-agent Service Area Adaptation for Ride-Sharing Using Deep Reinforcement Learning
    Yoshida, Naoki
    Noda, Itsuki
    Sugawara, Toshiharu
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2020, 12092 LNAI : 363 - 375
  • [2] Optimization of ride-sharing with passenger transfer via deep reinforcement learning
    Wang, Dujuan
    Wang, Qi
    Yin, Yunqiang
    Cheng, T. C. E.
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2023, 172
  • [3] Deep Reinforcement Learning for Ride-sharing Dispatching and Repositioning
    Qin, Zhiwei
    Tang, Xiaocheng
    Jiao, Yan
    Zhang, Fan
    Wang, Chenxi
    Li, Qun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6566 - 6568
  • [4] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [5] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551
  • [6] Learning to Delay in Ride-Sourcing Systems: A Multi-Agent Deep Reinforcement Learning Framework
    Ke, Jintao
    Xiao, Feng
    Yang, Hai
    Ye, Jieping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2280 - 2292
  • [7] Multi-Agent Reinforcement Learning-Based User Pairing in Multi-Carrier NOMA Systems
    Wang, Shaoyang
    Lv, Tiejun
    Zhang, Xuewei
    2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2019,
  • [8] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    New Generation Computing, 2001, 19 : 157 - 172
  • [9] Rationality of reward sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    NEW GENERATION COMPUTING, 2001, 19 (02) : 157 - 172
  • [10] A Multi-agent Reinforcement Learning with Weighted Experience Sharing
    Yu, Lasheng
    Abdulai, Issahaku
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 219 - 225