Multi-agent Transfer Learning in Reinforcement Learning-based Ride-sharing Systems

Cited by: 2

Authors:

Castagna, Alberto [1]

Dusparic, Ivana [1]

Affiliations:

[1] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland

Source:

ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2 | 2022

Funding:

Science Foundation Ireland;

Keywords:

Reinforcement Learning; Ride-sharing; Transfer Learning; Multi-agent;

DOI:

10.5220/0010785200003116

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Reinforcement learning (RL) has been used in a range of simulated real-world tasks, e.g., sensor coordination, traffic light control, and on-demand mobility services. However, real-world deployments are rare, as RL struggles with the dynamic nature of real-world environments, requiring time to learn a task and to adapt to changes in the environment. Transfer Learning (TL) can help lower these adaptation times. In particular, there is significant potential for applying TL in multi-agent RL systems, where multiple agents can share knowledge with each other, as well as with new agents that join the system. To obtain the most from inter-agent transfer, transfer roles (i.e., determining which agents act as sources and which as targets), as well as relevant transfer content parameters (e.g., transfer size), should be selected dynamically in each particular situation. As a first step towards fully dynamic transfers, in this paper we investigate the impact of TL transfer parameters with fixed source and target roles. Specifically, we label every agent-environment interaction with the agent's epistemic confidence, and we filter the shared examples using varying threshold levels and sample sizes. We investigate the impact of these parameters in two scenarios: a standard predator-prey RL benchmark and a simulation of a ride-sharing system with 200 vehicle agents and 10,000 ride-requests.
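The filtering step described in the abstract (label each interaction with a confidence value, then share only a thresholded, size-limited sample) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Interaction` record, the `select_for_transfer` function, the assumption that confidence is a score in [0, 1] where higher means more confident, and the choice of uniform random sampling are all hypothetical and chosen only for illustration.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Interaction:
    """One agent-environment interaction (hypothetical record layout)."""
    state: Tuple[int, ...]
    action: int
    reward: float
    next_state: Tuple[int, ...]
    confidence: float  # assumed: score in [0, 1], higher = more confident


def select_for_transfer(
    buffer: List[Interaction],
    threshold: float,
    sample_size: int,
    seed: int = 0,
) -> List[Interaction]:
    """Keep interactions at or above the confidence threshold, then draw
    at most `sample_size` of them uniformly at random (an assumption)."""
    eligible = [x for x in buffer if x.confidence >= threshold]
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    k = min(sample_size, len(eligible))
    return rng.sample(eligible, k)


# Toy source-agent buffer with confidences 0.0, 0.1, ..., 0.9.
source_buffer = [
    Interaction(state=(i,), action=0, reward=0.0, next_state=(i + 1,),
                confidence=i / 10)
    for i in range(10)
]

shared = select_for_transfer(source_buffer, threshold=0.5, sample_size=3)
```

Varying `threshold` and `sample_size` in a sketch like this corresponds to the two transfer parameters the abstract says are studied; how confidence is actually estimated and how the target agent consumes the shared examples are outside what the abstract specifies.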

Citation

Pages: 120-130

Page count: 11

28 references in total

[1] DeepPool: Distributed Model-Free Algorithm for Ride-Sharing Using Deep Reinforcement Learning [J]. Al-Abbasi, Abubakr O.; Ghosh, Arnob; Aggarwal, Vaneet. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (12): 4714-4727

[2] [Anonymous], 2013, INT C AUTONOMOUS AGE

[3] Burda Yuri, 2018, Exploration by random network distillation

[4] Castagna A., 2021, AI COMMUN, P1

[5] Castagna A., 2020, 11 INT WORKSH AG TRA

[6] Chevalier-Boisvert Maxime, 2018, Minimalistic grid-world environment for openai gym

[7] Da Silva FL, 2020, AAAI CONF ARTIF INTE, V34, P5792

[8] de la Cruz G. V., 2019, PRE TRAINING NEURAL

[9] Learning to Teach Reinforcement Learning Agents [J]. Fachantidis, Anestis; Taylor, Matthew; Vlahavas, Ioannis. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2019, 1 (01): 21-42

[10] Fagnant D.J., 2016, TRANSPORTATION, V45
