Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

Cited by: 4

Authors:

Mak, Stephen [1,4]

Xu, Liming [1]

Pearce, Tim [2,5]

Ostroumov, Michael [3]

Brintrup, Alexandra [1]

Affiliations:

[1] Univ Cambridge, Inst Mfg, Dept Engn, Cambridge, England

[2] Microsoft Res Cambridge, Cambridge, England

[3] Value Chain Lab, London, England

[4] 17 Charles Babbage Rd, Cambridge CB3 0FS, England

[5] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

Source:

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2023 / Volume 157

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Collaborative vehicle routing; Deep multi-agent reinforcement learning; Negotiation; Gain sharing; Multi-agent systems; Machine learning; HORIZONTAL COOPERATION; ALLOCATION; LEVEL; COST; GAME;

DOI:

10.1016/j.trc.2023.104376

Chinese Library Classification (CLC) number:

U [Transportation];

Discipline classification code:

08 ; 0823 ;

Abstract:

Collaborative vehicle routing occurs when carriers collaborate through sharing their transportation requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents. This would require solving the vehicle routing problem (NP-hard) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning, where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function; thus, when deployed in production, we only need to evaluate the expensive post-collaboration vehicle routing problem once. Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function. Through decentralised machine learning, our agents bargain with each other and agree to outcomes that correlate well with the Shapley value - a fair profit allocation mechanism. Importantly, we are able to achieve a reduction in run-time of 88%.
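
To make the cost argument in the abstract concrete, the following is a minimal sketch of an exact Shapley value computation by coalition enumeration. It is not the authors' method (the paper avoids exactly this enumeration); the function name `shapley_values`, the carrier labels, and the `TOY_GAIN` table are hypothetical and chosen only for illustration. In the setting described above, each lookup in the table would instead require solving a vehicle routing problem, so the `2**n` evaluations counted here are what becomes prohibitive as the number of carriers grows.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values by enumerating all coalitions.

    `value` maps a frozenset of players to that coalition's worth.
    Returns (allocation dict, number of characteristic-function evaluations).
    """
    n = len(players)

    # Evaluate the characteristic function once per coalition: 2**n calls.
    worth = {}
    for size in range(n + 1):
        for coalition in combinations(players, size):
            key = frozenset(coalition)
            worth[key] = value(key)

    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for size in range(n):
            # Probability weight of p joining a coalition of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for coalition in combinations(others, size):
                s = frozenset(coalition)
                # Marginal contribution of p to coalition s.
                total += weight * (worth[s | {p}] - worth[s])
        phi[p] = total
    return phi, len(worth)


# Hypothetical collaboration gains for three carriers (illustrative numbers only).
TOY_GAIN = {
    frozenset(): 0.0,
    frozenset({"A"}): 0.0,
    frozenset({"B"}): 0.0,
    frozenset({"C"}): 0.0,
    frozenset({"A", "B"}): 4.0,
    frozenset({"A", "C"}): 2.0,
    frozenset({"B", "C"}): 2.0,
    frozenset({"A", "B", "C"}): 6.0,
}

phi, evaluations = shapley_values(["A", "B", "C"], TOY_GAIN.__getitem__)
print(phi, evaluations)
```

In this toy game the allocations sum to the grand-coalition gain, and carriers A and B, which contribute symmetrically, receive equal shares. With three carriers the characteristic function is evaluated 8 times; with 20 carriers the same routine would need over a million evaluations.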


Pages: 25
