Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

被引:4
|
作者
Mak, Stephen [1 ,4 ]
Xu, Liming [1 ]
Pearce, Tim [2 ,5 ]
Ostroumov, Michael [3 ]
Brintrup, Alexandra [1 ]
机构
[1] Univ Cambridge, Inst Mfg, Dept Engn, Cambridge, England
[2] Microsoft Res Cambridge, Cambridge, England
[3] Value Chain Lab, London, England
[4] 17 Charles Babbage Rd, Cambridge CB3 0FS, England
[5] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Collaborative vehicle routing; Deep multi-agent reinforcement learning; Negotiation; Gain sharing; Multi-agent systems; Machine learning; HORIZONTAL COOPERATION; ALLOCATION; LEVEL; COST; GAME;
D O I
10.1016/j.trc.2023.104376
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Collaborative vehicle routing occurs when carriers collaborate through sharing their transporta-tion requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents. This would require solving the vehicle routing problem (NP-hard) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning, where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function; thus, when deployed in production, we only need to evaluate the expensive post-collaboration vehicle routing problem once. Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function. Through decentralised machine learning, our agents bargain with each other and agree to outcomes that correlate well with the Shapley value - a fair profit allocation mechanism. Importantly, we are able to achieve a reduction in run-time of 88%.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Deep Multi-agent Reinforcement Learning in a Homogeneous Open Population
    Radulescu, Roxana
    Legrand, Manon
    Efthymiadis, Kyriakos
    Roijers, Diederik M.
    Nowe, Ann
    ARTIFICIAL INTELLIGENCE, BNAIC 2018, 2019, 1021 : 90 - 105
  • [22] A Multi-Agent Deep Reinforcement Learning Approach for Practical Decentralized UAV Collision Avoidance
    Thumiger, Nicholas
    Deghat, Mohammad
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2174 - 2179
  • [23] Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
    Naderializadeh, Navid
    Sydir, Jaroslaw
    Simsek, Meryem
    Nikopour, Hosein
    PROCEEDINGS OF THE 21ST IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC2020), 2020,
  • [24] Multi-Agent Deep Reinforcement Learning for Distributed Load Restoration
    Linh Vu
    Tuyen Vu
    Thanh Long Vu
    Srivastava, Anurag
    IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (02) : 1749 - 1760
  • [25] Multi-Agent Reinforcement Learning for Highway Platooning
    Kolat, Mate
    Becsi, Tamas
    ELECTRONICS, 2023, 12 (24)
  • [26] Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
    Naderializadeh, Navid
    Sydir, Jaroslaw J.
    Simsek, Meryem
    Nikopour, Hosein
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (06) : 3507 - 3523
  • [27] A Multi-agent Approach to the Dynamic Vehicle Routing Problem with Time Windows
    Barbucha, Dariusz
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 467 - 476
  • [28] A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation
    Koratamaddi, Prahlad
    Wadhwani, Karan
    Gupta, Mridul
    Sanjeevi, Sriram G.
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 410 - 410
  • [29] A Sample Efficient Multi-Agent Approach to Continuous Reinforcement Learning
    Corcoran, Diarmuid
    Kreuger, Per
    Boman, Magnus
    2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 338 - 344
  • [30] Urban Traffic Control in Software Defined Internet of Things via a Multi-Agent Deep Reinforcement Learning Approach
    Yang, Jiachen
    Zhang, Jipeng
    Wang, Huihui
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3742 - 3754