Deep hierarchical reinforcement learning for collaborative object transportation by heterogeneous agents

被引:2
|
作者
Hasan, Maram [1 ]
Niyogi, Rajdeep [1 ]
机构
[1] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee 247667, Uttaranchal, India
关键词
Multi-agent systems; Hierarchical reinforcement learning; Warehouse management; Sparse rewards; Curiosity-driven intrinsic rewards;
D O I
10.1016/j.compeleceng.2023.109066
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the logistics and supply chain domain, coordinated efforts among agents play a pivotal role, particularly in the context of collaborative object transportation within a warehouse. This paper addresses the multifaceted challenge of multi -agent coordination in warehouse environments characterized by sparse reward structures, where the ability to communicate among agents may be limited or infeasible. Due to various constraints such as power limitations, weight capacity, or specialized abilities, the individual execution of this task by a single agent remains unattainable. Our study focuses on heterogeneous agents, where each agent possesses a distinct subset of skills and capabilities. Our research examines the emergence of cooperative behavior among groups of agents with the requisite skill sets, aiming to accomplish the task without explicit inter -agent communication or prior coordination. To encourage implicit agent coordination, we introduce a hierarchical approach integrating a global evaluation of abstract actions with curiosity -driven intrinsic learning. This approach is well -suited for real -world settings with scarce rewards. We evaluated its effectiveness in a warehouse domain, and the results show that our approach consistently achieves higher average returns, faster convergence, and improved exploration efficiency, highlighting its effectiveness in diverse scenarios.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Collaborative hunting in artificial agents with deep reinforcement learning
    Tsutsui, Kazushi
    Tanaka, Ryoya
    Takeda, Kazuya
    Fujii, Keisuke
    ELIFE, 2024, 13
  • [2] Collaborative training of heterogeneous reinforcement learning agents in environments with sparse rewards: what and when to share?
    Alain Andres
    Esther Villar-Rodriguez
    Javier Del Ser
    Neural Computing and Applications, 2023, 35 : 16753 - 16780
  • [3] Collaborative training of heterogeneous reinforcement learning agents in environments with sparse rewards: what and when to share?
    Andres, Alain
    Villar-Rodriguez, Esther
    Del Ser, Javier
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (23) : 16753 - 16780
  • [4] Graph Convolutional Reinforcement Learning for Collaborative Queuing Agents
    Fawaz, Hassan
    Lesca, Julien
    Quang, Pham Tran Anh
    Leguay, Jeremie
    Zeghlache, Djamal
    Medagliani, Paolo
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02): : 1363 - 1377
  • [5] Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
    Haydari, Ammar
    Yilmaz, Yasin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 11 - 32
  • [6] Goal Modelling for Deep Reinforcement Learning Agents
    Leung, Jonathan
    Shen, Zhiqi
    Zeng, Zhiwei
    Miao, Chunyan
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 271 - 286
  • [7] Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
    Li, Xiaodong
    Wu, Pangjing
    Zou, Chenxin
    Li, Qing
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (03) : 288 - 300
  • [8] Deep Learning and Hierarchical Reinforcement Learning for modeling a Conversational Recommender System
    Basile, Pierpaolo
    Greco, Claudio
    Suglia, Alessandro
    Semeraro, Giovanni
    INTELLIGENZA ARTIFICIALE, 2018, 12 (02) : 125 - 141
  • [9] Object-Oriented Representation and Hierarchical Reinforcement Learning in Infinite Mario
    Joshi, Mandar
    Khobragade, Rakesh
    Sarda, Saurabh
    Deshpande, Umesh
    Mohan, Shiwali
    2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 1076 - 1081
  • [10] Coordinated behavior of cooperative agents using deep reinforcement learning
    Diallo, Elhadji Amadou Oury
    Sugiyama, Ayumi
    Sugawara, Toshiharu
    NEUROCOMPUTING, 2020, 396 : 230 - 240