Deep hierarchical reinforcement learning for collaborative object transportation by heterogeneous agents

被引：2

作者：

Hasan, Maram ^{[1
]}

Niyogi, Rajdeep ^{[1
]}

机构：

[1] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee 247667, Uttaranchal, India

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2024年 / 114卷

关键词：

Multi-agent systems; Hierarchical reinforcement learning; Warehouse management; Sparse rewards; Curiosity-driven intrinsic rewards;

D O I：

10.1016/j.compeleceng.2023.109066

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the logistics and supply chain domain, coordinated efforts among agents play a pivotal role, particularly in the context of collaborative object transportation within a warehouse. This paper addresses the multifaceted challenge of multi -agent coordination in warehouse environments characterized by sparse reward structures, where the ability to communicate among agents may be limited or infeasible. Due to various constraints such as power limitations, weight capacity, or specialized abilities, the individual execution of this task by a single agent remains unattainable. Our study focuses on heterogeneous agents, where each agent possesses a distinct subset of skills and capabilities. Our research examines the emergence of cooperative behavior among groups of agents with the requisite skill sets, aiming to accomplish the task without explicit inter -agent communication or prior coordination. To encourage implicit agent coordination, we introduce a hierarchical approach integrating a global evaluation of abstract actions with curiosity -driven intrinsic learning. This approach is well -suited for real -world settings with scarce rewards. We evaluated its effectiveness in a warehouse domain, and the results show that our approach consistently achieves higher average returns, faster convergence, and improved exploration efficiency, highlighting its effectiveness in diverse scenarios.

引用

页数：16

共 50 条

[1] Collaborative hunting in artificial agents with deep reinforcement learning
Tsutsui, Kazushi
Tanaka, Ryoya
Takeda, Kazuya
Fujii, Keisuke
ELIFE, 2024, 13
[2] Collaborative training of heterogeneous reinforcement learning agents in environments with sparse rewards: what and when to share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
Neural Computing and Applications, 2023, 35 : 16753 - 16780
[3] Collaborative training of heterogeneous reinforcement learning agents in environments with sparse rewards: what and when to share?
Andres, Alain
Villar-Rodriguez, Esther
Del Ser, Javier
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (23) : 16753 - 16780
[4] Graph Convolutional Reinforcement Learning for Collaborative Queuing Agents
Fawaz, Hassan
Lesca, Julien
Quang, Pham Tran Anh
Leguay, Jeremie
Zeghlache, Djamal
Medagliani, Paolo
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02): : 1363 - 1377
[5] Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Haydari, Ammar
Yilmaz, Yasin
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) : 11 - 32
[6] Goal Modelling for Deep Reinforcement Learning Agents
Leung, Jonathan
Shen, Zhiqi
Zeng, Zhiwei
Miao, Chunyan
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 271 - 286
[7] Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
Li, Xiaodong
Wu, Pangjing
Zou, Chenxin
Li, Qing
IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (03) : 288 - 300
[8] Deep Learning and Hierarchical Reinforcement Learning for modeling a Conversational Recommender System
Basile, Pierpaolo
Greco, Claudio
Suglia, Alessandro
Semeraro, Giovanni
INTELLIGENZA ARTIFICIALE, 2018, 12 (02) : 125 - 141
[9] Object-Oriented Representation and Hierarchical Reinforcement Learning in Infinite Mario
Joshi, Mandar
Khobragade, Rakesh
Sarda, Saurabh
Deshpande, Umesh
Mohan, Shiwali
2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 1076 - 1081
[10] Coordinated behavior of cooperative agents using deep reinforcement learning
Diallo, Elhadji Amadou Oury
Sugiyama, Ayumi
Sugawara, Toshiharu
NEUROCOMPUTING, 2020, 396 : 230 - 240

← 1 2 3 4 5 →