A new multi-domain cooperative resource scheduling method using proximal policy optimization

被引：0

作者：

Liu, Haiying ^{[1
,4
]}

He, Zhaoyi ^{[2
]}

Wang, Rui ^{[3
,5
]}

Huang, Kuihua ^{[3
]}

Cheng, Guangquan ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China

[2] Nanjing Res Inst Elect Engn, Nanjing 210007, Peoples R China

[3] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China

[4] Nanjing Ctr Appl Math, Nanjing 211135, Jiangsu, Peoples R China

[5] Xiangjiang Lab, Changsha 410205, Hunan, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 09期

关键词：

Multi-domain cooperative; Resource scheduling; Deep reinforcement learning; Proximal policy optimization; Timing constraints; SHOP; ALGORITHM; HYBRID;

D O I：

10.1007/s00521-023-09326-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For the complex environment and massive multi-source data, the capability of multi-domain cooperative resource scheduling has become extremely important. Optimal scheduling can reduce operating costs and time, and MDLS is still the most commonly utilized algorithm in combat task scheduling today, despite of its defects. This research provides a plausible new method for the MDCRS problem, a resource scheduling method based on deep reinforcement learning (DRL), which has proven to be effective for other scheduling problems. Aiming at the resource scheduling problem in the multi-domain cooperative operation, under timing constraints, an MDCRS model is created using the shortest completion time as the objective function. On this premise, this paper presents an MDCRS-MDP model based on Markov decision processes, in which a two-dimensional action space that can simultaneously allocate action and match platform is designed and a dense reward function with strong connections to the criterion for sparse makespan minimization is provided. A resource scheduling approach utilizing DRL is proposed, including task-platform matching and task sequencing, based on the MDCRS-MDP model. Finally, combined with the joint landing operation, the experimental results verify the effectiveness of the proposed method for solving MDCRS and demonstrate the significant advantages over traditional dispatching rules and meta-heuristic optimization algorithms.

引用

页码：4931 / 4945

页数：15

共 30 条

[1] A hybrid projection method for resource-constrained project scheduling problem under uncertainty [J].

Aramesh, Saeed ;

Aickelin, Uwe ;

Khorshidi, Hadi Akbarzadeh .

NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17) :14557-14576

[2] Cost-aware job scheduling for cloud inutances using deep reinforcement learning [J].

Cheng, Feng ;

Huang, Yifeng ;

Tanpure, Bhavana ;

Sawalani, Pawan ;

Cheng, Long ;

Liu, Cong .

CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (01) :619-631

[3] Hybrid of human learning optimization algorithm and particle swarm optimization algorithm with scheduling strategies for the flexible job-shop scheduling problem [J].

Ding, Haojie ;

Gu, Xingsheng .

NEUROCOMPUTING, 2020, 414 (414) :313-332

[4]

Fu ZY, 2019, PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), P1944, DOI [10.1109/ITNEC.2019.8729238, 10.1109/itnec.2019.8729238]

[5] Dynamic scheduling of heterogeneous resources across mobile edge-cloud continuum using fruit fly-based simulated annealing optimization scheme [J].

Gabi, Danlami ;

Dankolo, Nasiru Muhammad ;

Muslim, Abubakar Atiku ;

Abraham, Ajith ;

Joda, Muhammad Usman ;

Zainal, Anazida ;

Zakaria, Zalmiyah .

NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16) :14085-14105

[6] Scheduling job shop associated with multiple routings with genetic and ant colony heuristics [J].

Girish, B. S. ;

Jawahar, N. .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2009, 47 (14) :3891-3917

[7] Research on Adaptive Job Shop Scheduling Problems Based on Dueling Double DQN [J].

Han, Bao-An ;

Yang, Jian-Jun .

IEEE ACCESS, 2020, 8 :186474-186495

[8] An Optimization-Based Distributed Planning Algorithm: A Blackboard-Based Collaborative Framework [J].

Han, Xu ;

Mandal, Suvasri ;

Pattipati, Krishna R. ;

Kleinman, David L. ;

Mishra, Manisha .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 44 (06) :673-686

[9]

Jieyong Zhang, 2018, Journal of Physics: Conference Series, V1060, DOI [10.1088/1742-6596/1060/1/012051, 10.1088/1742-6596/1060/1/012051]

[10] An Improved Evolution Strategy Hybridization With Simulated Annealing for Permutation Flow Shop Scheduling Problems [J].

Khurshid, Bilal ;

Maqsood, Shahid ;

Omair, Muhammad ;

Sarkar, Biswajit ;

Ahmad, Imran ;

Muhammad, Khan .

IEEE ACCESS, 2021, 9 :94505-94522

← 1 2 3 →