Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

被引：0

作者：

Zhicong Zhang

Li Zheng

Michael X. Weng

机构：

[1] Tsinghua University,Department of Industrial Engineering

[2] University of South Florida,Department of Industrial and Management Systems Engineering

来源：

The International Journal of Advanced Manufacturing Technology | 2007年 / 34卷

关键词：

Scheduling; Parallel machine; Reinforcement learning; Q-Learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependant setup times and machine–job qualification consideration. To apply the Q-Learning algorithm, we convert the scheduling problem into reinforcement learning problems by constructing a semi-Markov decision process (SMDP), including the definition of state representation, actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove the equivalence of the reward function and the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. Experiment results show that Q-Learning always outperforms all heuristics remarkably. Averaged over all test problems, the Q-Learning algorithm achieved performance improvements over WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by considerable amounts of 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.

引用

页码：968 / 980

页数：12

共 50 条

[1] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
Zhang, Zhicong
Zheng, Li
Weng, Michael X.
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10): : 968 - 980
[2] Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
Zhang, Zhicong
Zheng, Li
Li, Na
Wang, Weiping
Zhong, Shouyan
Hu, Kaishun
COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (07) : 1315 - 1324
[3] Dynamic single machine scheduling using Q-learning agent
Kong, LF
Wu, J
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 3237 - 3241
[4] Dynamic Parallel Machine Scheduling Using the Learning Agent
Yuan, Biao
Wang, Lei
Jiang, Zhibin
2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013, : 1565 - 1569
[5] Comparisons of metaheuristic algorithms for unrelated parallel machine weighted earliness/tardiness scheduling problems
Oğuzhan Ahmet Arık
Evolutionary Intelligence, 2020, 13 : 415 - 425
[6] Comparisons of metaheuristic algorithms for unrelated parallel machine weighted earliness/tardiness scheduling problems
Arik, Oguzhan Ahmet
EVOLUTIONARY INTELLIGENCE, 2020, 13 (03) : 415 - 425
[7] Weighted earliness/tardiness parallel machine scheduling problem with a common due date
Arik, Oguzhan Ahmet
Schutten, Marco
Topan, Engin
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
[8] Scheduling unrelated parallel machines to minimize total weighted tardiness
Na, Dong-Gil
Kim, Dong-Won
Jang, Wooseung
Chen, F. Frank
2006 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS (SOLI 2006), PROCEEDINGS, 2006, : 758 - +
[9] Dynamic parallel machine scheduling with random breakdowns using the learning agent
Yuan B.
Jiang Z.
Wang L.
Jiang, Zhibin (zbjiang@sjtu.edu.cn), 2016, Inderscience Enterprises Ltd. (08) : 94 - 103
[10] Parallel Algorithm with Blocks for a Single-Machine Total Weighted Tardiness Scheduling Problem
Uchronski, Mariusz
APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 17

← 1 2 3 4 5 →