Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

Authors
Zhicong Zhang
Li Zheng
Michael X. Weng
Affiliations
[1] Tsinghua University,Department of Industrial Engineering
[2] University of South Florida,Department of Industrial and Management Systems Engineering
Source
The International Journal of Advanced Manufacturing Technology | 2007 / Vol. 34
Keywords
Scheduling; Parallel machine; Reinforcement learning; Q-Learning
DOI
Not available
Abstract
In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependent setup times and machine–job qualification considerations. To apply the Q-Learning algorithm, we convert the scheduling problem into a reinforcement learning problem by constructing a semi-Markov decision process (SMDP), including the definition of the state representation, the actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove the equivalence of the reward function and the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. The experimental results show that Q-Learning consistently outperforms all five heuristics by a wide margin. Averaged over all test problems, the Q-Learning algorithm improves on WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.
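As a rough illustration of the setup the abstract describes, the following minimal Python sketch shows the mean weighted tardiness objective and a tabular Q-Learning update whose actions are the five heuristic names. It is not the authors' implementation: the function names, the state labels, the reward value, and the parameters `alpha`, `gamma` and `epsilon` are all assumptions made for illustration, the dispatching rules themselves are not implemented, and the update shown is the standard discrete-time form rather than an SMDP-specific variant.

```python
import random
from collections import defaultdict

# Action set named in the abstract. Only the names are used here;
# the dispatching rules themselves are not implemented in this sketch.
ACTIONS = ["WSPT", "WMDD", "WCOVERT", "RATCS", "LFJ-WCOVERT"]


def mean_weighted_tardiness(jobs):
    """Average of w * max(0, C - d) over jobs.

    jobs: iterable of (weight, due_date, completion_time) tuples
    (a hypothetical data layout chosen for this example).
    """
    jobs = list(jobs)
    total = sum(w * max(0.0, c - d) for w, d, c in jobs)
    return total / len(jobs)


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy choice among the heuristic names (assumed policy)."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One-step tabular Q-Learning update in its standard discrete-time form."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return q[(state, action)]


if __name__ == "__main__":
    # Two jobs: the first finishes 2 time units late with weight 2,
    # the second finishes early and contributes no tardiness.
    print(mean_weighted_tardiness([(2, 10, 12), (1, 5, 4)]))  # 2.0

    q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
    rng = random.Random(0)
    action = choose_action(q, "s0", epsilon=0.1, rng=rng)
    # A negative reward stands in for a tardiness penalty (assumed value).
    q_update(q, "s0", action, reward=-1.0, next_state="s1")
```

Using the negative of a tardiness measure as the reward is one simple way to align reward maximisation with tardiness minimisation; the paper's actual reward function and state representation are not given in this record.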
Pages: 968 - 980
Number of pages: 12
Related papers
  • [1] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
    Zhang, Zhicong
    Zheng, Li
    Weng, Michael X.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10) : 968 - 980
  • [2] Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
    Zhang, Zhicong
    Zheng, Li
    Li, Na
    Wang, Weiping
    Zhong, Shouyan
    Hu, Kaishun
    COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (07) : 1315 - 1324
  • [3] Dynamic single machine scheduling using Q-learning agent
    Kong, LF
    Wu, J
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005 : 3237 - 3241
  • [4] Dynamic Parallel Machine Scheduling Using the Learning Agent
    Yuan, Biao
    Wang, Lei
    Jiang, Zhibin
    2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013 : 1565 - 1569
  • [5] Comparisons of metaheuristic algorithms for unrelated parallel machine weighted earliness/tardiness scheduling problems
    Oğuzhan Ahmet Arık
    Evolutionary Intelligence, 2020, 13 : 415 - 425
  • [6] Comparisons of metaheuristic algorithms for unrelated parallel machine weighted earliness/tardiness scheduling problems
    Arik, Oguzhan Ahmet
    EVOLUTIONARY INTELLIGENCE, 2020, 13 (03) : 415 - 425
  • [7] Weighted earliness/tardiness parallel machine scheduling problem with a common due date
    Arik, Oguzhan Ahmet
    Schutten, Marco
    Topan, Engin
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [8] Scheduling unrelated parallel machines to minimize total weighted tardiness
    Na, Dong-Gil
    Kim, Dong-Won
    Jang, Wooseung
    Chen, F. Frank
    2006 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS (SOLI 2006), PROCEEDINGS, 2006 : 758 - +
  • [9] Dynamic parallel machine scheduling with random breakdowns using the learning agent
    Yuan B.
    Jiang Z.
    Wang L.
    Jiang, Zhibin (zbjiang@sjtu.edu.cn), 2016, Inderscience Enterprises Ltd. (08) : 94 - 103
  • [10] Parallel Algorithm with Blocks for a Single-Machine Total Weighted Tardiness Scheduling Problem
    Uchronski, Mariusz
    APPLIED SCIENCES-BASEL, 2021, 11 (05) : 1 - 17