Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

被引:0
|
作者
Zhicong Zhang
Li Zheng
Michael X. Weng
机构
[1] Tsinghua University,Department of Industrial Engineering
[2] University of South Florida,Department of Industrial and Management Systems Engineering
来源
The International Journal of Advanced Manufacturing Technology | 2007年 / 34卷
关键词
Scheduling; Parallel machine; Reinforcement learning; Q-Learning;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependant setup times and machine–job qualification consideration. To apply the Q-Learning algorithm, we convert the scheduling problem into reinforcement learning problems by constructing a semi-Markov decision process (SMDP), including the definition of state representation, actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove the equivalence of the reward function and the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. Experiment results show that Q-Learning always outperforms all heuristics remarkably. Averaged over all test problems, the Q-Learning algorithm achieved performance improvements over WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by considerable amounts of 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.
引用
收藏
页码:968 / 980
页数:12
相关论文
共 50 条
  • [31] Tabu search for scheduling on identical parallel machines to minimize mean tardiness
    Armentano, VA
    Yamashita, DS
    JOURNAL OF INTELLIGENT MANUFACTURING, 2000, 11 (05) : 453 - 460
  • [32] Parallel machine scheduling problem to minimize the earliness/tardiness costs with learning effect and deteriorating jobs
    M. Duran Toksarı
    Ertan Güner
    Journal of Intelligent Manufacturing, 2010, 21 : 843 - 851
  • [33] Parallel machine scheduling problem to minimize the earliness/tardiness costs with learning effect and deteriorating jobs
    Toksari, M. Duran
    Guner, Ertan
    JOURNAL OF INTELLIGENT MANUFACTURING, 2010, 21 (06) : 843 - 851
  • [34] Tabu search for scheduling on identical parallel machines to minimize mean tardiness
    Vinı´cius A. Armentano
    Denise S. Yamashita
    Journal of Intelligent Manufacturing, 2000, 11 : 453 - 460
  • [35] Unrelated parallel machine scheduling with setup consideration and a total weighted completion time objective
    Weng, MX
    Lu, J
    Ren, HY
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2001, 70 (03) : 215 - 226
  • [36] Exact Approaches for Single Machine Total Weighted Tardiness Batch Scheduling
    Pessoa, Artur Alves
    Bulhoes, Teobaldo
    Nesello, Vitor
    Subramanian, Anand
    INFORMS JOURNAL ON COMPUTING, 2022, 34 (03) : 1512 - 1530
  • [37] Scatter search for minimizing weighted tardiness in a single machine scheduling with setups
    Gonzalez, Miguel A.
    Jose Palacios, Juan
    Vela, Camino R.
    Hernandez-Arauzo, Alejandro
    JOURNAL OF HEURISTICS, 2017, 23 (2-3) : 81 - 110
  • [38] Scatter search for minimizing weighted tardiness in a single machine scheduling with setups
    Miguel A. González
    Juan José Palacios
    Camino R. Vela
    Alejandro Hernández-Arauzo
    Journal of Heuristics, 2017, 23 : 81 - 110
  • [39] Extended GRASP for the job shop scheduling problem with total weighted tardiness objective
    Bierwirth, C.
    Kuhpfahl, J.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 261 (03) : 835 - 848
  • [40] The single-machine total weighted tardiness scheduling problem with position-based learning effects
    Yin, Yunqiang
    Wu, Chin-Chia
    Wu, Wen-Hsiang
    Cheng, Shuenn-Ren
    COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (05) : 1109 - 1116