Dynamic scheduling for dual-objective job shop with machine breakdown by reinforcement learning

被引:6
作者
Gan, Xuemei [1 ]
Zuo, Ying [2 ,5 ]
Yang, Guanci [3 ]
Zhang, Ansi [4 ]
Tao, Fei [2 ]
机构
[1] Guizhou Univ, Sch Mech Engn, Guiyang, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[3] Guizhou Univ, Key Lab Adv Mfg Technol, Guiyang, Peoples R China
[4] Guizhou Univ, State Key Lab Publ Big Data, Guiyang, Peoples R China
[5] Beihang Univ, Sch Automat Sci & Elect Engn, New Main Bldg F105,Xueyuan Rd 37, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
Dynamic manufacturing environment; machine breakdown; reinforcement learning; online scheduling framework; robust pro-active scheduling; ROBUST; BENCHMARKS; ALGORITHM;
D O I
10.1177/09544054231167086
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In modern complicated and changing manufacturing environments, unforeseen dynamic events such as machine breakdown or unexpected job arrival make required production resources unpredictable. The scheduling scheme is desired to maintain high stability in dynamic manufacturing environments. To cope with the classic disturbance of machine breakdown, a robust pro-active scheduling scheme is proposed by inserting the repair time into a disjunctive graph for reinforcement learning (IRDRL) in this paper. Firstly, a new mathematical model is developed to predict the machine fault which is assumed to be determined by service time and bearing load. Secondly, a disjunctive graph with breakdown information is designed to express the dynamic scheduling status. Then, an online scheduling framework is built based on the well-trained model through the proximal policy optimization (PPO) algorithm. Finally, compared with the classical methods such as the right-shift strategy and static model of reinforcement learning (RL), the proposed robust pro-active scheduling scheme is verified with high robustness, stability, and short running time.
引用
收藏
页码:3 / 17
页数:15
相关论文
共 53 条