A Reinforcement Learning Method for a Hybrid Flow-Shop Scheduling Problem

被引：33

作者：

Han, Wei ^{[1
]}

Guo, Fang ^{[1
]}

Su, Xichao ^{[1
]}

机构：

[1] Naval Aviat Univ, Dept Airborne Vehicle Engn, Yantai 264001, Peoples R China

来源：

ALGORITHMS | 2019年 / 12卷 / 11期

关键词：

reinforcement learning; hybrid flow-shop scheduling problem; Markov decision processes; sortie scheduling of carrier aircraft; SHOP; OPTIMIZATION; ALGORITHM;

D O I：

10.3390/a12110222

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The scheduling problems in mass production, manufacturing, assembly, synthesis, and transportation, as well as internet services, can partly be attributed to a hybrid flow-shop scheduling problem (HFSP). To solve the problem, a reinforcement learning (RL) method for HFSP is studied for the first time in this paper. HFSP is described and attributed to the Markov Decision Processes (MDP), for which the special states, actions, and reward function are designed. On this basis, the MDP framework is established. The Boltzmann exploration policy is adopted to trade-off the exploration and exploitation during choosing action in RL. Compared with the first-come-first-serve strategy that is frequently adopted when coding in most of the traditional intelligent algorithms, the rule in the RL method is first-come-first-choice, which is more conducive to achieving the global optimal solution. For validation, the RL method is utilized for scheduling in a metal processing workshop of an automobile engine factory. Then, the method is applied to the sortie scheduling of carrier aircraft in continuous dispatch. The results demonstrate that the machining and support scheduling obtained by this RL method are reasonable in result quality, real-time performance and complexity, indicating that this RL method is practical for HFSP.

引用

页数：15

共 41 条

[1] Using ant colony optimization to solve hybrid flow shop scheduling problems [J].

Alaykyran, Kemal ;

Engin, Orhan ;

Doyen, Alper .

INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 35 (5-6) :541-550

[2]

[Anonymous], 2018, P INT C MACH LEARN

[3]

[Anonymous], 2016, ARXIV161101626

[4]

[Anonymous], 2016, DYNAMIC PROGRAMMING

[5] Speeding up a Rollout algorithm for complex parallel machine scheduling [J].

Ciavotta, Michele ;

Meloni, Carlo ;

Pranzo, Marco .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2016, 54 (16) :4993-5009

[6] Scheduling Dispensing and Counting in Secondary Pharmaceutical Manufacturing [J].

Ciavotta, Michele ;

Meloni, Carlo ;

Pranzo, Marco .

AICHE JOURNAL, 2009, 55 (05) :1161-1170

[7]

[傅启明 Fu Qiming], 2014, [计算机学报, Chinese Journal of Computers], V37, P677

[8]

[高阳 GAO Yang], 2007, [计算机学报, Chinese Journal of Computers], V30, P1372

[9]

Gao Yang, 2018, ARXIV180205313

[10]

Gu SX, 2016, PR MACH LEARN RES, V48

← 1 2 3 4 5 →